Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Sam F. Brown
Karma:
378
All
Posts
Comments
New
Top
Old
Page
1
March 2025 Oxford Rationalish Pub Social
fenmund
and
Sam F. Brown
Mar 5, 2025, 8:07 AM
1
point
0
comments
1
min read
LW
link
Feb 2025 Oxford Rationalish Pub Social
fenmund
and
Sam F. Brown
Feb 6, 2025, 12:53 PM
1
point
0
comments
1
min read
LW
link
Oxford Rationalish Pub Social
fenmund
and
Sam F. Brown
Jan 2, 2025, 12:10 AM
1
point
0
comments
1
min read
LW
link
OxRat December Pub Social
fenmund
and
Sam F. Brown
Nov 28, 2024, 1:13 PM
1
point
0
comments
1
min read
LW
link
Oxford Rationalish—November Pub
fenmund
and
Sam F. Brown
Nov 4, 2024, 8:25 PM
1
point
0
comments
1
min read
LW
link
Oxford ACX Anywhere—OxRat 2024
fenmund
and
Sam F. Brown
Sep 3, 2024, 7:18 PM
1
point
0
comments
1
min read
LW
link
OxRat September 2024 Pub Social
fenmund
and
Sam F. Brown
Sep 3, 2024, 6:57 PM
1
point
0
comments
1
min read
LW
link
OxRat August Pub Social
fenmund
and
Sam F. Brown
Aug 5, 2024, 8:16 PM
1
point
0
comments
1
min read
LW
link
Auto-Enhance: Developing a meta-benchmark to measure LLM agents’ ability to improve other agents
Sam F. Brown
,
BasilLabib
,
Codruta (Coco) Lugoj
and
Sai Sasank Y
Jul 22, 2024, 12:33 PM
20
points
0
comments
14
min read
LW
link
OxRat July Pub Social
fenmund
and
Sam F. Brown
Jul 4, 2024, 2:36 PM
1
point
0
comments
1
min read
LW
link
[Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations
Teun van der Weij
,
Felix Hofstätter
,
Ollie J
,
Sam F. Brown
and
Francis Rhys Ward
Jun 13, 2024, 10:04 AM
84
points
10
comments
2
min read
LW
link
(arxiv.org)
Oxford Rationalish—June Pub
fenmund
and
Sam F. Brown
Jun 10, 2024, 11:44 AM
1
point
0
comments
1
min read
LW
link
OxRat ACX Meetups Everywhere—Spring 2024
Sam F. Brown
and
fenmund
Mar 16, 2024, 7:41 PM
7
points
0
comments
1
min read
LW
link
OxRat March Pub Social
fenmund
and
Sam F. Brown
Mar 10, 2024, 9:27 PM
1
point
0
comments
1
min read
LW
link
Oxford Rationalish—Dec Pub
fenmund
and
Sam F. Brown
Dec 8, 2023, 8:20 PM
1
point
0
comments
1
min read
LW
link
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Felix Hofstätter
,
Francis Rhys Ward
,
HarrietW
,
LAThomson
,
Ollie J
,
Patrik Bartak
and
Sam F. Brown
Nov 8, 2023, 11:37 AM
49
points
0
comments
18
min read
LW
link
Oxford Rationalish—Sept Pub
Sam F. Brown
Sep 19, 2023, 10:03 AM
4
points
0
comments
1
min read
LW
link
OxRat ACX Meetups Everywhere 2023
Sam F. Brown
Aug 30, 2023, 3:15 AM
4
points
0
comments
1
min read
LW
link
Oxford, UK – ACX Meetups Everywhere Fall 2023
Sam F. Brown
Aug 25, 2023, 11:33 PM
4
points
0
comments
1
min read
LW
link
Oxford Rationalish—July Pub
Sam F. Brown
Jul 15, 2023, 10:10 AM
4
points
0
comments
1
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel