Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
paulfchristiano
Karma:
27,791
All
Posts
Comments
New
Top
Old
Page
1
Matrix completion prize results
paulfchristiano
Dec 20, 2023, 3:40 PM
41
points
0
comments
2
min read
LW
link
(www.alignment.org)
Thoughts on responsible scaling policies and regulation
paulfchristiano
Oct 24, 2023, 10:21 PM
220
points
33
comments
6
min read
LW
link
Thoughts on sharing information about language model capabilities
paulfchristiano
Jul 31, 2023, 4:04 PM
210
points
44
comments
11
min read
LW
link
1
review
Self-driving car bets
paulfchristiano
Jul 29, 2023, 6:10 PM
235
points
44
comments
5
min read
LW
link
(sideways-view.com)
ARC is hiring theoretical researchers
paulfchristiano
,
Jacob_Hilton
and
Mark Xu
Jun 12, 2023, 6:50 PM
126
points
12
comments
4
min read
LW
link
(www.alignment.org)
Prizes for matrix completion problems
paulfchristiano
May 3, 2023, 11:30 PM
164
points
52
comments
1
min read
LW
link
(www.alignment.org)
My views on “doom”
paulfchristiano
Apr 27, 2023, 5:50 PM
250
points
37
comments
2
min read
LW
link
1
review
(ai-alignment.com)
Christiano (ARC) and GA (Conjecture) Discuss Alignment Cruxes
Andrea_Miotti
,
paulfchristiano
,
Gabriel Alfour
and
OliviaJ
Feb 24, 2023, 11:03 PM
61
points
7
comments
47
min read
LW
link
Thoughts on the impact of RLHF research
paulfchristiano
Jan 25, 2023, 5:23 PM
252
points
102
comments
9
min read
LW
link
Can we efficiently distinguish different mechanisms?
paulfchristiano
Dec 27, 2022, 12:20 AM
88
points
30
comments
16
min read
LW
link
(ai-alignment.com)
Three reasons to cooperate
paulfchristiano
Dec 24, 2022, 5:40 PM
82
points
14
comments
10
min read
LW
link
(sideways-view.com)
Can we efficiently explain model behaviors?
paulfchristiano
Dec 16, 2022, 7:40 PM
64
points
3
comments
9
min read
LW
link
(ai-alignment.com)
AI alignment is distinct from its near-term applications
paulfchristiano
Dec 13, 2022, 7:10 AM
255
points
21
comments
2
min read
LW
link
(ai-alignment.com)
Finding gliders in the game of life
paulfchristiano
Dec 1, 2022, 8:40 PM
101
points
8
comments
16
min read
LW
link
(ai-alignment.com)
Mechanistic anomaly detection and ELK
paulfchristiano
Nov 25, 2022, 6:50 PM
134
points
22
comments
21
min read
LW
link
(ai-alignment.com)
Decision theory and dynamic inconsistency
paulfchristiano
Jul 3, 2022, 10:20 PM
80
points
33
comments
10
min read
LW
link
(sideways-view.com)
AI-Written Critiques Help Humans Notice Flaws
paulfchristiano
Jun 25, 2022, 5:22 PM
137
points
5
comments
3
min read
LW
link
(openai.com)
Where I agree and disagree with Eliezer
paulfchristiano
Jun 19, 2022, 7:15 PM
898
points
223
comments
18
min read
LW
link
2
reviews
What is causality to an evidential decision theorist?
paulfchristiano
Apr 17, 2022, 4:00 PM
45
points
26
comments
5
min read
LW
link
(sideways-view.com)
ELK prize results
paulfchristiano
and
Mark Xu
Mar 9, 2022, 12:01 AM
138
points
50
comments
21
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel