paulfchristiano (Karma: 27,485)
Matrix completion prize results
paulfchristiano, 20 Dec 2023 15:40 UTC, 41 points, 0 comments, 2 min read (www.alignment.org)
Thoughts on responsible scaling policies and regulation
paulfchristiano, 24 Oct 2023 22:21 UTC, 220 points, 33 comments, 6 min read
Thoughts on sharing information about language model capabilities
paulfchristiano, 31 Jul 2023 16:04 UTC, 208 points, 36 comments, 11 min read
Self-driving car bets
paulfchristiano, 29 Jul 2023 18:10 UTC, 234 points, 43 comments, 5 min read (sideways-view.com)
ARC is hiring theoretical researchers
paulfchristiano, Jacob_Hilton and Mark Xu, 12 Jun 2023 18:50 UTC, 126 points, 12 comments, 4 min read (www.alignment.org)
Prizes for matrix completion problems
paulfchristiano, 3 May 2023 23:30 UTC, 164 points, 52 comments, 1 min read (www.alignment.org)
My views on “doom”
paulfchristiano, 27 Apr 2023 17:50 UTC, 245 points, 35 comments, 2 min read (ai-alignment.com)
Christiano (ARC) and GA (Conjecture) Discuss Alignment Cruxes
Andrea_Miotti, paulfchristiano, Gabriel Alfour and OliviaJ, 24 Feb 2023 23:03 UTC, 61 points, 7 comments, 47 min read
Thoughts on the impact of RLHF research
paulfchristiano, 25 Jan 2023 17:23 UTC, 250 points, 102 comments, 9 min read
Can we efficiently distinguish different mechanisms?
paulfchristiano, 27 Dec 2022 0:20 UTC, 88 points, 30 comments, 16 min read (ai-alignment.com)
Three reasons to cooperate
paulfchristiano, 24 Dec 2022 17:40 UTC, 82 points, 14 comments, 10 min read (sideways-view.com)
Can we efficiently explain model behaviors?
paulfchristiano, 16 Dec 2022 19:40 UTC, 64 points, 3 comments, 9 min read (ai-alignment.com)
AI alignment is distinct from its near-term applications
paulfchristiano, 13 Dec 2022 7:10 UTC, 255 points, 21 comments, 2 min read (ai-alignment.com)
Finding gliders in the game of life
paulfchristiano, 1 Dec 2022 20:40 UTC, 101 points, 7 comments, 16 min read (ai-alignment.com)
Mechanistic anomaly detection and ELK
paulfchristiano, 25 Nov 2022 18:50 UTC, 133 points, 22 comments, 21 min read (ai-alignment.com)
Decision theory and dynamic inconsistency
paulfchristiano, 3 Jul 2022 22:20 UTC, 80 points, 33 comments, 10 min read (sideways-view.com)
AI-Written Critiques Help Humans Notice Flaws
paulfchristiano, 25 Jun 2022 17:22 UTC, 137 points, 5 comments, 3 min read (openai.com)
Where I agree and disagree with Eliezer
paulfchristiano, 19 Jun 2022 19:15 UTC, 888 points, 220 comments, 18 min read, 2 reviews
What is causality to an evidential decision theorist?
paulfchristiano, 17 Apr 2022 16:00 UTC, 45 points, 26 comments, 5 min read (sideways-view.com)
ELK prize results
paulfchristiano and Mark Xu, 9 Mar 2022 0:01 UTC, 138 points, 50 comments, 21 min read