Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
paulfchristiano
Karma:
27,900
All
Posts
Comments
New
Top
Old
Page
3
Teaching ML to answer questions honestly instead of predicting human answers
paulfchristiano
May 28, 2021, 5:30 PM
53
points
18
comments
16
min read
LW
link
(ai-alignment.com)
Decoupling deliberation from competition
paulfchristiano
May 25, 2021, 6:50 PM
84
points
17
comments
9
min read
LW
link
1
review
(ai-alignment.com)
Mundane solutions to exotic problems
paulfchristiano
May 4, 2021, 6:20 PM
56
points
8
comments
5
min read
LW
link
(ai-alignment.com)
Low-stakes alignment
paulfchristiano
Apr 30, 2021, 12:10 AM
87
points
11
comments
7
min read
LW
link
1
review
(ai-alignment.com)
AMA: Paul Christiano, alignment researcher
paulfchristiano
Apr 28, 2021, 6:55 PM
117
points
197
comments
1
min read
LW
link
Announcing the Alignment Research Center
paulfchristiano
Apr 26, 2021, 11:30 PM
178
points
6
comments
1
min read
LW
link
(ai-alignment.com)
Another (outer) alignment failure story
paulfchristiano
Apr 7, 2021, 8:12 PM
249
points
38
comments
12
min read
LW
link
1
review
My research methodology
paulfchristiano
Mar 22, 2021, 9:20 PM
159
points
38
comments
16
min read
LW
link
1
review
(ai-alignment.com)
Demand offsetting
paulfchristiano
Mar 21, 2021, 6:20 PM
133
points
41
comments
5
min read
LW
link
(sideways-view.com)
It’s not economically inefficient for a UBI to reduce recipient’s employment
paulfchristiano
Nov 22, 2020, 4:40 PM
93
points
60
comments
4
min read
LW
link
(sideways-view.com)
Hiring engineers and researchers to help align GPT-3
paulfchristiano
Oct 1, 2020, 6:54 PM
206
points
13
comments
3
min read
LW
link
“Unsupervised” translation as an (intent) alignment problem
paulfchristiano
Sep 30, 2020, 12:50 AM
62
points
15
comments
4
min read
LW
link
(ai-alignment.com)
Distributed public goods provision
paulfchristiano
Sep 26, 2020, 9:20 PM
27
points
3
comments
5
min read
LW
link
(sideways-view.com)
Better priors as a safety problem
paulfchristiano
Jul 5, 2020, 9:20 PM
66
points
7
comments
5
min read
LW
link
(ai-alignment.com)
Learning the prior
paulfchristiano
Jul 5, 2020, 9:00 PM
92
points
28
comments
8
min read
LW
link
(ai-alignment.com)
Inaccessible information
paulfchristiano
Jun 3, 2020, 5:10 AM
83
points
17
comments
14
min read
LW
link
2
reviews
(ai-alignment.com)
Writeup: Progress on AI Safety via Debate
Beth Barnes
and
paulfchristiano
Feb 5, 2020, 9:04 PM
103
points
18
comments
33
min read
LW
link
Hedonic asymmetries
paulfchristiano
Jan 26, 2020, 2:10 AM
98
points
22
comments
2
min read
LW
link
(sideways-view.com)
Moral public goods
paulfchristiano
Jan 26, 2020, 12:10 AM
147
points
74
comments
4
min read
LW
link
(sideways-view.com)
Of arguments and wagers
paulfchristiano
Jan 10, 2020, 10:20 PM
52
points
6
comments
6
min read
LW
link
(ai-alignment.com)
Back to first
Previous
Back to top
Next