paulfchristiano

Karma: 27,900

Teaching ML to answer questions honestly instead of predicting human answers

paulfchristianoMay 28, 2021, 5:30 PM

53 points

18 comments16 min readLW link

(ai-alignment.com)

Decoupling deliberation from competition

paulfchristianoMay 25, 2021, 6:50 PM

84 points

17 comments9 min readLW link 1 review

(ai-alignment.com)

Mundane solutions to exotic problems

paulfchristianoMay 4, 2021, 6:20 PM

56 points

8 comments5 min readLW link

(ai-alignment.com)

Low-stakes alignment

paulfchristianoApr 30, 2021, 12:10 AM

87 points

11 comments7 min readLW link 1 review

(ai-alignment.com)

AMA: Paul Christiano, alignment researcher

paulfchristianoApr 28, 2021, 6:55 PM

117 points

197 comments1 min readLW link

Announcing the Alignment Research Center

paulfchristianoApr 26, 2021, 11:30 PM

178 points

6 comments1 min readLW link

(ai-alignment.com)

Another (outer) alignment failure story

paulfchristianoApr 7, 2021, 8:12 PM

249 points

38 comments12 min readLW link 1 review

My research methodology

paulfchristianoMar 22, 2021, 9:20 PM

159 points

38 comments16 min readLW link 1 review

(ai-alignment.com)

Demand offsetting

paulfchristianoMar 21, 2021, 6:20 PM

133 points

41 comments5 min readLW link

(sideways-view.com)

It’s not economically inefficient for a UBI to reduce recipient’s employment

paulfchristianoNov 22, 2020, 4:40 PM

93 points

60 comments4 min readLW link

(sideways-view.com)

Hiring engineers and researchers to help align GPT-3

paulfchristianoOct 1, 2020, 6:54 PM

206 points

13 comments3 min readLW link

“Unsupervised” translation as an (intent) alignment problem

paulfchristianoSep 30, 2020, 12:50 AM

62 points

15 comments4 min readLW link

(ai-alignment.com)

Distributed public goods provision

paulfchristianoSep 26, 2020, 9:20 PM

27 points

3 comments5 min readLW link

(sideways-view.com)

Better priors as a safety problem

paulfchristianoJul 5, 2020, 9:20 PM

66 points

7 comments5 min readLW link

(ai-alignment.com)

Learning the prior

paulfchristianoJul 5, 2020, 9:00 PM

92 points

28 comments8 min readLW link

(ai-alignment.com)

Inaccessible information

paulfchristianoJun 3, 2020, 5:10 AM

83 points

17 comments14 min readLW link 2 reviews

(ai-alignment.com)

Writeup: Progress on AI Safety via Debate

Beth Barnes and paulfchristiano

Feb 5, 2020, 9:04 PM

103 points

18 comments33 min readLW link

Hedonic asymmetries

paulfchristianoJan 26, 2020, 2:10 AM

98 points

22 comments2 min readLW link

(sideways-view.com)

Moral public goods

paulfchristianoJan 26, 2020, 12:10 AM

147 points

74 comments4 min readLW link

(sideways-view.com)

Of arguments and wagers

paulfchristianoJan 10, 2020, 10:20 PM

52 points

6 comments6 min readLW link

(ai-alignment.com)