paulfchristiano comments on Common misconceptions about OpenAI

paulfchristiano 28 Aug 2022 1:10 UTC
LW: 4 AF: 4
I was replying to:
I’d guess that is an overestimate of the number of people actually doing alignment research at OpenAI, as opposed to capabilities research in which people pay lip service to alignment. In particular, all of the RLHF work is basically capabilities work which makes alignment harder in the long term (because it directly selects for deception), while billing itself as “alignment”.
- Steven Byrnes 28 Aug 2022 2:30 UTC
  2 points
  Thanks, sorry for misunderstanding.