I think Jacob (OP) said "OpenAI is trying to directly build safe AGI" and cited the charter and other statements as evidence of this claim. Then John replied that the charter and other statements are "not much evidence" either for or against this claim, because talk is cheap. I think that's a reasonable point.
Separately, maybe John in fact believes that the charter and other statements are insincere lip service. If so, I would agree with you (Paul) that John’s belief is probably incorrect, based on my very limited knowledge. [Where I disagree with OpenAI, I presume that top leadership is acting sincerely to make a good future with safe AGI, but that they have mistaken beliefs about the hardness of alignment and other topics.]
I was replying to:

"I'd guess that is an overestimate of the number of people actually doing alignment research at OpenAI, as opposed to capabilities research in which people pay lip service to alignment. In particular, all of the RLHF work is basically capabilities work which makes alignment harder in the long term (because it directly selects for deception), while billing itself as 'alignment'."
Thanks, sorry for misunderstanding.