HoldenKarnofsky comments on Why Not Just Outsource Alignment Research To An AI?

HoldenKarnofsky 18 Mar 2023 5:18 UTC
LW: 10 AF: 8
7
AF
I don’t agree with this characterization, at least for myself. I think people should be doing object-level alignment research now, partly (maybe mostly?) to be in better position to automate it later. I expect alignment researchers to be central to automation attempts.
It seems to me like the basic equation is something like: “If today’s alignment researchers would be able to succeed given a lot more time, then they also are reasonably likely to succeed given access to a lot of human-level-ish AIs.” There are reasons this could fail (perhaps future alignment research will require major adaptations and different skills such that today’s top alignment researchers will be unable to assess it; perhaps there are parallelization issues, though AIs can give significant serial speedup), but the argument in this post seems far from a knockdown.
Also, it seems worth noting that non-experts work productively with experts all the time. There are lots of shortcomings and failure modes, but the video is a parody.
- johnswentworth 18 Mar 2023 16:49 UTC
  LW: 4 AF: 3
  2
  AF Parent
  I don’t agree with this characterization, at least for myself. I think people should be doing object-level alignment research now, partly (maybe mostly?) to be in better position to automate it later.
  Indeed, I think you’re a good role model in this regard and hope more people will follow your example.