I think an important crux here is whether you think we can build institutions that are reasonably good at checking the quality of AI safety work done by humans.
Why is this an important crux? Is it necessarily the case that if we can reliably check AI safety work done by humans, we can reliably check AI safety work done by AIs which may be optimising against us?
It’s not necessarily the case. But in practice this tends to be a key line of disagreement.