paulfchristiano comments on Richard Ngo’s Shortform

paulfchristiano 5 Jan 2023 18:47 UTC
2 points
Because it’s easy to detect and correct (except that correcting it might push you into one of the other regimes).
- Tom Davidson 7 Jan 2023 0:12 UTC
  1 point
  Parent
  So far causally upstream of the human evaluator’s opinion? Eg an AI counselor optimizing for getting to know you