How would an AI be directed without using a reward function? Are there some examples I can read?
Current AIs are mostly not explicit expected-utility-maximizers. I think this is illustrated by RLHF (https://huggingface.co/blog/rlhf).
But isn’t that also using a reward function? The AI is trying to maximise the reward it receives from the Reward Model, which was itself trained using human feedback.
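For concreteness, here is a minimal, hypothetical sketch of that two-stage structure: a reward model is fit to human pairwise preferences, and the policy is then updated to score highly under it. Everything below is toy (random vectors stand in for text, and a REINFORCE update stands in for the PPO fine-tuning that real RLHF pipelines use); it is only meant to show where the learned reward function sits in the loop.

```python
# Toy sketch of the RLHF structure: (1) train a reward model on human
# pairwise comparisons, (2) optimise a policy against that learned reward.
# All data and names here are hypothetical stand-ins, not a real pipeline.

import torch
import torch.nn as nn

torch.manual_seed(0)

DIM = 16        # toy "embedding" size standing in for a response representation
N_PREFS = 256   # number of human preference comparisons

# --- 1. Reward model trained on human comparisons (Bradley-Terry style loss) ---
reward_model = nn.Sequential(nn.Linear(DIM, 32), nn.Tanh(), nn.Linear(32, 1))
rm_opt = torch.optim.Adam(reward_model.parameters(), lr=1e-2)

# Fake preference data: "chosen" responses lean along a hidden direction,
# "rejected" responses lean the other way.
hidden_direction = torch.randn(DIM)
chosen = torch.randn(N_PREFS, DIM) + 0.5 * hidden_direction
rejected = torch.randn(N_PREFS, DIM) - 0.5 * hidden_direction

for _ in range(200):
    rm_opt.zero_grad()
    # Encourage r(chosen) > r(rejected), as in preference-based reward modelling.
    margin = reward_model(chosen) - reward_model(rejected)
    loss = -torch.nn.functional.logsigmoid(margin).mean()
    loss.backward()
    rm_opt.step()

# --- 2. Optimise a toy "policy" to maximise the learned reward ---
# The policy here is just the mean of a Gaussian over response representations,
# updated with REINFORCE; real RLHF uses PPO over token sequences instead.
policy_mean = torch.zeros(DIM, requires_grad=True)
pi_opt = torch.optim.Adam([policy_mean], lr=5e-2)

for step in range(300):
    pi_opt.zero_grad()
    dist = torch.distributions.Normal(policy_mean, 1.0)
    samples = dist.sample((64,))                       # "responses" from the policy
    rewards = reward_model(samples).squeeze(-1).detach()
    log_probs = dist.log_prob(samples).sum(-1)
    # REINFORCE: raise the log-prob of samples the reward model scores highly.
    pg_loss = -(log_probs * (rewards - rewards.mean())).mean()
    pg_loss.backward()
    pi_opt.step()

print("reward of policy mean:", reward_model(policy_mean.detach().unsqueeze(0)).item())
```

The point of the sketch is just that the quantity being maximised in step 2 is the output of a learned reward model, not a hand-written utility function, which is what the question above is getting at.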