Current AIs are mostly not explicit expected-utility-maximizers. I think this is illustrated by RLHF (https://huggingface.co/blog/rlhf).
But isn’t that also using a reward function? The AI is trying to maximise the reward it receives from the Reward Model, which was itself trained using human feedback.
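To make the question concrete, here is a minimal, hypothetical sketch of that training signal (not Hugging Face’s actual code): a frozen reward model, assumed to have already been trained on human preference data, scores the policy’s outputs, and the policy is updated to make higher-scoring outputs more likely. Real RLHF pipelines typically use PPO with a KL penalty against a reference model rather than the bare REINFORCE-style update shown here.

```python
# Hypothetical toy sketch of the RLHF reward signal, not a real implementation.
import torch
import torch.nn as nn

vocab_size = 8                              # toy "vocabulary" of possible responses
policy = nn.Linear(1, vocab_size)           # toy policy: produces logits over responses
reward_model = nn.Linear(vocab_size, 1)     # toy reward model (assumed pre-trained on human feedback)
for p in reward_model.parameters():
    p.requires_grad_(False)                 # reward model stays frozen during the RL phase

optimizer = torch.optim.Adam(policy.parameters(), lr=1e-2)

for step in range(100):
    logits = policy(torch.ones(1, 1))                   # policy proposes a distribution over responses
    dist = torch.distributions.Categorical(logits=logits)
    action = dist.sample()                              # sample one response
    one_hot = nn.functional.one_hot(action, vocab_size).float()
    reward = reward_model(one_hot).squeeze()            # reward model scores the sampled response
    loss = -dist.log_prob(action) * reward.detach()     # push probability toward high-reward responses
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

The point of the sketch is that the policy never sees human judgments directly; it only ever optimises the score produced by the learned reward model.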