aogara comments on o1 is a bad idea

aogara 14 Nov 2024 14:08 UTC
2 points
0
Wouldn’t that conflict with the quote? (Though maybe they’re not doing what they’ve implied in the quote)
- RohanS 15 Nov 2024 20:54 UTC
  5 points
  2
  Parent
  My best guess is that there was process supervision for capabilities but not for safety. i.e. training to make the CoT useful for solving problems, but not for “policy compliance or user preferences.” This way they make it useful, and they don’t incentivize it to hide dangerous thoughts. I’m not confident about this though.