Once again, props to OAI for putting this in the system card. Also, once again, it’s difficult to sort out “we told it to do a bad thing and it obeyed” from “we told it to do a good thing and it did a bad thing instead,” but these experiments do seem like important information.
Please clarify: do you mean difficult for us as readers because OpenAI is obfuscating the situation for its own purposes, or difficult for them because it’s genuinely unclear how to classify this?
Ah, sorry, I meant it’s genuinely unclear how to classify this.