Daniel Kokotajlo comments on AI Timelines

Daniel Kokotajlo 7 Jan 2025 22:56 UTC
2 points
0
lol what? Can you compile/summarize a list of examples of AI agents running amok in your personal experience? To what extent was it an alignment problem vs. a capabilities problem?
- Tao Lin 8 Jan 2025 0:36 UTC
  7 points
  0
  Parent
  not running amock, just not reliably following instructions “only modify files in this folder” or “don’t install pip packages”. Claude follows instructions correctly, some other models are mode collapsed into a certain way of doing things, eg gpt-4o always thinks it’s running python in chatgpt code interpreter and you need very strong prompting to make it behave in a way specific to your computer
  - Tao Lin 8 Jan 2025 0:43 UTC
    4 points
    0
    Parent
    a hypothetical typical example would be it tries to use the file /usr/bin/python because it’s memorized that that’s the path to python, that fails, then it concludes it must create that folder which would require sudo permissions, if it can it could potentially mess something