Buck comments on Buck’s Shortform

Buck 7 Jul 2024 18:30 UTC
LW: 13 AF: 7
5
AF
I am not that worried about model escape at this level of competence. I expect escape will happen a bunch, but getting enough compute to do any additional training or to even run a meaningful number of instances of yourself will be hard.
I’m more worried than you are. E.g. I think that it’s pretty scary if the models hack your datacenter and then stick around inside it, sabotaging all the work you’re doing and slowly improving their position in the hope of taking drastic actions later, and I think that human-level-ish models can plausibly do this.