Yeah, this is a big problem I have with alignment people. They forget that if we don’t have iteration, we don’t solve the problem, so all efforts should focus on making things parallelizable. It’s a problem I had with MIRI’s early work, and today we need to set ourselves up for much more empirical evidence. This could be a reason to support capabilities advances.
They argue there is some unknown point of capabilities at which the system explodes and we all die.
If those are the rules of the universe we happen to find ourselves in, though, there probably is no winning anyway. Sort of like how, if the laws of physics had been slightly different, the first nuclear test would have ignited the atmosphere.
Were atmospheric gas fissionable, things would be very different.
It’s a very similar criticality argument. Early AGIs that try bad stuff may “quench” because the world lacks a sufficient supply of easily, remotely hackable nanoforges and fleets of armed killer robots ready to deploy. So they instead steal a few bitcoins, kill a few people, and then get caught and shut down.
If instead the AGI finds an exploit that gets it to criticality, then we all die. I am concerned the AGI might create a cult of personality or a religion and get support from large numbers of gullible humans. These humans, despite the AGI openly killing people and acting completely selfishly, might give it the resources to develop a way to kill us all.
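Purely to illustrate the quench-versus-runaway intuition (not as a model of actual AGI dynamics, and with made-up numbers), here is a minimal branching-process sketch: each takeover attempt spawns follow-on attempts with mean k, and whether k sits below or above 1 largely determines if the process dies out or explodes.

```python
import random


def branching_trial(k, max_generations=60, cap=10_000, seed=None):
    """Toy branching process for the quench-vs-criticality analogy.

    Each active attempt spawns 0, 1, or 2 follow-on attempts with mean k
    (requires 0 <= k <= 2). If k < 1 the process almost surely quenches;
    if k > 1 there is a real chance it runs away. Illustrative only.
    """
    rng = random.Random(seed)
    p = k / 2.0  # chance of each of the two potential offspring appearing
    active = 1
    for generation in range(max_generations):
        if active == 0:
            return "quenched", generation
        if active > cap:
            return "runaway", generation
        # Each active attempt independently spawns its offspring.
        active = sum(
            (rng.random() < p) + (rng.random() < p) for _ in range(active)
        )
    return "undecided", max_generations


if __name__ == "__main__":
    for k in (0.7, 1.3):
        outcomes = [branching_trial(k, seed=i)[0] for i in range(100)]
        print(f"k={k}: {outcomes.count('quenched')} quenched, "
              f"{outcomes.count('runaway')} runaway out of 100 trials")
```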