Rolf Nelson suggested that we should make precomitment now that our future friendly superintelligence, if it ever appear, will test all possible evil superintelligneces in multilevel simulations. Therefore any future evil superintelligence will doubt, if it in simulation or not.
hmm spending a bunch of compute simulating agents that try to break out by screwing you up seems like a way to end up with catastrophic inner misalignment to me
Rolf Nelson suggested that we should make precomitment now that our future friendly superintelligence, if it ever appear, will test all possible evil superintelligneces in multilevel simulations. Therefore any future evil superintelligence will doubt, if it in simulation or not.
hmm spending a bunch of compute simulating agents that try to break out by screwing you up seems like a way to end up with catastrophic inner misalignment to me
You can start simulating them when you become Galactic size AI and there is no risk. For acausal timeless deals time doesn’t matter.