Why would it hack itself to think it’s getting paperclips if it’s originally programmed to want real paperclips? It would not be incentivized to make that hack because that hack would make it NOT get paperclips.
As I said though, how do you program it to want REAL paperclips as opposed to just perceiving that it is getting paperclips.
Why would it hack itself to think it’s getting paperclips if it’s originally programmed to want real paperclips? It would not be incentivized to make that hack because that hack would make it NOT get paperclips.
As I said though, how do you program it to want REAL paperclips as opposed to just perceiving that it is getting paperclips.