We have a choice here: “solve a complex, value-laden problem” or “undertake cheap preparations so that the agent doesn’t have to deal with these scenarios”. Why not just run the agent in a secure server room where we keep watch over it, shutting it down if it does bad things?