Can you add the key assumptions being made when you say it is safe asymptotically? From skimming, it looked like “assuming the world is an MDP and that a human can recognize which actions lead to catastrophes.”