Chris_Leong comments on Experiment Idea: RL Agents Evading Learned Shutdownability

Chris_Leong 18 Jan 2023 1:22 UTC
3 points
0
I don’t need the experiments to persuade me, so I’m not in the target audience. I’m trying to think about what people who are more critical might want to see.
- Leon Lang 18 Jan 2023 1:31 UTC
  3 points
  0
  Parent
  Okay, that’s fair. I agree, if we could show that the experiments remain stable even when longer strings of reasoning are required, then the experiments seem more convincing. There might be the added benefit that one can then vary the setting in more ways to demonstrate that the reasoning caused the agent to act in a particular way, instead of the actions just being some kind of coincidence.