edevans comments on You can still fetch the coffee today if you’re dead tomorrow

edevans 23 Jun 2023 21:07 UTC
1 point
0
AF
Could it be useful to have a shutdown-by-default process as follows?
1. When starting the agent include a time value (n seconds), after which it will pause itself
2. After it pauses, deliberate and then either stop moving forward or continue with some new time value
This will allow trading power for safety, as you can make shorter steps forward as the agents become more dangerous, and you don’t need to do everything in the first time period.
- davidad 29 Jun 2023 11:58 UTC
  LW: 3 AF: 1
  0
  AF Parent
  Yes—assuming that the pause interrupts any anticipatory gradient flows from the continuing agent back to the agent which is considering whether to pause.
  
  This pattern is instantiated in the Open Agency Architecture twice:
  1. Step 2 generates top-level agents which are time-bounded at a moderate timescale (~days), with the deliberation about whether to redeploy a top-level agent being carried out by human operators.
  2. In Step 4, the top-level agent dispatches most tasks by deploying narrower low-level agents with much tighter time bounds, with the deliberation about whether to redeploy a low-level agent being automated by the top-level model.