The idea is that all of this would be learned—if the optimization machinery is entirely internal to the system, the system can choose how to use that machinery arbitrarily. We talk briefly about systems where the optimization is hard-coded, but those aren’t mesa-optimizers. Rather, we’re interested in situations where the learned algorithm itself performs optimization internal to its own workings—optimization it could reuse for prediction, or vice versa.
It sounds like there was a misunderstanding somewhere—I’m aware that all of this would be learned; my point is that the learned policy contains an optimizer rather than being an optimizer, which seems like a significant point, and your original definition sounded like you wanted the learned policy to be an optimizer.