Chris_Leong comments on Intuitions about goal-directed behavior

Chris_Leong 3 Dec 2018 14:14 UTC
2 points
“I find this concept most useful when thinking about the problem of inner optimizers, where in the course of optimization through a rich space you stumble across a member of the space that is itself doing optimization, but for a related but still misspecified metric.”—Could you clarify what kind of algorithm you are imagining being run?
- Rohin Shah 3 Dec 2018 14:26 UTC
  4 points
  Parent
  I could imagine this happening with standard deep RL over a long enough time horizon with enough compute. Again though, I want to defer to the upcoming sequence on the topic, which should have a good in-depth explanation.