You can substitute “utility” for “reward”, if you prefer. Reinforcement learning is a fairly general framework, except for its insistence on a scalar reward signal. If you talk to RL folk about the need for multiple reward signals, they say that sticking that information in the sensory channels is mathematically equivalent—which is kinda true.
I don’t endorse Legg’s formalization because it is limited to reinforcement learning agents.
That’s a good reason, and you should make that explicit.
Good point.
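(A minimal sketch of the "stick it in the sensory channels" move gestured at above, assuming a toy environment interface invented here for illustration — `MultiRewardEnv`, its `step` method, and the weights `w` are not from any real library. The reward vector is appended to the observation so the agent can see every component, while the learner itself only ever receives the scalar aggregate.)

```python
import numpy as np

class MultiRewardEnv:
    """Hypothetical toy environment that emits a VECTOR of reward components."""
    def reset(self):
        return np.zeros(4)                         # initial observation
    def step(self, action):
        obs = np.random.randn(4)
        reward_vec = np.array([1.0, -0.2, 0.5])    # e.g. task, energy, safety terms
        done = False
        return obs, reward_vec, done

class ScalarizedWrapper:
    """Folds the reward vector into the observation; exposes only a scalar reward.

    The agent still observes every reward component (via its sensory channel),
    but the RL machinery only ever optimizes the single scalar w . r.
    """
    def __init__(self, env, weights):
        self.env = env
        self.w = np.asarray(weights, dtype=float)

    def reset(self):
        obs = self.env.reset()
        # Pad with zeros where the reward components will appear after step().
        return np.concatenate([obs, np.zeros_like(self.w)])

    def step(self, action):
        obs, reward_vec, done = self.env.step(action)
        scalar_reward = float(self.w @ reward_vec)   # what the learner optimizes
        obs = np.concatenate([obs, reward_vec])      # what the learner observes
        return obs, scalar_reward, done

env = ScalarizedWrapper(MultiRewardEnv(), weights=[1.0, 1.0, 1.0])
obs = env.reset()
obs, r, done = env.step(action=0)
```

One way to read the "kinda": the information survives the transformation, but the optimization target is still whatever the fixed scalarization `w` says it is, so the agent's preferences over the components are baked in rather than learned or represented separately.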