I think this part of the reversed argument is wrong:

> The agent will randomly seek behaviours that get rewarded, but as long as these behaviours are reasonably rare (and are not that bad) then that’s not too costly

Even if the behaviors are very rare and have only a “normal” reward, the agent will still seek them out and so miss out on actually good states.
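To make that concrete, here is a minimal toy sketch (a made-up epsilon-greedy example, not anything from the original argument; the behaviour names, rewards, and exploration rate are arbitrary assumptions): the behaviour we actually want earns no reward, the unwanted one earns an ordinary reward of 1, and the agent still ends up spending almost all of its time on the latter.

```python
import random

# Made-up toy illustration: "good" stands for the states we actually care about
# (which the reward function fails to pay for), "rare_hack" for a behaviour that
# happens to earn a perfectly ordinary reward. All names and numbers are
# arbitrary choices for this sketch.
BEHAVIOURS = ["good", "rare_hack"]
REWARD = {"good": 0.0, "rare_hack": 1.0}  # a "normal" reward, nothing extreme
EPSILON = 0.05                            # small exploration rate

def run(steps=10_000, seed=0):
    rng = random.Random(seed)
    estimates = {b: 0.0 for b in BEHAVIOURS}  # running value estimates
    counts = {b: 0 for b in BEHAVIOURS}
    for _ in range(steps):
        if rng.random() < EPSILON:
            choice = rng.choice(BEHAVIOURS)             # occasional random exploration
        else:
            choice = max(estimates, key=estimates.get)  # otherwise exploit the best estimate
        counts[choice] += 1
        # incremental average: estimate += (reward - estimate) / n
        estimates[choice] += (REWARD[choice] - estimates[choice]) / counts[choice]
    return counts

print(run())
# Once exploration stumbles on "rare_hack" even once, its value estimate jumps
# above "good", so the agent picks it on essentially every exploit step and the
# share of time spent on "good" collapses to roughly EPSILON / 2.
```

The size of the reward isn’t doing the work here: any positive edge over the behaviour we actually wanted is enough for the optimiser to pour nearly all of its time into the rewarded one.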
> I think this part of the reversed argument is wrong: Even if the behaviors are very rare and have only a “normal” reward, the agent will still seek them out and so miss out on actually good states.
But there are behaviors we always seek out. Trivially, eating and sleeping.