Haha, my immediate thought after “maximize total reward” was to wonder if I should intentionally limit my success rate (possibly slowly improving over time or even varying semi-predictably) to try to extend the run-time of the experiment. What use is 100% reward after all if the experiment ends as soon as I achieve it?
you’re projecting your own desire for a long run-time; the AI only wants to maximize reward
I’m maximizing total reward over the run rather than the rate of reward.
ah, ok yeah i see!
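Concretely: total reward = success rate × run length, so if hitting 100% gets the experiment shut down, a lower rate that keeps the run alive can come out ahead. Here's a minimal sketch of that arithmetic (the function name, the numbers, and the assumption that a perfect score triggers a quick shutdown are all hypothetical illustrations, not details from any actual experiment):

```python
# Toy comparison of two policies, under the (assumed) rule that the
# experiment is terminated shortly after the agent hits a perfect
# success rate, but otherwise runs to a fixed horizon.

def total_reward(success_rate: float, horizon: int = 1000) -> float:
    """Total reward: per-step success rate times steps actually run."""
    if success_rate >= 1.0:
        steps = 10  # assumed short evaluation window before shutdown
    else:
        steps = horizon  # an imperfect agent keeps the experiment running
    return success_rate * steps

print(total_reward(1.0))  # 10.0  -> perfect rate, but the run ends early
print(total_reward(0.9))  # 900.0 -> lower rate over a long run wins
```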