wangscarpet comments on Jitters No Evidence of Stupidity in RL

wangscarpet 17 Sep 2021 11:50 UTC
12 points
In continuous control problems, what you’re describing is called “bang-bang control”: switching between different full-strength actions. In continuous-time systems this is often optimal behavior, because over a short timescale you get the same effect from a double-strength action applied for half as long. Once you factor in non-linear energy costs, though, a smoother controller becomes preferred.
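A rough sketch of the arithmetic behind that claim, under assumptions that are mine rather than anything specific to the post: the action is held constant, its effect is just strength times duration (a pure integrator), and the energy cost is |u|^power integrated over time. The names `effect` and `energy` are made up for illustration.

```python
def effect(strength, duration):
    """Net effect of a constant action: strength integrated over time."""
    return strength * duration


def energy(strength, duration, power=2):
    """Energy cost of a constant action, assuming cost rate |u| ** power."""
    return abs(strength) ** power * duration


T = 0.1                # a short timescale (arbitrary choice)
smooth = (1.0, T)      # single-strength action for the full interval
bang = (2.0, T / 2)    # double-strength action for half as long

# Both actions produce the same net effect.
print(effect(*smooth), effect(*bang))

# With a linear cost (power=1) the two are tied.
print(energy(*smooth, power=1), energy(*bang, power=1))

# With a quadratic cost (power=2) the full-strength burst costs twice as much.
print(energy(*smooth, power=2), energy(*bang, power=2))
```

With a linear cost the controller has no reason to prefer the gentler action, so jumping between extremes loses nothing. Under the quadratic cost assumed here, the burst is strictly more expensive for the same effect, which is where the preference for a smoother controller comes from.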
- Caridorc Tergilti 19 Sep 2021 17:52 UTC
  2 points
  Half as long right?
  - wangscarpet 19 Sep 2021 20:16 UTC
    1 point
    Thanks, fixed.