Would a (small) energy cost make most RL agents look less stupid without sacrificing effectiveness?
It might discourage exploration and lead to more stasis in local optimums.
Would a (small) energy cost make most RL agents look less stupid without sacrificing effectiveness?
It might discourage exploration and lead to more stasis in local optimums.