Would this work for highly non-monotonic utility functions?
It would work at least as well as the original proposal, because your utility function could just be whatever metric of “getting closer to the target states” would be used in the original proposal.
Would this work for highly non-monotonic utility functions?
It would work at least as well as the original proposal, because your utility function could just be whatever metric of “getting closer to the target states” would be used in the original proposal.