Thanks, this helps a lot. The point about a lot of the work being in the mapping from actions to utilities is well-taken.
This line of thinking was from a vague intuition that it ought to be possible to do what evolution and gradient descent can’t, but do it faster than argmax, and have it be not-impossibly-complicated. But sounds like the way I was thinking about that was not productive though; thanks.
Thanks, this helps a lot. The point about a lot of the work being in the mapping from actions to utilities is well-taken.
This line of thinking was from a vague intuition that it ought to be possible to do what evolution and gradient descent can’t, but do it faster than argmax, and have it be not-impossibly-complicated. But sounds like the way I was thinking about that was not productive though; thanks.