Mostly I would expect such a system to overfit on the training data, and perform no better than chance when tested.
[...]
Reinforcement learning, meanwhile, will indeed become manipulative (in my expectation).
I’m confused why reinforcement learning would be well suited for the task, if it doesn’t work at all in the supervised learning case.
I’m confused why reinforcement learning would be well suited for the task, if it doesn’t work at all in the supervised learning case.