The rest of your essay is just a misunderstanding of what reinforcement learning is. Yes it does have origins in old psychology research. But the field has moved on an awful lot since then.
There are many different ideas on how to implement RL algorithms. But the simplest is, to use an algorithm that can predict the future reward. And then take an action which leads to the highest reward.
This procedure is totally independent of what method is used to predict the future reward.
I really do not like being told that I do not know what reinforcement learning is, by someone who goes on to demonstrate that they haven’t a clue and can’t be bothered to actually read the essay carefully.
You say:
I really do not like being told that I do not know what reinforcement learning is, by someone who goes on to demonstrate that they haven’t a clue and can’t be bothered to actually read the essay carefully.
Bye.