I do understand that these statements are broad generalisations for what really does occur
though the premise is that a successful choice would be made from wieighting options provided from the scenarios.
As with genetics and other systems the beneficial error scenario which can be described in situations such as a miskeyed note on a keyboard leading to a variation of the sequence that is favourable seems excluded from these scenarios.
Improvisation based on self introduced errors may also be a core to these utilities being able to evolve reason.
Model-based system: Figure out what’s going on, and what actions maximize returns, and do them.
Model-free system: Do the thingy that worked before again!
Pavlovian system: Avoid the unpleasant thing and go to the pleasant thing. Repeat as necessary.
Just some initial thoughts,
I do understand that these statements are broad generalisations for what really does occur though the premise is that a successful choice would be made from wieighting options provided from the scenarios.
As with genetics and other systems the beneficial error scenario which can be described in situations such as a miskeyed note on a keyboard leading to a variation of the sequence that is favourable seems excluded from these scenarios.
Improvisation based on self introduced errors may also be a core to these utilities being able to evolve reason.
Model-based system: Figure out what’s going on, and what actions maximize returns, and do them.
Model-free system: Do the thingy that worked before again!
Pavlovian system: Avoid the unpleasant thing and go to the pleasant thing. Repeat as necessary.