I think the most fundamental thing might be taking in a sequences of bits (or distribution over sequences if you think it’s important to be analog) and outputting bits (or, again, distributions) that happen to control actions.
All this talk about taking causal models as an input is merely a useful abstraction of what happens when we do sequence prediction in our causal universe, and it might always be possible to find some plausible excuse to violate this abstraction.
I think the most fundamental thing might be taking in a sequences of bits (or distribution over sequences if you think it’s important to be analog) and outputting bits (or, again, distributions) that happen to control actions.
All this talk about taking causal models as an input is merely a useful abstraction of what happens when we do sequence prediction in our causal universe, and it might always be possible to find some plausible excuse to violate this abstraction.