David Johnston comments on GPTs are Predictors, not Imitators

David Johnston 10 Apr 2023 0:45 UTC
2 points
I was just trying to clarify the limits of autoregressive learning relative to other learning methods. Autoregressive learning is at an apparent disadvantage if $P(X_t \mid X_{t-1})$ is hard to compute while the reverse conditional, $P(X_{t-1} \mid X_t)$, is easy to compute and low entropy. It can “make up for this” somewhat if it can do a good job of predicting $X_t$ from $X_{t-2}$, but it’s still at a disadvantage if, for example, that prediction is relatively high entropy compared to predicting $X_{t-1}$ from $X_t$. That’s it, I’m satisfied.
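To make the asymmetry concrete, here is a toy sketch of my own rather than anything from the original post. It assumes a hypothetical three-step sequence: $X_{t-2}$ is a coarse hint (the product mod 4), $X_{t-1}$ is a product of two small primes, and $X_t$ is the pair of factors. Factoring stands in for "hard to compute forward" and multiplication for "easy in reverse". The names `conditional_entropy`, `PRIMES`, `hint`, `product` and `factors` are all made up for illustration. Note that the entropy figures only capture the uncertainty side of the argument; the computational cost of the forward step is not measured here.

```python
import math
from collections import Counter
from itertools import combinations_with_replacement


def conditional_entropy(samples):
    """Return H(Y | X) in bits for a list of equally likely (x, y) samples."""
    joint = Counter(samples)
    marginal_x = Counter(x for x, _ in samples)
    n = len(samples)
    h = 0.0
    for (x, _), count in joint.items():
        # p(x, y) * log2 p(y | x), summed over observed pairs
        h -= (count / n) * math.log2(count / marginal_x[x])
    return h


# Hypothetical toy world: every unordered pair of these primes is equally likely.
PRIMES = [3, 5, 7, 11, 13]
factors = list(combinations_with_replacement(PRIMES, 2))  # plays the role of X_t
product = [p * q for p, q in factors]                     # plays the role of X_{t-1}
hint = [n % 4 for n in product]                           # plays the role of X_{t-2}

# Reverse direction: product given factors (just multiply).
h_reverse = conditional_entropy(list(zip(factors, product)))
# Forward one-step direction: factors given product (low entropy, but costly in general).
h_forward = conditional_entropy(list(zip(product, factors)))
# Forward skip direction: factors given only the coarse hint.
h_skip = conditional_entropy(list(zip(hint, factors)))

print(f"H(X_t-1 | X_t)   = {h_reverse:.3f} bits")
print(f"H(X_t   | X_t-1) = {h_forward:.3f} bits")
print(f"H(X_t   | X_t-2) = {h_skip:.3f} bits")
```

In this toy setup both one-step conditionals are deterministic, so the forward predictor's problem at $X_{t-1}$ is purely computational, while falling back on $X_{t-2}$ leaves it with real uncertainty about $X_t$. A learner free to model the reverse direction only ever has to multiply.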