And so since a specific author is just an especially small group
That’s nicely said.
Another current MATS scholar is modeling this group identification very abstractly as: given a pool of token-generating finite-state automata, how quickly (as it receives more tokens) can a transformer trained on the output of those processes point with confidence to the one of those processes that’s producing the current token stream? I’ve been finding that a very useful mental model.
That’s nicely said.
Another current MATS scholar is modeling this group identification very abstractly as: given a pool of token-generating finite-state automata, how quickly (as it receives more tokens) can a transformer trained on the output of those processes point with confidence to the one of those processes that’s producing the current token stream? I’ve been finding that a very useful mental model.