I don’t recall any interpretability experiments with TinyStories offhand, but I’d be surprised if there aren’t any.