gwern comments on LLMs Universally Learn a Feature Representing Token Frequency /​ Rarity