Neel Nanda comments on One-layer transformers aren’t equivalent to a set of skip-trigrams