Is there a reason why every LLM tokenizer I’ve seen excludes slurs? It seems like a cheap way to nudge the model toward AI-assistant behavior.
Also notable that numbers are tokenized digit by digit. I assume this greatly improves its performance on basic arithmetic compared to GPTs, whose BPE merges digit runs into arbitrary multi-digit chunks, so place value is never explicit to the model.
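
A minimal sketch of the difference, assuming tiktoken is installed (pip install tiktoken). The digit-splitting regex below is only an illustration of the per-digit pre-tokenization rule (as LLaMA-style tokenizers use), not any particular model's actual implementation:

    import re
    import tiktoken

    # GPT-2's BPE merges digit runs into multi-digit tokens.
    enc = tiktoken.get_encoding("gpt2")
    text = "12345 + 678"

    bpe_tokens = [enc.decode([t]) for t in enc.encode(text)]
    print("GPT-2 BPE:  ", bpe_tokens)  # digit runs come out as chunks like '123', '45'

    # Per-digit pre-tokenization: every digit becomes its own token,
    # so the model always sees place value explicitly.
    per_digit = re.findall(r"\d|\D+", text)
    print("Digit split:", per_digit)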
That’s interesting, if true. Maybe the tokeniser was trained on a dataset that had been filtered for dirty words.