Michael Roe comments on DeepSeek beats o1-preview on math, ties on coding; will release weights

Michael Roe 26 Nov 2024 11:42 UTC
1 point
0
I suppose we might worry that LlMs might learn to do RLHF evasion this way—human evaluator sees Chinese character they don’t understand, assumes it’s ok, and then the LLM learns you can look acceptable to humans by writing it in Chinese.
Some old books (which are almost certainly in the training set) used Latin for the dirty bits. Translations of Sanskrit poetry, and various works by that reprobate Richard Burton, do this.