Daniel Kokotajlo comments on Christiano, Cotra, and Yudkowsky on AI progress

Daniel Kokotajlo 27 Nov 2021 20:18 UTC
6 points
Is that one dense or sparse/MoE? How many data points was it trained for? Does it set SOTA on anything? (I’m skeptical; I’m wondering if they only trained it for a tiny amount, for example.)