Quintin Pope comments on NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG

Quintin Pope 11 Oct 2021 19:43 UTC
6 points
0
“…even small differences of 1% means there is a noticeable difference in intelligence when using the models for text generation.”

I wish we had better automated metrics for that sort of subjective quality measure. A user study of subjective quality/usefulness would have been good too. That’s not too much to ask of Microsoft, and since they’re presumably aiming to sell access to this and similar models, it would be good for them to provide some indication of how capable the model feels to humans.