Nathan Helm-Burger comments on Language models seem to be much better than humans at next-token prediction

Nathan Helm-Burger 12 Aug 2022 17:50 UTC
1 point
0
In one of Rohin’s comments he describes a ‘next level up’ of difficulty for the model.

″...you can reduce the variance by sampling continuations for a given prompt rather than sampling sequences unprompted...” So basically, predicting the next n tokens where n > 1. Two? Three? Sentence completion multiple choice? Not sure how best to implement. I’d be excited to see this next level turned into a web app somehow! I suspect we might find the LLMs do worse than n==1, but still surprisingly well.

Edit: played the second game w the probabilities and found it a fun challenge. Beating the 2-layer is easy, not thinking too hard, but couldn’t catch that darn 12 layer!