tin482 comments on Google’s new 540 billion parameter language model

tin482 5 Apr 2022 18:40 UTC
7 points
Personally, I think approaches like STaR (28 March 2022) will be important: bootstrap from weak chain-of-thought reasoners to strong ones by retraining on successful inner monologues. They also implement “backward chaining”: training on monologues generated with the correct answer visible.