I would imagine that if you have a limited question pool used for self-supervision, then applying this constraint while training from scratch would result in overfitting and poor generalization (but I’m not super confident in this, and there might be decent ways to avoid it).
If the question pool is very large or procedurally generated, or the constraint is enforced on text generation in general (I’m not sure this makes much sense), then this might do something interesting.
I don’t have the resources to run an experiment like this at the moment (particularly not with a very large model like GPT-J).