Maybe there’s some way of doing oversight that verifies that the output is either random soup or some super-sophisticated hidden attack? And maybe we can rule out the latter if the model hasn’t undergone nearly enough training yet?