Is RLHF updating abstract circuits an established fact? Why would it suffer from mode collapse in that case?
Is RLHF updating abstract circuits an established fact? Why would it suffer from mode collapse in that case?