Ryan Greenblatt has provided a few mechanisms for detecting/preventing alignment faking in his comment here.