How many of the ~150 hours with Christiano used the Double Crux procedure or had other involvement with CFAR staff? I know that CFAR had some controversies, but that has no bearing on the fact that alignment researchers have a different skillset from CFAR, and even CFAR staff are only as good as the rationality techniques that have already been discovered.
Did you set up termination/dayslong breaks so that either party could privately have space to change their mind, with minimal loss of face/reputation? Did you set up plausible deniability so that choosing to take a break is not seen as an admission of defeat? Did you set up to something like “I was wrong” insurance where the winner of the debate pays the loser, or commits to never absorbing/dismantling their faction, or setting up a long buffer period before notifying everyone?
I’d like to see a few more conversations that explicitly follow CFAR’s Double Crux procedure.
How many of the ~150 hours with Christiano used the Double Crux procedure or had other involvement with CFAR staff? I know that CFAR had some controversies, but that has no bearing on the fact that alignment researchers have a different skillset from CFAR, and even CFAR staff are only as good as the rationality techniques that have already been discovered.
Did you set up termination/dayslong breaks so that either party could privately have space to change their mind, with minimal loss of face/reputation? Did you set up plausible deniability so that choosing to take a break is not seen as an admission of defeat? Did you set up to something like “I was wrong” insurance where the winner of the debate pays the loser, or commits to never absorbing/dismantling their faction, or setting up a long buffer period before notifying everyone?