Richard_Ngo comments on Why I’m excited about Debate

Richard_Ngo 17 Jan 2021 1:16 UTC
LW: 18 AF: 9
AF
I think I agree with all of this. In fact, this argument is one reason why I think Debate could be valuable, because it will hopefully increase the maximum complexity of arguments that humans can reliably evaluate.
This eventually fails at some point, but hopefully it fails after the point at which we can use Debate to solve alignment in a more scalable way. (I don’t have particularly strong intuitions about whether this hope is justified, though.)