evhub comments on Alignment proposals and complexity classes

evhub 17 Jul 2020 4:36 UTC
LW: 9 AF: 4
AF
I agree with the gist that it implies that arguments about the equilibrium policy don’t necessarily translate to real models, though I disagree that that’s necessarily bad news for the alignment scheme—it just means you need to find some guarantees that work even when you’re not at equilibrium.