Ofer comments on A dilemma for prosaic AI alignment

Ofer 18 Dec 2019 17:02 UTC
Whoops, (2) came out cryptic and is also incorrect, sorry. The (correct?) idea I was trying to convey is the following:
If ‘the safety scheme’ in plan 1 requires anything at all that ruins competitiveness (for example, some human-in-the-loop process that recurs throughout training), then the reasoning in the OP goes through without any further assumptions (such as that conjecture), AFAICT.
I no longer think this idea amounts to making the conjecture strictly weaker.