Donald Hobson comments on Safe Scrambling?

Donald Hobson 29 Aug 2020 23:31 UTC
LW: 6 AF: 3
AF
If you have an AI training method that passes the test >50% of the time, then you don’t need scrambling.
If you have an approach that takes >1,000,000,000 tries to get right, then you still have to test, so even perfect scrambling won’t help.
Ie, this approach might help if your alignment process is missing between 1 and 30 bits of information.
I am not sure what sort of proposals would do this, but amplified oversight might be one of them.