Regarding disagreement (7): I’d like to see more people using AI to try and make useful contributions to alignment.
More broadly, I think the space of alignment working methods, literally the techniques researchers would use day-to-day, has been under-explored.
If the fate of the world is at stake, shouldn’t someone at least try hokey idea-generation techniques lifted from corporations? Idea-combinations generators? Wacky proof-helper softwares? Weird physical-office setups like that 10-chambered linear room thing I saw somewhere but can’t find now? I don’t expect these to help a ton, and I expect high degrees of failure, but I also expect surviving worlds to have tried them already and maybe written up (short, low-effort, low-conscientiousness, low-mental-cycles) notes on how well they worked.
Regarding disagreement (7): I’d like to see more people using AI to try and make useful contributions to alignment.
More broadly, I think the space of alignment working methods, literally the techniques researchers would use day-to-day, has been under-explored.
If the fate of the world is at stake, shouldn’t someone at least try hokey idea-generation techniques lifted from corporations? Idea-combinations generators? Wacky proof-helper softwares? Weird physical-office setups like that 10-chambered linear room thing I saw somewhere but can’t find now? I don’t expect these to help a ton, and I expect high degrees of failure, but I also expect surviving worlds to have tried them already and maybe written up (short, low-effort, low-conscientiousness, low-mental-cycles) notes on how well they worked.