This is an excellent point. My own proposed alignment methods are vulnerable to this criticism. And they’re most likely to be used unless something changes, afaict. I have worried about this, but not written about it publicly. And it’s good to make.it formal and explicit.
The other argument is that people seem to be rushing in headlong even when they don’t know of any promising alignment methods at all, sooo...
This is an excellent point. My own proposed alignment methods are vulnerable to this criticism. And they’re most likely to be used unless something changes, afaict. I have worried about this, but not written about it publicly. And it’s good to make.it formal and explicit.
The other argument is that people seem to be rushing in headlong even when they don’t know of any promising alignment methods at all, sooo...