I don’t see how you arrive at these conclusions at all. I agree that, given how alignment of current models works, there’s some vague hope that things might keep going like this even as capabilities increase. Is there any specific thing that makes you update more strongly?