It might well be that 1) people who already know RL shouldn’t be much surprised by this result and 2) people who don’t know much RL are justified in updating on this info (towards mesa-optimizers arising more easily).
I agree. It seems pretty bad if the participants of a forum about AI alignment don’t know RL.