The paper sounds fine quality-wise to me, I just find it implausible that it’s relevant for important alignment work, since the proposed mechanism is mainly an aversion to building new capabilities.
The paper sounds fine quality-wise to me, I just find it implausible that it’s relevant for important alignment work, since the proposed mechanism is mainly an aversion to building new capabilities.