These are the AI safety subprojects designed to elucidate “model splintering” and “learning the preferences of irrational agents”.