RSS

joshc

Karma: 1,501

Train­ing AI to do al­ign­ment re­search we don’t already know how to do

joshc24 Feb 2025 19:19 UTC
34 points
23 comments7 min readLW link