ArchiveSequencesAbout

QuestionsEventsShortformAlignment ForumAF Comments

HomeFeaturedAllTagsRecent Comments

leogao comments on Vote on Interesting Disagreements

leogao 10 Nov 2023 21:09 UTC
21 points
−1
It is possible to make meaningful progress on deceptive alignment using experiments on current models