RSS

bfitzgerald3132

Karma: 4

AI mod­els in­her­ently al­ter “hu­man val­ues.” So, al­ign­ment-based AI safety ap­proaches must bet­ter ac­count for value drift

bfitzgerald313213 Jan 2025 19:22 UTC
5 points
2 comments13 min readLW link