RSS

jacob_drori

Karma: 80

Open Source Au­to­mated In­ter­pretabil­ity for Sparse Au­toen­coder Features

30 Jul 2024 21:11 UTC
67 points
1 comment13 min readLW link
(blog.eleuther.ai)

A thought ex­per­i­ment to help per­suade skep­tics that power-seek­ing AI is plausible

jacob_drori25 Nov 2023 23:26 UTC
2 points
4 comments5 min readLW link