RSS

Andrew Mack

Karma: 163

Mechanis­ti­cally Elic­it­ing La­tent Be­hav­iors in Lan­guage Models

30 Apr 2024 18:51 UTC
164 points
37 comments45 min readLW link