RSS

scasper

Karma: 1,991

https://​​stephencasper.com/​​

Refram­ing AI Safety as a Nev­erend­ing In­sti­tu­tional Challenge

scasperMar 23, 2025, 12:13 AM
50 points
12 comments5 min readLW link

EIS XV: A New Proof of Con­cept for Use­ful Interpretability

scasperMar 17, 2025, 8:05 PM
28 points
2 comments3 min readLW link

EIS XIV: Is mechanis­tic in­ter­pretabil­ity about to be prac­ti­cally use­ful?

scasperOct 11, 2024, 10:13 PM
68 points
4 comments7 min readLW link

Can Gen­er­al­ized Ad­ver­sar­ial Test­ing En­able More Ri­gor­ous LLM Safety Evals?

scasperJul 30, 2024, 2:57 PM
25 points
0 comments4 min readLW link

EIS XIII: Reflec­tions on An­thropic’s SAE Re­search Circa May 2024

scasperMay 21, 2024, 8:15 PM
157 points
16 comments3 min readLW link

Analo­gies be­tween scal­ing labs and mis­al­igned su­per­in­tel­li­gent AI

scasperFeb 21, 2024, 7:29 PM
76 points
5 comments4 min readLW link