RSS

Itay Yona

Karma: 25

Reflec­tions on Trust­ing Trust & AI

Itay YonaJan 16, 2023, 6:36 AM
10 points
1 comment3 min readLW link
(mentaleap.ai)

Analo­gies be­tween Soft­ware Re­v­erse Eng­ineer­ing and Mechanis­tic Interpretability

Dec 26, 2022, 12:26 PM
34 points
6 comments11 min readLW link
(www.neelnanda.io)