Sonia Joseph

Karma: 116

Pursuing a PhD in multimodal interpretability and alignment at Mila.

Twitter: @soniajoseph_

Litigate-for-Impact: Preparing Legal Action against an AGI Frontier Lab Leader

Sonia Joseph · Dec 7, 2024, 9:42 PM
38 points
7 comments · 2 min read · LW link

Bridging the VLM and mech interp communities for multimodal interpretability

Sonia Joseph · Oct 28, 2024, 2:41 PM
19 points
5 comments · 15 min read · LW link

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent

Jul 18, 2024, 5:02 PM
9 points
0 comments · 1 min read · LW link
(arxiv.org)

Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems

Mar 13, 2024, 5:09 PM
44 points
13 comments · 14 min read · LW link