RSS

Clement Neo

Karma: 183

Twitter: _clementneo
Site: clementneo.com

Analysing Ad­ver­sar­ial At­tacks with Lin­ear Probing

17 Jun 2024 14:16 UTC
9 points
0 comments8 min readLW link

Sparse au­toen­coders find com­posed fea­tures in small toy mod­els

14 Mar 2024 18:00 UTC
33 points
12 comments15 min readLW link

Multi-Agent Se­cu­rity Hackathon

5 Feb 2024 22:51 UTC
6 points
0 comments1 min readLW link

We Found An Neu­ron in GPT-2

11 Feb 2023 18:27 UTC
143 points
23 comments7 min readLW link
(clementneo.com)