RSS

dsbowen

Karma: 25

Research Scientist at FAR AI.

Illu­sory Safety: Redteam­ing Deep­Seek R1 and the Strongest Fine-Tun­able Models of OpenAI, An­thropic, and Google

7 Feb 2025 3:57 UTC
29 points
0 comments10 min readLW link