RSS

smallsilo

Karma: 113

Illu­sory Safety: Redteam­ing Deep­Seek R1 and the Strongest Fine-Tun­able Models of OpenAI, An­thropic, and Google

Feb 7, 2025, 3:57 AM
29 points
0 comments10 min readLW link

AISafety.info Distil­la­tion Hackathon

smallsiloOct 1, 2023, 6:54 PM
2 points
0 comments1 min readLW link

Join AISafety.info’s Distil­la­tion Hackathon (Oct 6-9th)

smallsiloOct 1, 2023, 6:43 PM
21 points
0 comments2 min readLW link
(forum.effectivealtruism.org)

GPT-pow­ered EA/​LW weekly summary

smallsiloAug 25, 2023, 6:19 PM
18 points
1 comment11 min readLW link
(forum.effectivealtruism.org)

AISafety.info’s Writ­ing & Edit­ing Hackathon

smallsiloAug 5, 2023, 5:14 PM
2 points
0 comments1 min readLW link

Join AISafety.info’s Writ­ing & Edit­ing Hackathon (Aug 25-28) (Prizes to be won!)

smallsiloAug 5, 2023, 2:08 PM
19 points
3 comments1 min readLW link
(forum.effectivealtruism.org)

All AGI Safety ques­tions wel­come (es­pe­cially ba­sic ones) [July 2023]

smallsiloJul 20, 2023, 8:20 PM
38 points
40 comments2 min readLW link
(forum.effectivealtruism.org)