RSS

gasteigerjo

Karma: 167

Working on Alignment Science at Anthropic

AI Safety at the Fron­tier: Paper High­lights, Oc­to­ber ’24

gasteigerjo31 Oct 2024 0:09 UTC
3 points
0 comments9 min readLW link
(aisafetyfrontier.substack.com)

AI Safety at the Fron­tier: Paper High­lights, Septem­ber ’24

gasteigerjo2 Oct 2024 9:49 UTC
13 points
0 comments7 min readLW link
(aisafetyfrontier.substack.com)

AI Safety at the Fron­tier: Paper High­lights, Au­gust ’24

gasteigerjo3 Sep 2024 19:17 UTC
28 points
0 comments6 min readLW link
(aisafetyfrontier.substack.com)

AI Safety at the Fron­tier: Paper High­lights, July ’24

gasteigerjo5 Aug 2024 13:00 UTC
8 points
0 comments7 min readLW link
(aisafetyfrontier.substack.com)

Dis­cus­sion: Challenges with Un­su­per­vised LLM Knowl­edge Discovery

18 Dec 2023 11:58 UTC
147 points
21 comments10 min readLW link