RSS

Dan H

Karma: 3,149

newsletter.safe.ai

newsletter.mlsafety.org

AISN #44: The Trump Cir­cle on AI Safety Plus, Chi­nese re­searchers used Llama to cre­ate a mil­i­tary tool for the PLA, a Google AI sys­tem dis­cov­ered a zero-day cy­ber­se­cu­rity vuln­er­a­bil­ity, and Com­plex Sys­tems

19 Nov 2024 16:36 UTC
7 points
0 comments5 min readLW link
(newsletter.safe.ai)

AI Safety Newslet­ter #43: White House Is­sues First Na­tional Se­cu­rity Memo on AI Plus, AI and Job Dis­place­ment, and AI Takes Over the Nobels

28 Oct 2024 16:03 UTC
6 points
0 comments6 min readLW link
(newsletter.safe.ai)

AI Safety Newslet­ter #42: New­som Ve­toes SB 1047 Plus, OpenAI’s o1, and AI Gover­nance Summary

1 Oct 2024 20:35 UTC
8 points
0 comments6 min readLW link
(newsletter.safe.ai)

AI Safety Newslet­ter #41: The Next Gen­er­a­tion of Com­pute Scale Plus, Rank­ing Models by Sus­cep­ti­bil­ity to Jailbreak­ing, and Ma­chine Ethics

11 Sep 2024 19:14 UTC
5 points
1 comment5 min readLW link
(newsletter.safe.ai)

AI fore­cast­ing bots incoming

9 Sep 2024 19:14 UTC
29 points
44 comments4 min readLW link
(www.safe.ai)

AI Safety Newslet­ter #40: Cal­ifor­nia AI Leg­is­la­tion Plus, NVIDIA De­lays Chip Pro­duc­tion, and Do AI Safety Bench­marks Ac­tu­ally Mea­sure Safety?

21 Aug 2024 18:09 UTC
11 points
0 comments6 min readLW link
(newsletter.safe.ai)

The Bit­ter Les­son for AI Safety Research

2 Aug 2024 18:39 UTC
57 points
5 comments3 min readLW link

AI Safety Newslet­ter #39: Im­pli­ca­tions of a Trump Ad­minis­tra­tion for AI Policy Plus, Safety Engineering

29 Jul 2024 17:50 UTC
17 points
1 comment6 min readLW link
(newsletter.safe.ai)

AISN #38: Supreme Court De­ci­sion Could Limit Fed­eral Abil­ity to Reg­u­late AI Plus, “Cir­cuit Break­ers” for AI sys­tems, and up­dates on China’s AI industry

9 Jul 2024 19:28 UTC
5 points
0 comments5 min readLW link
(newsletter.safe.ai)

UC Berkeley course on LLMs and ML Safety

Dan H9 Jul 2024 15:40 UTC
36 points
1 comment1 min readLW link
(rdi.berkeley.edu)

AI Safety Newslet­ter #37: US Launches An­titrust In­ves­ti­ga­tions Plus, re­cent crit­i­cisms of OpenAI and An­thropic, and a sum­mary of Si­tu­a­tional Awareness

18 Jun 2024 18:07 UTC
8 points
0 comments5 min readLW link
(newsletter.safe.ai)

AISN #36: Vol­un­tary Com­mit­ments are In­suffi­cient Plus, a Se­nate AI Policy Roadmap, and Chap­ter 1: An Overview of Catas­trophic Risks

5 Jun 2024 17:45 UTC
9 points
0 comments5 min readLW link
(newsletter.safe.ai)