RSS

Dan H

Karma: 3,153

newsletter.safe.ai

newsletter.mlsafety.org

AISN #44: The Trump Cir­cle on AI Safety Plus, Chi­nese re­searchers used Llama to cre­ate a mil­i­tary tool for the PLA, a Google AI sys­tem dis­cov­ered a zero-day cy­ber­se­cu­rity vuln­er­a­bil­ity, and Com­plex Sys­tems

19 Nov 2024 16:36 UTC
9 points
0 comments5 min readLW link
(newsletter.safe.ai)

AI Safety Newslet­ter #43: White House Is­sues First Na­tional Se­cu­rity Memo on AI Plus, AI and Job Dis­place­ment, and AI Takes Over the Nobels

28 Oct 2024 16:03 UTC
6 points
0 comments6 min readLW link
(newsletter.safe.ai)

AI Safety Newslet­ter #42: New­som Ve­toes SB 1047 Plus, OpenAI’s o1, and AI Gover­nance Summary

1 Oct 2024 20:35 UTC
8 points
0 comments6 min readLW link
(newsletter.safe.ai)

AI Safety Newslet­ter #41: The Next Gen­er­a­tion of Com­pute Scale Plus, Rank­ing Models by Sus­cep­ti­bil­ity to Jailbreak­ing, and Ma­chine Ethics

11 Sep 2024 19:14 UTC
5 points
1 comment5 min readLW link
(newsletter.safe.ai)

AI fore­cast­ing bots incoming

9 Sep 2024 19:14 UTC
29 points
44 comments4 min readLW link
(www.safe.ai)

AI Safety Newslet­ter #40: Cal­ifor­nia AI Leg­is­la­tion Plus, NVIDIA De­lays Chip Pro­duc­tion, and Do AI Safety Bench­marks Ac­tu­ally Mea­sure Safety?

21 Aug 2024 18:09 UTC
11 points
0 comments6 min readLW link
(newsletter.safe.ai)

The Bit­ter Les­son for AI Safety Research

2 Aug 2024 18:39 UTC
57 points
5 comments3 min readLW link

AI Safety Newslet­ter #39: Im­pli­ca­tions of a Trump Ad­minis­tra­tion for AI Policy Plus, Safety Engineering

29 Jul 2024 17:50 UTC
17 points
1 comment6 min readLW link
(newsletter.safe.ai)

AISN #38: Supreme Court De­ci­sion Could Limit Fed­eral Abil­ity to Reg­u­late AI Plus, “Cir­cuit Break­ers” for AI sys­tems, and up­dates on China’s AI industry

9 Jul 2024 19:28 UTC
5 points
0 comments5 min readLW link
(newsletter.safe.ai)

UC Berkeley course on LLMs and ML Safety

Dan H9 Jul 2024 15:40 UTC
36 points
1 comment1 min readLW link
(rdi.berkeley.edu)

AI Safety Newslet­ter #37: US Launches An­titrust In­ves­ti­ga­tions Plus, re­cent crit­i­cisms of OpenAI and An­thropic, and a sum­mary of Si­tu­a­tional Awareness

18 Jun 2024 18:07 UTC
8 points
0 comments5 min readLW link
(newsletter.safe.ai)

AISN #36: Vol­un­tary Com­mit­ments are In­suffi­cient Plus, a Se­nate AI Policy Roadmap, and Chap­ter 1: An Overview of Catas­trophic Risks

5 Jun 2024 17:45 UTC
9 points
0 comments5 min readLW link
(newsletter.safe.ai)

AISN #35: Lob­by­ing on AI Reg­u­la­tion Plus, New Models from OpenAI and Google, and Le­gal Regimes for Train­ing on Copy­righted Data

16 May 2024 14:29 UTC
2 points
3 comments6 min readLW link
(newsletter.safe.ai)

AISN #34: New Mili­tary AI Sys­tems Plus, AI Labs Fail to Uphold Vol­un­tary Com­mit­ments to UK AI Safety In­sti­tute, and New AI Policy Pro­pos­als in the US Senate

2 May 2024 16:12 UTC
6 points
0 comments8 min readLW link
(newsletter.safe.ai)

AISN #33: Re­assess­ing AI and Biorisk Plus, Con­soli­da­tion in the Cor­po­rate AI Land­scape, and Na­tional In­vest­ments in AI

12 Apr 2024 16:10 UTC
13 points
0 comments9 min readLW link
(newsletter.safe.ai)

AISN #32: Mea­sur­ing and Re­duc­ing Hazardous Knowl­edge in LLMs Plus, Fore­cast­ing the Fu­ture with LLMs, and Reg­u­la­tory Markets

7 Mar 2024 16:39 UTC
8 points
0 comments8 min readLW link
(newsletter.safe.ai)

AISN #31: A New AI Policy Bill in Cal­ifor­nia Plus, Prece­dents for AI Gover­nance and The EU AI Office

21 Feb 2024 21:58 UTC
17 points
0 comments6 min readLW link
(newsletter.safe.ai)

AISN #30: In­vest­ments in Com­pute and Mili­tary AI Plus, Ja­pan and Sin­ga­pore’s Na­tional AI Safety Institutes

24 Jan 2024 19:38 UTC
27 points
1 comment6 min readLW link
(newsletter.safe.ai)

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copy­right In­fringe­ment, and Con­gres­sional Ques­tions about Re­search Stan­dards in AI Safety

4 Jan 2024 16:09 UTC
8 points
0 comments6 min readLW link
(newsletter.safe.ai)

AISN #28: Cen­ter for AI Safety 2023 Year in Review

23 Dec 2023 21:31 UTC
30 points
1 comment5 min readLW link
(newsletter.safe.ai)