Dan H

Karma: 3,439

newsletter.safe.ai

newsletter.mlsafety.org

AISN #49: Superintelligence Strategy

Mar 6, 2025, 5:46 PM
6 points
1 comment · 5 min read · LW link
(newsletter.safe.ai)

Introducing MASK: A Benchmark for Measuring Honesty in AI Systems

Mar 5, 2025, 10:56 PM
35 points
5 comments · 2 min read · LW link
(www.mask-benchmark.ai)

On the Rationality of Deterring ASI

Dan H · Mar 5, 2025, 4:11 PM
167 points
32 comments · 4 min read · LW link
(nationalsecurity.ai)

AISN #48: Utility Engineering and EnigmaEval

Feb 18, 2025, 7:15 PM
4 points
0 comments · 4 min read · LW link
(newsletter.safe.ai)

AISN #47: Reasoning Models

Feb 6, 2025, 6:52 PM
3 points
0 comments · 4 min read · LW link
(newsletter.safe.ai)

AISN #46: The Transition

Jan 23, 2025, 6:09 PM
8 points
0 comments · 5 min read · LW link
(newsletter.safe.ai)

AISN #45: Center for AI Safety 2024 Year in Review

Dec 19, 2024, 6:15 PM
13 points
0 comments · 4 min read · LW link
(newsletter.safe.ai)

AISN #44: The Trump Circle on AI Safety Plus, Chinese researchers used Llama to create a military tool for the PLA, a Google AI system discovered a zero-day cybersecurity vulnerability, and Complex Systems

Nov 19, 2024, 4:36 PM
9 points
0 comments · 5 min read · LW link
(newsletter.safe.ai)

AI Safety Newsletter #43: White House Issues First National Security Memo on AI Plus, AI and Job Displacement, and AI Takes Over the Nobels

Oct 28, 2024, 4:03 PM
6 points
0 comments · 6 min read · LW link
(newsletter.safe.ai)

AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary

Oct 1, 2024, 8:35 PM
8 points
0 comments · 6 min read · LW link
(newsletter.safe.ai)

AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics

Sep 11, 2024, 7:14 PM
5 points
1 comment · 5 min read · LW link
(newsletter.safe.ai)

AI forecasting bots incoming

Sep 9, 2024, 7:14 PM
29 points
44 comments · 4 min read · LW link
(www.safe.ai)

AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety?

Aug 21, 2024, 6:09 PM
11 points
0 comments · 6 min read · LW link
(newsletter.safe.ai)

The Bitter Lesson for AI Safety Research

Aug 2, 2024, 6:39 PM
57 points
5 comments · 3 min read · LW link