Zach Stein-Perlman

Karma: 9,133

AI strategy & governance. ailabwatch.org. ailabwatch.substack.com.

AI labs can boost external safety research

Zach Stein-PerlmanJul 31, 2024, 7:30 PM

31 points

1 comment1 min readLW link

Safety consultations for AI lab employees

Zach Stein-PerlmanJul 27, 2024, 3:00 PM

181 points

4 comments1 min readLW link

New page: Integrity

Zach Stein-PerlmanJul 10, 2024, 3:00 PM

91 points

3 comments1 min readLW link

Claude 3.5 Sonnet

Zach Stein-PerlmanJun 20, 2024, 6:00 PM

75 points

41 comments1 min readLW link

(www.anthropic.com)

Anthropic’s Certificate of Incorporation

Zach Stein-PerlmanJun 12, 2024, 1:00 PM

115 points

7 comments4 min readLW link

Companies’ safety plans neglect risks from scheming AI

Zach Stein-PerlmanJun 3, 2024, 3:00 PM

73 points

4 comments6 min readLW link

AI companies’ commitments

Zach Stein-PerlmanMay 29, 2024, 11:00 AM

36 points

0 comments1 min readLW link

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-PerlmanMay 27, 2024, 1:00 PM

202 points

21 comments2 min readLW link

AI companies aren’t really using external evaluators

Zach Stein-PerlmanMay 24, 2024, 4:01 PM

242 points

15 comments4 min readLW link

New voluntary commitments (AI Seoul Summit)

Zach Stein-PerlmanMay 21, 2024, 11:00 AM

81 points

17 comments7 min readLW link

(www.gov.uk)

DeepMind’s “Frontier Safety Framework” is weak and unambitious

Zach Stein-PerlmanMay 18, 2024, 3:00 AM

159 points

14 comments4 min readLW link

DeepMind: Frontier Safety Framework

Zach Stein-PerlmanMay 17, 2024, 5:30 PM

64 points

0 comments3 min readLW link

(deepmind.google)

Ilya Sutskever and Jan Leike resign from OpenAI [updated]

Zach Stein-PerlmanMay 15, 2024, 12:45 AM

246 points

95 comments2 min readLW link

Questions for labs

Zach Stein-PerlmanApr 30, 2024, 10:15 PM

77 points

11 comments8 min readLW link

Introducing AI Lab Watch

Zach Stein-PerlmanApr 30, 2024, 5:00 PM

225 points

30 comments1 min readLW link

(ailabwatch.org)

Staged release

Zach Stein-PerlmanApr 17, 2024, 4:00 PM

11 points

4 comments2 min readLW link

DeepMind: Evaluating Frontier Models for Dangerous Capabilities

Zach Stein-PerlmanMar 21, 2024, 3:00 AM

61 points

8 comments1 min readLW link

(arxiv.org)

OpenAI: Preparedness framework

Zach Stein-PerlmanDec 18, 2023, 6:30 PM

70 points

23 comments4 min readLW link

(openai.com)

Anthropic, Google, Microsoft & OpenAI announce Executive Director of the Frontier Model Forum & over $10 million for a new AI Safety Fund

Zach Stein-PerlmanOct 25, 2023, 3:20 PM

31 points

8 comments4 min readLW link

(www.frontiermodelforum.org)

OpenAI-Microsoft partnership

Zach Stein-PerlmanOct 3, 2023, 8:01 PM

51 points

19 comments1 min readLW link