RSS

Zach Stein-Perlman

Karma: 9,133

AI strategy & governance. ailabwatch.org. ailabwatch.substack.com.

AI labs can boost ex­ter­nal safety research

Zach Stein-PerlmanJul 31, 2024, 7:30 PM
31 points
1 comment1 min readLW link

Safety con­sul­ta­tions for AI lab employees

Zach Stein-PerlmanJul 27, 2024, 3:00 PM
181 points
4 comments1 min readLW link

New page: Integrity

Zach Stein-PerlmanJul 10, 2024, 3:00 PM
91 points
3 comments1 min readLW link

Claude 3.5 Sonnet

Zach Stein-PerlmanJun 20, 2024, 6:00 PM
75 points
41 comments1 min readLW link
(www.anthropic.com)

An­thropic’s Cer­tifi­cate of Incorporation

Zach Stein-PerlmanJun 12, 2024, 1:00 PM
115 points
7 comments4 min readLW link

Com­pa­nies’ safety plans ne­glect risks from schem­ing AI

Zach Stein-PerlmanJun 3, 2024, 3:00 PM
73 points
4 comments6 min readLW link

AI com­pa­nies’ commitments

Zach Stein-PerlmanMay 29, 2024, 11:00 AM
36 points
0 comments1 min readLW link

Maybe An­thropic’s Long-Term Benefit Trust is powerless

Zach Stein-PerlmanMay 27, 2024, 1:00 PM
202 points
21 comments2 min readLW link

AI com­pa­nies aren’t re­ally us­ing ex­ter­nal evaluators

Zach Stein-PerlmanMay 24, 2024, 4:01 PM
242 points
15 comments4 min readLW link

New vol­un­tary com­mit­ments (AI Seoul Sum­mit)

Zach Stein-PerlmanMay 21, 2024, 11:00 AM
81 points
17 comments7 min readLW link
(www.gov.uk)

Deep­Mind’s “​​Fron­tier Safety Frame­work” is weak and unambitious

Zach Stein-PerlmanMay 18, 2024, 3:00 AM
159 points
14 comments4 min readLW link

Deep­Mind: Fron­tier Safety Framework

Zach Stein-PerlmanMay 17, 2024, 5:30 PM
64 points
0 comments3 min readLW link
(deepmind.google)

Ilya Sutskever and Jan Leike re­sign from OpenAI [up­dated]

Zach Stein-PerlmanMay 15, 2024, 12:45 AM
246 points
95 comments2 min readLW link

Ques­tions for labs

Zach Stein-PerlmanApr 30, 2024, 10:15 PM
77 points
11 comments8 min readLW link

In­tro­duc­ing AI Lab Watch

Zach Stein-PerlmanApr 30, 2024, 5:00 PM
225 points
30 comments1 min readLW link
(ailabwatch.org)

Staged release

Zach Stein-PerlmanApr 17, 2024, 4:00 PM
11 points
4 comments2 min readLW link

Deep­Mind: Eval­u­at­ing Fron­tier Models for Danger­ous Capabilities

Zach Stein-PerlmanMar 21, 2024, 3:00 AM
61 points
8 comments1 min readLW link
(arxiv.org)

OpenAI: Pre­pared­ness framework

Zach Stein-PerlmanDec 18, 2023, 6:30 PM
70 points
23 comments4 min readLW link
(openai.com)

An­thropic, Google, Microsoft & OpenAI an­nounce Ex­ec­u­tive Direc­tor of the Fron­tier Model Fo­rum & over $10 mil­lion for a new AI Safety Fund

Zach Stein-PerlmanOct 25, 2023, 3:20 PM
31 points
8 comments4 min readLW link
(www.frontiermodelforum.org)

OpenAI-Microsoft partnership

Zach Stein-PerlmanOct 3, 2023, 8:01 PM
51 points
19 comments1 min readLW link