Zach Stein-Perlman (Zachary Stein-Perlman)

Karma: 4,759

AI strategy & governance. ailabwatch.org. Looking for new projects.

As of late May 2024, I’m focusing on blogging. In June I expect to focus on exploring a version of ailabwatch.org that could get more attention. I’m most excited to receive offers to help with projects like ailabwatch.org. I’m also excited to be pitched blog posts or projects.

AI companies’ commitments

Zach Stein-Perlman, 29 May 2024 11:00 UTC
34 points
0 comments, 1 min read

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman, 27 May 2024 13:00 UTC
191 points
19 comments, 2 min read

AI companies aren’t really using external evaluators

Zach Stein-Perlman, 24 May 2024 16:01 UTC
239 points
14 comments, 4 min read

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman, 21 May 2024 11:00 UTC
75 points
17 comments, 7 min read
(www.gov.uk)

DeepMind’s “Frontier Safety Framework” is weak and unambitious

Zach Stein-Perlman, 18 May 2024 3:00 UTC
157 points
14 comments, 4 min read

DeepMind: Frontier Safety Framework

Zach Stein-Perlman, 17 May 2024 17:30 UTC
64 points
0 comments, 3 min read
(deepmind.google)

Ilya Sutskever and Jan Leike resign from OpenAI [updated]

Zach Stein-Perlman, 15 May 2024 0:45 UTC
246 points
95 comments, 3 min read

Questions for labs

Zach Stein-Perlman, 30 Apr 2024 22:15 UTC
76 points
11 comments, 8 min read

Introducing AI Lab Watch

Zach Stein-Perlman, 30 Apr 2024 17:00 UTC
219 points
31 comments, 1 min read
(ailabwatch.org)

Staged release

Zach Stein-Perlman, 17 Apr 2024 16:00 UTC
9 points
4 comments, 2 min read

DeepMind: Evaluating Frontier Models for Dangerous Capabilities

Zach Stein-Perlman, 21 Mar 2024 3:00 UTC
61 points
5 comments, 1 min read
(arxiv.org)

OpenAI: Preparedness framework

Zach Stein-Perlman, 18 Dec 2023 18:30 UTC
70 points
23 comments, 4 min read
(openai.com)

Anthropic, Google, Microsoft & OpenAI announce Executive Director of the Frontier Model Forum & over $10 million for a new AI Safety Fund

Zach Stein-Perlman, 25 Oct 2023 15:20 UTC
31 points
8 comments, 4 min read
(www.frontiermodelforum.org)

OpenAI-Microsoft partnership

Zach Stein-Perlman, 3 Oct 2023 20:01 UTC
51 points
18 comments, 1 min read

[Question] Current AI safety techniques?

Zach Stein-Perlman, 3 Oct 2023 19:30 UTC
30 points
2 comments, 2 min read

ARC Evals: Responsible Scaling Policies

Zach Stein-Perlman, 28 Sep 2023 4:30 UTC
40 points
9 comments, 2 min read
(evals.alignment.org)

How to think about slowing AI

Zach Stein-Perlman, 17 Sep 2023 16:00 UTC
14 points
2 comments, 3 min read
(forum.effectivealtruism.org)

Cruxes for overhang

Zach Stein-Perlman, 14 Sep 2023 17:00 UTC
12 points
5 comments, 6 min read
(blog.aiimpacts.org)

Cruxes on US lead for some domestic AI regulation

Zach Stein-Perlman, 10 Sep 2023 18:00 UTC
26 points
3 comments, 2 min read

[Question] Which paths to powerful AI should be boosted?

Zach Stein-Perlman, 23 Aug 2023 16:00 UTC
1 point
0 comments, 1 min read