RSS

Zach Stein-Perlman(Zachary Stein-Perlman)

Karma: 7,297

AI strategy & governance. ailabwatch.org.

Model evals for dan­ger­ous capabilities

Zach Stein-Perlman23 Sep 2024 11:00 UTC
50 points
8 comments3 min readLW link

OpenAI o1

Zach Stein-Perlman12 Sep 2024 17:30 UTC
144 points
41 comments1 min readLW link

Demis Hass­abis — Google Deep­Mind: The Podcast

Zach Stein-Perlman16 Aug 2024 0:00 UTC
55 points
8 comments3 min readLW link
(www.youtube.com)

GPT-4o Sys­tem Card

Zach Stein-Perlman8 Aug 2024 20:30 UTC
68 points
11 comments2 min readLW link
(openai.com)

AI labs can boost ex­ter­nal safety research

Zach Stein-Perlman31 Jul 2024 19:30 UTC
15 points
0 comments1 min readLW link

Safety con­sul­ta­tions for AI lab employees

Zach Stein-Perlman27 Jul 2024 15:00 UTC
181 points
4 comments1 min readLW link

New page: Integrity

Zach Stein-Perlman10 Jul 2024 15:00 UTC
91 points
3 comments1 min readLW link

Claude 3.5 Sonnet

Zach Stein-Perlman20 Jun 2024 18:00 UTC
75 points
41 comments1 min readLW link
(www.anthropic.com)

An­thropic’s Cer­tifi­cate of Incorporation

Zach Stein-Perlman12 Jun 2024 13:00 UTC
115 points
3 comments4 min readLW link

Com­pa­nies’ safety plans ne­glect risks from schem­ing AI

Zach Stein-Perlman3 Jun 2024 15:00 UTC
73 points
4 comments6 min readLW link

AI com­pa­nies’ commitments

Zach Stein-Perlman29 May 2024 11:00 UTC
36 points
0 comments1 min readLW link

Maybe An­thropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman27 May 2024 13:00 UTC
199 points
21 comments2 min readLW link

AI com­pa­nies aren’t re­ally us­ing ex­ter­nal evaluators

Zach Stein-Perlman24 May 2024 16:01 UTC
240 points
15 comments4 min readLW link

New vol­un­tary com­mit­ments (AI Seoul Sum­mit)

Zach Stein-Perlman21 May 2024 11:00 UTC
81 points
17 comments7 min readLW link
(www.gov.uk)

Deep­Mind’s “​​Fron­tier Safety Frame­work” is weak and unambitious

Zach Stein-Perlman18 May 2024 3:00 UTC
159 points
14 comments4 min readLW link

Deep­Mind: Fron­tier Safety Framework

Zach Stein-Perlman17 May 2024 17:30 UTC
64 points
0 comments3 min readLW link
(deepmind.google)

Ilya Sutskever and Jan Leike re­sign from OpenAI [up­dated]

Zach Stein-Perlman15 May 2024 0:45 UTC
246 points
95 comments2 min readLW link

Ques­tions for labs

Zach Stein-Perlman30 Apr 2024 22:15 UTC
77 points
11 comments8 min readLW link

In­tro­duc­ing AI Lab Watch

Zach Stein-Perlman30 Apr 2024 17:00 UTC
222 points
30 comments1 min readLW link
(ailabwatch.org)

Staged release

Zach Stein-Perlman17 Apr 2024 16:00 UTC
9 points
4 comments2 min readLW link