Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Zach Stein-Perlman
(Zachary Stein-Perlman)
Karma:
7,297
AI strategy & governance.
ailabwatch.org
.
All
Posts
Comments
New
Top
Old
Page
1
Model evals for dangerous capabilities
Zach Stein-Perlman
23 Sep 2024 11:00 UTC
50
points
8
comments
3
min read
LW
link
OpenAI o1
Zach Stein-Perlman
12 Sep 2024 17:30 UTC
144
points
41
comments
1
min read
LW
link
Demis Hassabis — Google DeepMind: The Podcast
Zach Stein-Perlman
16 Aug 2024 0:00 UTC
55
points
8
comments
3
min read
LW
link
(www.youtube.com)
GPT-4o System Card
Zach Stein-Perlman
8 Aug 2024 20:30 UTC
68
points
11
comments
2
min read
LW
link
(openai.com)
AI labs can boost external safety research
Zach Stein-Perlman
31 Jul 2024 19:30 UTC
15
points
0
comments
1
min read
LW
link
Safety consultations for AI lab employees
Zach Stein-Perlman
27 Jul 2024 15:00 UTC
181
points
4
comments
1
min read
LW
link
New page: Integrity
Zach Stein-Perlman
10 Jul 2024 15:00 UTC
91
points
3
comments
1
min read
LW
link
Claude 3.5 Sonnet
Zach Stein-Perlman
20 Jun 2024 18:00 UTC
75
points
41
comments
1
min read
LW
link
(www.anthropic.com)
Anthropic’s Certificate of Incorporation
Zach Stein-Perlman
12 Jun 2024 13:00 UTC
115
points
3
comments
4
min read
LW
link
Companies’ safety plans neglect risks from scheming AI
Zach Stein-Perlman
3 Jun 2024 15:00 UTC
73
points
4
comments
6
min read
LW
link
AI companies’ commitments
Zach Stein-Perlman
29 May 2024 11:00 UTC
36
points
0
comments
1
min read
LW
link
Maybe Anthropic’s Long-Term Benefit Trust is powerless
Zach Stein-Perlman
27 May 2024 13:00 UTC
199
points
21
comments
2
min read
LW
link
AI companies aren’t really using external evaluators
Zach Stein-Perlman
24 May 2024 16:01 UTC
240
points
15
comments
4
min read
LW
link
New voluntary commitments (AI Seoul Summit)
Zach Stein-Perlman
21 May 2024 11:00 UTC
81
points
17
comments
7
min read
LW
link
(www.gov.uk)
DeepMind’s “Frontier Safety Framework” is weak and unambitious
Zach Stein-Perlman
18 May 2024 3:00 UTC
159
points
14
comments
4
min read
LW
link
DeepMind: Frontier Safety Framework
Zach Stein-Perlman
17 May 2024 17:30 UTC
64
points
0
comments
3
min read
LW
link
(deepmind.google)
Ilya Sutskever and Jan Leike resign from OpenAI [updated]
Zach Stein-Perlman
15 May 2024 0:45 UTC
246
points
95
comments
2
min read
LW
link
Questions for labs
Zach Stein-Perlman
30 Apr 2024 22:15 UTC
77
points
11
comments
8
min read
LW
link
Introducing AI Lab Watch
Zach Stein-Perlman
30 Apr 2024 17:00 UTC
222
points
30
comments
1
min read
LW
link
(ailabwatch.org)
Staged release
Zach Stein-Perlman
17 Apr 2024 16:00 UTC
9
points
4
comments
2
min read
LW
link
Back to top
Next