Zach Stein-Perlman

Karma: 8,387

AI strategy & governance. ailabwatch.org. ailabwatch.substack.com.

List of AI safety papers from companies, 2023–2024

Zach Stein-Perlman, 15 Jan 2025 18:00 UTC
10 points
0 comments, 1 min read, LW link

Anthropic leadership conversation

Zach Stein-Perlman, 20 Dec 2024 22:00 UTC
67 points
17 comments, 6 min read, LW link
(www.youtube.com)

o3

Zach Stein-Perlman, 20 Dec 2024 18:30 UTC
154 points
156 comments, 1 min read, LW link

DeepSeek beats o1-preview on math, ties on coding; will release weights

Zach Stein-Perlman, 20 Nov 2024 23:50 UTC
113 points
26 comments, 1 min read, LW link

Anthropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-Perlman, 6 Nov 2024 16:00 UTC
95 points
33 comments, 1 min read, LW link
(alignment.anthropic.com)

The current state of RSPs

Zach Stein-Perlman, 4 Nov 2024 16:00 UTC
23 points
2 comments, 9 min read, LW link

Miles Brundage: Finding Ways to Credibly Signal the Benignness of AI Development and Deployment is an Urgent Priority

Zach Stein-Perlman, 28 Oct 2024 17:00 UTC
22 points
4 comments, 3 min read, LW link
(milesbrundage.substack.com)

UK AISI: Early lessons from evaluating frontier AI systems

Zach Stein-Perlman, 25 Oct 2024 19:00 UTC
26 points
0 comments, 2 min read, LW link
(www.aisi.gov.uk)

Lab governance reading list

Zach Stein-Perlman, 25 Oct 2024 18:00 UTC
20 points
3 comments, 1 min read, LW link

IAPS: Mapping Technical Safety Research at AI Companies

Zach Stein-Perlman, 24 Oct 2024 20:30 UTC
42 points
13 comments, 1 min read, LW link
(www.iaps.ai)

What AI companies should do: Some rough ideas

Zach Stein-Perlman, 21 Oct 2024 14:00 UTC
33 points
10 comments, 5 min read, LW link

Anthropic rewrote its RSP

Zach Stein-Perlman, 15 Oct 2024 14:25 UTC
46 points
19 comments, 6 min read, LW link

Model evals for dangerous capabilities

Zach Stein-Perlman, 23 Sep 2024 11:00 UTC
51 points
11 comments, 3 min read, LW link

OpenAI o1

Zach Stein-Perlman, 12 Sep 2024 17:30 UTC
147 points
41 comments, 1 min read, LW link

Demis Hassabis — Google DeepMind: The Podcast

Zach Stein-Perlman, 16 Aug 2024 0:00 UTC
55 points
8 comments, 3 min read, LW link
(www.youtube.com)

GPT-4o System Card

Zach Stein-Perlman, 8 Aug 2024 20:30 UTC
68 points
11 comments, 2 min read, LW link
(openai.com)

AI labs can boost external safety research

Zach Stein-Perlman, 31 Jul 2024 19:30 UTC
31 points
1 comment, 1 min read, LW link

Safety consultations for AI lab employees

Zach Stein-Perlman, 27 Jul 2024 15:00 UTC
181 points
4 comments, 1 min read, LW link

New page: Integrity

Zach Stein-Perlman, 10 Jul 2024 15:00 UTC
91 points
3 comments, 1 min read, LW link

Claude 3.5 Sonnet

Zach Stein-Perlman, 20 Jun 2024 18:00 UTC
75 points
41 comments, 1 min read, LW link
(www.anthropic.com)