AI Safety Cases

Tag. Last edit: 19 Nov 2024 22:17 UTC by Rauno Arike

A safety case is a structured argument showing that a system is acceptably safe for a specific use in a specific environment. Safety cases typically include:

Near- and medium-term AI Control Safety Cases

Martín Soto, 23 Dec 2024 17:37 UTC
9 points
0 comments, 6 min read, LW link

New report: Safety Cases for AI

joshc, 20 Mar 2024 16:45 UTC
89 points
14 comments, 1 min read, LW link
(twitter.com)

Toward Safety Cases For AI Scheming

31 Oct 2024 17:20 UTC
60 points
1 comment, 2 min read, LW link

Anthropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-Perlman, 6 Nov 2024 16:00 UTC
95 points
33 comments, 1 min read, LW link
(alignment.anthropic.com)