RSS

AI Safety Cases

TagLast edit: 19 Nov 2024 22:17 UTC by Rauno Arike

A safety case is a structured argument showing that a system is acceptably safe for a specific use in a specific environment. Safety cases typically include:

New re­port: Safety Cases for AI

joshc20 Mar 2024 16:45 UTC
89 points
14 comments1 min readLW link
(twitter.com)

Toward Safety Cases For AI Scheming

31 Oct 2024 17:20 UTC
60 points
1 comment2 min readLW link

An­thropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-Perlman6 Nov 2024 16:00 UTC
93 points
32 comments1 min readLW link
(alignment.anthropic.com)
No comments.