RSS

Guaran­teed Safe AI

TagLast edit: 9 Aug 2024 23:22 UTC by bgold

Towards Guaran­teed Safe AI: A Frame­work for En­sur­ing Ro­bust and Reli­able AI Systems

Gunnar_Zarncke16 May 2024 13:09 UTC
51 points
20 comments1 min readLW link
(arxiv.org)

Towards Guaran­teed Safe AI: A Frame­work for En­sur­ing Ro­bust and Reli­able AI Systems

Joar Skalse17 May 2024 19:13 UTC
65 points
10 comments2 min readLW link

Davi­dad’s Prov­ably Safe AI Ar­chi­tec­ture—ARIA’s Pro­gramme Thesis

simeon_c1 Feb 2024 21:30 UTC
69 points
17 comments1 min readLW link
(www.aria.org.uk)

Limi­ta­tions on For­mal Ver­ifi­ca­tion for AI Safety

Andrew Dickson19 Aug 2024 23:03 UTC
133 points
60 comments23 min readLW link

Prov­ably Safe AI: Wor­ld­view and Projects

9 Aug 2024 23:21 UTC
51 points
43 comments7 min readLW link

Prov­ably Safe AI

PeterMcCluskey5 Oct 2023 22:18 UTC
33 points
15 comments4 min readLW link
(bayesianinvestor.com)

Can a Bayesian Or­a­cle Prevent Harm from an Agent? (Ben­gio et al. 2024)

mattmacdermott1 Sep 2024 7:46 UTC
26 points
0 comments5 min readLW link
(yoshuabengio.org)
No comments.