RSS

TW123

Karma: 1,234

Risks from AI Overview: Summary

Aug 18, 2023, 1:21 AM
25 points
1 comment13 min readLW link
(www.safe.ai)

Catas­trophic Risks from AI #6: Dis­cus­sion and FAQ

Jun 27, 2023, 11:23 PM
24 points
1 comment13 min readLW link
(arxiv.org)

Catas­trophic Risks from AI #5: Rogue AIs

Jun 27, 2023, 10:06 PM
15 points
0 comments22 min readLW link
(arxiv.org)

Catas­trophic Risks from AI #4: Or­ga­ni­za­tional Risks

Jun 26, 2023, 7:36 PM
23 points
0 comments21 min readLW link
(arxiv.org)

Catas­trophic Risks from AI #3: AI Race

Jun 23, 2023, 7:21 PM
18 points
9 comments29 min readLW link
(arxiv.org)

Catas­trophic Risks from AI #2: Mal­i­cious Use

Jun 22, 2023, 5:10 PM
38 points
1 comment17 min readLW link
(arxiv.org)

Catas­trophic Risks from AI #1: Introduction

Jun 22, 2023, 5:09 PM
40 points
1 comment5 min readLW link
(arxiv.org)

[MLSN #9] Ver­ify­ing large train­ing runs, se­cu­rity risks from LLM ac­cess to APIs, why nat­u­ral se­lec­tion may fa­vor AIs over humans

Apr 11, 2023, 4:03 PM
11 points
0 comments6 min readLW link
(newsletter.mlsafety.org)

[MLSN #8] Mechanis­tic in­ter­pretabil­ity, us­ing law to in­form AI al­ign­ment, scal­ing laws for proxy gaming

Feb 20, 2023, 3:54 PM
20 points
0 comments4 min readLW link
(newsletter.mlsafety.org)