RSS

Davidmanheim

Karma: 4,466

Biorisk is an Un­helpful Anal­ogy for AI Risk

Davidmanheim6 May 2024 6:20 UTC
4 points
17 comments1 min readLW link

A Dozen Ways to Get More Dakka

Davidmanheim8 Apr 2024 4:45 UTC
99 points
10 comments3 min readLW link

“Open Source AI” isn’t Open Source

Davidmanheim15 Feb 2024 8:59 UTC
16 points
15 comments1 min readLW link
(davidmanheim.substack.com)

Tech­nolo­gies and Ter­minol­ogy: AI isn’t Soft­ware, it’s… Deep­ware?

13 Feb 2024 13:37 UTC
40 points
9 comments8 min readLW link

Safe Sta­sis Fallacy

Davidmanheim5 Feb 2024 10:54 UTC
54 points
2 comments1 min readLW link

AI Is Not Software

Davidmanheim2 Jan 2024 7:58 UTC
56 points
29 comments5 min readLW link

Public Call for In­ter­est in Math­e­mat­i­cal Alignment

Davidmanheim22 Nov 2023 13:22 UTC
89 points
9 comments1 min readLW link

What is au­ton­omy, and how does it lead to greater risk from AI?

Davidmanheim1 Aug 2023 7:58 UTC
30 points
0 comments6 min readLW link

A Defense of Work on Math­e­mat­i­cal AI Safety

Davidmanheim6 Jul 2023 14:15 UTC
28 points
13 comments3 min readLW link
(forum.effectivealtruism.org)

“Safety Cul­ture for AI” is im­por­tant, but isn’t go­ing to be easy

Davidmanheim26 Jun 2023 12:52 UTC
47 points
2 comments2 min readLW link
(forum.effectivealtruism.org)

“LLMs Don’t Have a Co­her­ent Model of the World”—What it Means, Why it Mat­ters

Davidmanheim1 Jun 2023 7:46 UTC
31 points
2 comments7 min readLW link

Sys­tems that can­not be un­safe can­not be safe

Davidmanheim2 May 2023 8:53 UTC
62 points
27 comments2 min readLW link

Beyond a bet­ter world

Davidmanheim14 Dec 2022 10:18 UTC
14 points
7 comments4 min readLW link
(progressforum.org)

Far-UVC Light Up­date: No, LEDs are not around the cor­ner (tweet­storm)

Davidmanheim2 Nov 2022 12:57 UTC
70 points
27 comments4 min readLW link
(twitter.com)

An­nounc­ing AISIC 2022 - the AI Safety Is­rael Con­fer­ence, Oc­to­ber 19-20

Davidmanheim21 Sep 2022 19:32 UTC
13 points
0 comments1 min readLW link

Re­hovot, Is­rael – ACX Mee­tups Every­where 2022

Davidmanheim25 Aug 2022 18:01 UTC
3 points
0 comments1 min readLW link

AI Gover­nance across Slow/​Fast Take­off and Easy/​Hard Align­ment spectra

Davidmanheim3 Apr 2022 7:45 UTC
27 points
6 comments3 min readLW link

Ar­gu­ments about Highly Reli­able Agent De­signs as a Use­ful Path to Ar­tifi­cial In­tel­li­gence Safety

27 Jan 2022 13:13 UTC
27 points
0 comments1 min readLW link
(arxiv.org)

Elic­i­ta­tion for Model­ing Trans­for­ma­tive AI Risks

Davidmanheim16 Dec 2021 15:24 UTC
30 points
2 comments9 min readLW link

Model­ling Trans­for­ma­tive AI Risks (MTAIR) Pro­ject: Introduction

16 Aug 2021 7:12 UTC
91 points
0 comments9 min readLW link