davidad

Karma: 2,130

Programme Director at UK Advanced Research + Invention Agency focusing on safe transformative AI; formerly Protocol Labs, FHI/Oxford, Harvard Biophysics, MIT Mathematics and Computation.

What does davidad want from «boundaries»?

Feb 6, 2024, 5:45 PM
47 points
1 comment
5 min read
LW link

Does davidad’s uploading moonshot work?

Nov 3, 2023, 2:21 AM
146 points
35 comments
25 min read
LW link

A list of core AI safety problems and how I hope to solve them

davidad
Aug 26, 2023, 3:12 PM
165 points
29 comments
5 min read
LW link

Compute Thresholds: proposed rules to mitigate risk of a “lab leak” accident during AI training runs

davidad
Jul 22, 2023, 6:09 PM
80 points
2 comments
2 min read
LW link

An Open Agency Architecture for Safe Transformative AI

davidad
Dec 20, 2022, 1:04 PM
80 points
22 comments
4 min read
LW link

AI Neorealism: a threat model & success criterion for existential safety

davidad
Dec 15, 2022, 1:42 PM
67 points
1 comment
3 min read
LW link

Side-channels: input versus output

davidad
Dec 12, 2022, 12:32 PM
44 points
16 comments
2 min read
LW link

Reframing inner alignment

davidad
Dec 11, 2022, 1:53 PM
53 points
13 comments
4 min read
LW link

You can still fetch the coffee today if you’re dead tomorrow

davidad
Dec 9, 2022, 2:06 PM
96 points
19 comments
5 min read
LW link

Cryptoepistemology

davidad
Feb 24, 2022, 8:34 PM
31 points
3 comments
2 min read
LW link

The Promise and Peril of Finite Sets

davidad
Dec 10, 2021, 12:29 PM
42 points
5 comments
6 min read
LW link

davidad’s Shortform

davidad
Dec 9, 2021, 6:16 PM
4 points
13 comments
1 min read
LW link

Why I Moved from AI to Neuroscience, or: Uploading Worms

davidad
Apr 13, 2012, 7:10 AM
67 points
58 comments
1 min read
LW link