davidad

Karma: 2,130

Programme Director at UK Advanced Research + Invention Agency focusing on safe transformative AI; formerly Protocol Labs, FHI/Oxford, Harvard Biophysics, MIT Mathematics And Computation.

What does davidad want from «boundaries»?

Chipmonk and davidad

Feb 6, 2024, 5:45 PM

47 points

1 comment5 min readLW link

Does davidad’s uploading moonshot work?

Bird Concept, lisathiergart, Anders_Sandberg, davidad and Arenamontanus

Nov 3, 2023, 2:21 AM

146 points

35 comments25 min readLW link

A list of core AI safety problems and how I hope to solve them

davidadAug 26, 2023, 3:12 PM

165 points

29 comments5 min readLW link

Compute Thresholds: proposed rules to mitigate risk of a “lab leak” accident during AI training runs

davidadJul 22, 2023, 6:09 PM

80 points

2 comments2 min readLW link

An Open Agency Architecture for Safe Transformative AI

davidadDec 20, 2022, 1:04 PM

80 points

22 comments4 min readLW link

AI Neorealism: a threat model & success criterion for existential safety

davidadDec 15, 2022, 1:42 PM

67 points

1 comment3 min readLW link

Side-channels: input versus output

davidadDec 12, 2022, 12:32 PM

44 points

16 comments2 min readLW link

Reframing inner alignment

davidadDec 11, 2022, 1:53 PM

53 points

13 comments4 min readLW link

You can still fetch the coffee today if you’re dead tomorrow

davidadDec 9, 2022, 2:06 PM

96 points

19 comments5 min readLW link

Cryptoepistemology

davidadFeb 24, 2022, 8:34 PM

31 points

3 comments2 min readLW link

The Promise and Peril of Finite Sets

davidadDec 10, 2021, 12:29 PM

42 points

5 comments6 min readLW link

davidad’s Shortform

davidadDec 9, 2021, 6:16 PM

4 points

13 comments1 min readLW link

Why I Moved from AI to Neuroscience, or: Uploading Worms

davidadApr 13, 2012, 7:10 AM

67 points

58 comments1 min readLW link