JustinShovelain

Karma: 594

I am the co founder of and researcher at the quantitative long term strategy organization Convergence (see here for our growing list of publications). Over the last sixteen years I have worked with MIRI, CFAR, EA Global, and Founders Fund, and done work in EA strategy, fundraising, networking, teaching, cognitive enhancement, and AI safety research. I have a MS degree in computer science and BS degrees in computer science, mathematics, and physics.

Goodhart Typology via Structure, Function, and Randomness Distributions

JustinShovelain and Mateusz Bagiński

Mar 25, 2025, 4:01 PM

32 points

0 comments15 min readLW link

Bounded AI might be viable

Mateusz Bagiński and JustinShovelain

Mar 6, 2025, 12:55 PM

17 points

0 comments20 min readLW link

Information-Theoretic Boxing of Superintelligences

JustinShovelain and Elliot Mckernon

Nov 30, 2023, 2:31 PM

30 points

0 comments7 min readLW link

The risk-reward tradeoff of interpretability research

JustinShovelain and Elliot Mckernon

Jul 5, 2023, 5:05 PM

15 points

1 comment6 min readLW link

Aligning AI by optimizing for “wisdom”

JustinShovelain and Elliot Mckernon

Jun 27, 2023, 3:20 PM

27 points

8 comments12 min readLW link

Improving the safety of AI evals

JustinShovelain and Elliot Mckernon

May 17, 2023, 10:24 PM

13 points

7 comments7 min readLW link

Keep humans in the loop

JustinShovelain and Elliot Mckernon

Apr 19, 2023, 3:34 PM

23 points

1 comment10 min readLW link

Updating Utility Functions

JustinShovelain and Joar Skalse

May 9, 2022, 9:44 AM

41 points

6 comments8 min readLW link

Goodhart’s Law Causal Diagrams

JustinShovelain and Jeremy Gillen

Apr 11, 2022, 1:52 PM

34 points

6 comments6 min readLW link

How Money Fails to Track Value

JustinShovelainApr 2, 2022, 12:32 PM

17 points

0 comments5 min readLW link

Evaluating expertise: a clear box model

JustinShovelainOct 15, 2020, 2:18 PM

36 points

3 comments5 min readLW link

Good and bad ways to think about downside risks

MichaelA and JustinShovelain

Jun 11, 2020, 1:38 AM

19 points

12 comments11 min readLW link

COVID-19: An opportunity to help by modelling testing and tracing to inform the UK government

JustinShovelainApr 17, 2020, 5:21 PM

14 points

2 comments2 min readLW link

[Question] Testing and contact tracing impact assessment model?

JustinShovelainApr 9, 2020, 5:42 PM

6 points

3 comments1 min readLW link

COVID-19: List of ideas to reduce the direct harm from the virus, with an emphasis on unusual ideas

JustinShovelainApr 9, 2020, 11:33 AM

30 points

12 comments7 min readLW link

Memetic downside risks: How ideas can evolve and cause harm

MichaelA, JustinShovelain and algekalipso

Feb 25, 2020, 7:47 PM

27 points

3 comments15 min readLW link

Information hazards: Why you should care and what you can do

MichaelA, JustinShovelain, David_Kristoffersson and algekalipso

Feb 23, 2020, 8:47 PM

18 points

4 comments15 min readLW link

Mapping downside risks and information hazards

MichaelA, JustinShovelain and David_Kristoffersson

Feb 20, 2020, 2:46 PM

23 points

0 comments9 min readLW link

Using vector fields to visualise preferences and make them consistent

MichaelA and JustinShovelain

Jan 28, 2020, 7:44 PM

42 points

32 comments11 min readLW link

AI alignment concepts: philosophical breakers, stoppers, and distorters

JustinShovelainJan 24, 2020, 7:23 PM

20 points

3 comments3 min readLW link