Page 2
[Question] Should you write under a blog or your own name? (Dalton Mabery, Jul 6, 2022, 3:26 PM) · 2 points · 2 comments · 1 min read · LW link
Carrying the Torch: A Response to Anna Salamon by the Guild of the Rose (moridinamael, Jul 6, 2022, 2:20 PM) · 136 points · 16 comments · 6 min read · LW link
Predicting Parental Emotional Changes? (jefftk, Jul 6, 2022, 1:50 PM) · 39 points · 11 comments · 2 min read · LW link (www.jefftk.com)
Berlin AI Safety Open Meetup July 2022 (pranomostro, Jul 6, 2022, 12:41 PM) · 6 points · 0 comments · 1 min read · LW link
Forecasting Through Fiction (Yitz, Jul 6, 2022, 5:03 AM) · 5 points · 2 comments · 8 min read · LW link
Introducing the Fund for Alignment Research (We’re Hiring!) (AdamGleave, Scott Emmons, Ethan Perez and Claudia Shi, Jul 6, 2022, 2:07 AM) · 62 points · 0 comments · 4 min read · LW link
My vision of a good future, part I (Jeffrey Ladish, Jul 6, 2022, 1:23 AM) · 66 points · 18 comments · 9 min read · LW link
Imperial Russia was doing fine without the Soviets (Davis Kedrosky, Jul 5, 2022, 10:24 PM) · 6 points · 3 comments · 14 min read · LW link (daviskedrosky.substack.com)
A Pattern Language For Rationality (Vaniver, Jul 5, 2022, 7:08 PM) · 75 points · 14 comments · 15 min read · LW link
How to destroy the universe with a hypercomputer (Trevor Cappallo, Jul 5, 2022, 7:05 PM) · 2 points · 3 comments · 1 min read · LW link
The curious case of Pretty Good human inner/outer alignment (PavleMiha, Jul 5, 2022, 7:04 PM) · 41 points · 45 comments · 4 min read · LW link
When is it appropriate to use statistical models and probabilities for decision making? (Younes Kamel, Jul 5, 2022, 12:34 PM) · 10 points · 7 comments · 4 min read · LW link (youneskamel.substack.com)
Goal Factoring (CFAR!Duncan, Jul 5, 2022, 7:10 AM) · 92 points · 2 comments · 8 min read · LW link
Assorted thoughts about abstraction (Adam Zerner, Jul 5, 2022, 6:40 AM) · 16 points · 9 comments · 7 min read · LW link
[AN #172] Sorry for the long hiatus! (Rohin Shah, Jul 5, 2022, 6:20 AM) · 54 points · 0 comments · 3 min read · LW link (mailchi.mp)
Outline: The Rectifying of Maps (hamnox, Jul 5, 2022, 5:14 AM) · 7 points · 0 comments · 2 min read · LW link
[Question] Seeking opinions on the current and forward state of cryptocurrencies. (jmh, Jul 5, 2022, 5:01 AM) · 6 points · 6 comments · 1 min read · LW link
ITT-passing and civility are good; “charity” is bad; steelmanning is niche (Rob Bensinger, Jul 5, 2022, 12:15 AM) · 163 points · 36 comments · 6 min read · LW link · 1 review
Please help us communicate AI xrisk. It could save the world. (otto.barten, Jul 4, 2022, 9:47 PM) · 4 points · 7 comments · 2 min read · LW link
Benchmark for successful concept extrapolation/avoiding goal misgeneralization (Stuart_Armstrong, Jul 4, 2022, 8:48 PM) · 83 points · 12 comments · 4 min read · LW link
Procedural Executive Function, Part 1 (DaystarEld, Jul 4, 2022, 6:51 PM) · 52 points · 8 comments · 14 min read · LW link (daystareld.com)
Anthropic’s SoLU (Softmax Linear Unit) (Joel Burget, Jul 4, 2022, 6:38 PM) · 21 points · 1 comment · 4 min read · LW link (transformer-circuits.pub)
Book Review: The Righteous Mind (ErnestScribbler, Jul 4, 2022, 5:45 PM) · 34 points · 8 comments · 35 min read · LW link
My Most Likely Reason to Die Young is AI X-Risk (AISafetyIsNotLongtermist, Jul 4, 2022, 5:08 PM) · 61 points · 24 comments · 4 min read · LW link (forum.effectivealtruism.org)
Is General Intelligence “Compact”? (DragonGod, Jul 4, 2022, 1:27 PM) · 27 points · 6 comments · 22 min read · LW link
Remaking EfficientZero (as best I can) (Hoagy, Jul 4, 2022, 11:03 AM) · 36 points · 9 comments · 22 min read · LW link
We Need a Consolidated List of Bad AI Alignment Solutions (Double, Jul 4, 2022, 6:54 AM) · 9 points · 14 comments · 1 min read · LW link
AI Forecasting: One Year In (jsteinhardt, Jul 4, 2022, 5:10 AM) · 132 points · 12 comments · 6 min read · LW link (bounded-regret.ghost.io)
A compressed take on recent disagreements (kman, Jul 4, 2022, 4:39 AM) · 33 points · 9 comments · 1 min read · LW link
New US Senate Bill on X-Risk Mitigation [Linkpost] (Evan R. Murphy, Jul 4, 2022, 1:25 AM) · 35 points · 12 comments · 1 min read · LW link (www.hsgac.senate.gov)
Monthly Shorts 6/22 (Celer, Jul 3, 2022, 11:40 PM) · 5 points · 2 comments · 5 min read · LW link (keller.substack.com)
Decision theory and dynamic inconsistency (paulfchristiano, Jul 3, 2022, 10:20 PM) · 80 points · 33 comments · 10 min read · LW link (sideways-view.com)
Five routes of access to scientific literature (DirectedEvolution, Jul 3, 2022, 8:53 PM) · 13 points · 4 comments · 6 min read · LW link
Toni Kurz and the Insanity of Climbing Mountains (GeneSmith, Jul 3, 2022, 8:51 PM) · 271 points · 67 comments · 11 min read · LW link · 2 reviews
Wonder and The Golden AI Rule (JeffreyK, Jul 3, 2022, 6:21 PM) · 0 points · 4 comments · 6 min read · LW link
Nature abhors an immutable replicator… usually (MSRayne, Jul 3, 2022, 3:08 PM) · 28 points · 10 comments · 3 min read · LW link
Post hoc justifications as Compression Algorithm (Johannes C. Mayer, Jul 3, 2022, 5:02 AM) · 8 points · 0 comments · 1 min read · LW link
SOMA—A story about Consciousness (Johannes C. Mayer, Jul 3, 2022, 4:46 AM) · 10 points · 0 comments · 1 min read · LW link (www.youtube.com)
Sexual self-acceptance (Johannes C. Mayer, Jul 3, 2022, 4:26 AM) · 11 points · 6 comments · 1 min read · LW link
Donohue, Levitt, Roe, and Wade: T-minus 20 years to a massive crime wave? (Paul Logan, Jul 3, 2022, 3:03 AM) · −24 points · 6 comments · 3 min read · LW link (laulpogan.substack.com)
Can we achieve AGI Alignment by balancing multiple human objectives? (Ben Smith, Jul 3, 2022, 2:51 AM) · 11 points · 1 comment · 4 min read · LW link
Trigger-Action Planning (CFAR!Duncan, Jul 3, 2022, 1:42 AM) · 90 points · 14 comments · 13 min read · LW link · 2 reviews
[Question] Which one of these two academic routes should I take to end up in AI Safety? (Martín Soto, Jul 3, 2022, 1:05 AM) · 5 points · 2 comments · 1 min read · LW link
Naive Hypotheses on AI Alignment (Shoshannah Tekofsky, Jul 2, 2022, 7:03 PM) · 98 points · 29 comments · 5 min read · LW link
The Tree of Life: Stanford AI Alignment Theory of Change (Gabe M, Jul 2, 2022, 6:36 PM) · 25 points · 0 comments · 14 min read · LW link
Follow along with Columbia EA’s Advanced AI Safety Fellowship! (RohanS, Jul 2, 2022, 5:45 PM) · 3 points · 0 comments · 2 min read · LW link (forum.effectivealtruism.org)
Welcome to Analogia! (Chapter 7) (Justin Bullock, Jul 2, 2022, 5:04 PM) · 5 points · 0 comments · 11 min read · LW link
[Question] What about transhumans and beyond? (AlignmentMirror, Jul 2, 2022, 1:58 PM) · 7 points · 6 comments · 1 min read · LW link
Goal-directedness: tackling complexity (Morgan_Rogers, Jul 2, 2022, 1:51 PM) · 8 points · 0 comments · 38 min read · LW link
Literature recommendations July 2022 (ChristianKl, Jul 2, 2022, 9:14 AM) · 17 points · 9 comments · 1 min read · LW link