Archive: July 2022, page 2
Anthropic’s SoLU (Softmax Linear Unit) · Joel Burget · Jul 4, 2022, 6:38 PM · 21 points · 1 comment · 4 min read · LW link (transformer-circuits.pub)
Book Review: The Righteous Mind · ErnestScribbler · Jul 4, 2022, 5:45 PM · 34 points · 8 comments · 35 min read · LW link
My Most Likely Reason to Die Young is AI X-Risk · AISafetyIsNotLongtermist · Jul 4, 2022, 5:08 PM · 61 points · 24 comments · 4 min read · LW link (forum.effectivealtruism.org)
Is General Intelligence “Compact”? · DragonGod · Jul 4, 2022, 1:27 PM · 27 points · 6 comments · 22 min read · LW link
Remaking EfficientZero (as best I can) · Hoagy · Jul 4, 2022, 11:03 AM · 36 points · 9 comments · 22 min read · LW link
We Need a Consolidated List of Bad AI Alignment Solutions · Double · Jul 4, 2022, 6:54 AM · 9 points · 14 comments · 1 min read · LW link
AI Forecasting: One Year In · jsteinhardt · Jul 4, 2022, 5:10 AM · 132 points · 12 comments · 6 min read · LW link (bounded-regret.ghost.io)
A compressed take on recent disagreements · kman · Jul 4, 2022, 4:39 AM · 33 points · 9 comments · 1 min read · LW link
New US Senate Bill on X-Risk Mitigation [Linkpost] · Evan R. Murphy · Jul 4, 2022, 1:25 AM · 35 points · 12 comments · 1 min read · LW link (www.hsgac.senate.gov)
Monthly Shorts 6/22 · Celer · Jul 3, 2022, 11:40 PM · 5 points · 2 comments · 5 min read · LW link (keller.substack.com)
Decision theory and dynamic inconsistency · paulfchristiano · Jul 3, 2022, 10:20 PM · 80 points · 33 comments · 10 min read · LW link (sideways-view.com)
Five routes of access to scientific literature · DirectedEvolution · Jul 3, 2022, 8:53 PM · 13 points · 4 comments · 6 min read · LW link
Toni Kurz and the Insanity of Climbing Mountains · GeneSmith · Jul 3, 2022, 8:51 PM · 271 points · 67 comments · 11 min read · LW link · 2 reviews
Wonder and The Golden AI Rule · JeffreyK · Jul 3, 2022, 6:21 PM · 0 points · 4 comments · 6 min read · LW link
Nature abhors an immutable replicator… usually · MSRayne · Jul 3, 2022, 3:08 PM · 28 points · 10 comments · 3 min read · LW link
Post hoc justifications as Compression Algorithm · Johannes C. Mayer · Jul 3, 2022, 5:02 AM · 8 points · 0 comments · 1 min read · LW link
SOMA—A story about Consciousness · Johannes C. Mayer · Jul 3, 2022, 4:46 AM · 10 points · 0 comments · 1 min read · LW link (www.youtube.com)
Sexual self-acceptance · Johannes C. Mayer · Jul 3, 2022, 4:26 AM · 11 points · 6 comments · 1 min read · LW link
Donohue, Levitt, Roe, and Wade: T-minus 20 years to a massive crime wave? · Paul Logan · Jul 3, 2022, 3:03 AM · −24 points · 6 comments · 3 min read · LW link (laulpogan.substack.com)
Can we achieve AGI Alignment by balancing multiple human objectives? · Ben Smith · Jul 3, 2022, 2:51 AM · 11 points · 1 comment · 4 min read · LW link
Trigger-Action Planning · CFAR!Duncan · Jul 3, 2022, 1:42 AM · 90 points · 14 comments · 13 min read · LW link · 2 reviews
[Question] Which one of these two academic routes should I take to end up in AI Safety? · Martín Soto · Jul 3, 2022, 1:05 AM · 5 points · 2 comments · 1 min read · LW link
Naive Hypotheses on AI Alignment · Shoshannah Tekofsky · Jul 2, 2022, 7:03 PM · 98 points · 29 comments · 5 min read · LW link
The Tree of Life: Stanford AI Alignment Theory of Change · Gabe M · Jul 2, 2022, 6:36 PM · 25 points · 0 comments · 14 min read · LW link
Follow along with Columbia EA’s Advanced AI Safety Fellowship! · RohanS · Jul 2, 2022, 5:45 PM · 3 points · 0 comments · 2 min read · LW link (forum.effectivealtruism.org)
Welcome to Analogia! (Chapter 7) · Justin Bullock · Jul 2, 2022, 5:04 PM · 5 points · 0 comments · 11 min read · LW link
[Question] What about transhumans and beyond? · AlignmentMirror · Jul 2, 2022, 1:58 PM · 7 points · 6 comments · 1 min read · LW link
Goal-directedness: tackling complexity · Morgan_Rogers · Jul 2, 2022, 1:51 PM · 8 points · 0 comments · 38 min read · LW link
Literature recommendations July 2022 · ChristianKl · Jul 2, 2022, 9:14 AM · 17 points · 9 comments · 1 min read · LW link
Deontological Evil · lsusr · Jul 2, 2022, 6:57 AM · 45 points · 4 comments · 2 min read · LW link
Could an AI Alignment Sandbox be useful? · Michael Soareverix · Jul 2, 2022, 5:06 AM · 2 points · 1 comment · 1 min read · LW link
Five views of Bayes’ Theorem · Adam Scherlis · Jul 2, 2022, 2:25 AM · 38 points · 4 comments · 1 min read · LW link
[Linkpost] Existential Risk Analysis in Empirical Research Papers · Dan H · Jul 2, 2022, 12:09 AM · 40 points · 0 comments · 1 min read · LW link (arxiv.org)
Agenty AGI – How Tempting? · PeterMcCluskey · Jul 1, 2022, 11:40 PM · 22 points · 3 comments · 5 min read · LW link (www.bayesianinvestor.com)
AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving · DanielFilan · Jul 1, 2022, 10:20 PM · 20 points · 0 comments · 37 min read · LW link
[Question] Examples of practical implications of Judea Pearl’s Causality work · ChristianKl · Jul 1, 2022, 8:58 PM · 23 points · 6 comments · 1 min read · LW link
Minerva · Algon · Jul 1, 2022, 8:06 PM · 36 points · 6 comments · 2 min read · LW link (ai.googleblog.com)
Disarming status · sano · Jul 1, 2022, 8:00 PM · −4 points · 1 comment · 6 min read · LW link
Paper: Forecasting world events with neural nets · Owain_Evans, Dan H and Joe Kwon · Jul 1, 2022, 7:40 PM · 39 points · 3 comments · 4 min read · LW link
Reframing the AI Risk · Thane Ruthenis · Jul 1, 2022, 6:44 PM · 26 points · 7 comments · 6 min read · LW link
Who is this MSRayne person anyway? · MSRayne · Jul 1, 2022, 5:32 PM · 32 points · 30 comments · 11 min read · LW link
Limerence Messes Up Your Rationality Real Bad, Yo · Raemon · Jul 1, 2022, 4:53 PM · 128 points · 41 comments · 3 min read · LW link · 2 reviews
[Link] On the paradox of tolerance in relation to fascism and online content moderation – Unstable Ontology · Kenny · Jul 1, 2022, 4:43 PM · 5 points · 0 comments · 1 min read · LW link
Trends in GPU price-performance · Marius Hobbhahn and Tamay · Jul 1, 2022, 3:51 PM · 85 points · 13 comments · 1 min read · LW link (epochai.org) · 1 review
[Question] How to deal with non-schedulable one-off stimulus-response-pair-like situations when planning/organising projects? · mikbp · Jul 1, 2022, 3:22 PM · 2 points · 3 comments · 1 min read · LW link
What Is The True Name of Modularity? · CallumMcDougall, Lucius Bushnaq and Avery · Jul 1, 2022, 2:55 PM · 39 points · 10 comments · 12 min read · LW link
Defining Optimization in a Deeper Way Part 1 · J Bostock · Jul 1, 2022, 2:03 PM · 7 points · 0 comments · 2 min read · LW link
Safetywashing · Adam Scholl · Jul 1, 2022, 11:56 AM · 261 points · 20 comments · 1 min read · LW link · 2 reviews
[Question] AGI alignment with what? · AlignmentMirror · Jul 1, 2022, 10:22 AM · 6 points · 10 comments · 1 min read · LW link
Open & Welcome Thread—July 2022 · Kaj_Sotala · Jul 1, 2022, 7:47 AM · 20 points · 61 comments · 1 min read · LW link