All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 192021 22 23 24 25 26 27 28 29 30

Let’s See You Write That Corrigibility Tag

Eliezer Yudkowsky19 Jun 2022 21:11 UTC

124 points

70 comments1 min readLW link

Half-baked alignment idea: training to generalize

Aaron Bergman19 Jun 2022 20:16 UTC

10 points

2 comments4 min readLW link

Where I agree and disagree with Eliezer

paulfchristiano19 Jun 2022 19:15 UTC

911 points

224 comments18 min readLW link 2 reviews

[Question] AI misalignment risk from GPT-like systems?

fiso6419 Jun 2022 17:35 UTC

10 points

8 comments1 min readLW link

[Link-post] On Deference and Yudkowsky’s AI Risk Estimates

bmg19 Jun 2022 17:25 UTC

29 points

8 comments1 min readLW link

Hebbian Learning Is More Common Than You Think

Aleksi Liimatainen19 Jun 2022 15:57 UTC

8 points

2 comments1 min readLW link

The Malthusian Trap: An Extremely Short Introduction

Davis Kedrosky19 Jun 2022 15:25 UTC

5 points

0 comments6 min readLW link

(daviskedrosky.substack.com)

Parliaments without the Parties

Yair Halberstadt19 Jun 2022 14:06 UTC

18 points

18 comments2 min readLW link

Lamda is not an LLM

Kevin19 Jun 2022 11:13 UTC

7 points

10 comments1 min readLW link

(www.wired.com)

[Linkpost] The importance of stupidity in scientific research

Pattern19 Jun 2022 5:17 UTC

17 points

1 comment1 min readLW link

(journals.biologists.com)

ETH is probably undervalued right now

mukashi19 Jun 2022 2:20 UTC

−7 points

22 comments1 min readLW link

Juneberry Cake

jefftk19 Jun 2022 1:40 UTC

29 points

0 comments1 min readLW link

(www.jefftk.com)

Agent level parallelism

Johannes C. Mayer18 Jun 2022 20:56 UTC

5 points

5 comments1 min readLW link

What are our outs to play to?

Hastings18 Jun 2022 19:32 UTC

7 points

0 comments2 min readLW link

[Question] What’s the information value of government hearings?

Kenny18 Jun 2022 17:13 UTC

6 points

4 comments2 min readLW link

The best ‘free solo’ (rock climbing) video

Kenny18 Jun 2022 15:29 UTC

14 points

4 comments2 min readLW link

[Question] What’s the name of this fallacy/reasoning antipattern?

David Gross18 Jun 2022 14:04 UTC

9 points

6 comments1 min readLW link

“Brain enthusiasts” in AI Safety

Jan and Samuel Nellessen

18 Jun 2022 9:59 UTC

64 points

5 comments10 min readLW link

(universalprior.substack.com)

To what extent have ideas and scientific discoveries gotten harder to find?

lsusr18 Jun 2022 7:15 UTC

33 points

10 comments6 min readLW link

[Question] What’s the goal in life?

Konstantin Weitz18 Jun 2022 6:09 UTC

5 points

6 comments1 min readLW link

Can DALL-E understand simple geometry?

Isaac King18 Jun 2022 4:37 UTC

25 points

2 comments1 min readLW link

Scott Aaronson is joining OpenAI to work on AI safety

peterbarnett18 Jun 2022 4:06 UTC

117 points

31 comments1 min readLW link

(scottaaronson.blog)

[Question] Why don’t we think we’re in the simplest universe with intelligent life?

ADifferentAnonymous18 Jun 2022 3:05 UTC

30 points

33 comments1 min readLW link

Do yourself a FAVAR: security mindset

lemonhope18 Jun 2022 2:08 UTC

20 points

2 comments2 min readLW link

Forecasting Fusion Power

Daniel Kokotajlo18 Jun 2022 0:04 UTC

29 points

8 comments1 min readLW link

(astralcodexten.substack.com)

Pivotal outcomes and pivotal processes

Andrew_Critch17 Jun 2022 23:43 UTC

97 points

31 comments4 min readLW link

Quantifying General Intelligence

JasonB17 Jun 2022 21:57 UTC

9 points

6 comments13 min readLW link

Apply for Productivity Coaching and AI Alignment Mentorship

Nick17 Jun 2022 21:36 UTC

12 points

1 comment1 min readLW link

Things That Make Me Enjoy Giving Career Advice

Neel Nanda17 Jun 2022 20:49 UTC

16 points

0 comments9 min readLW link

(www.neelnanda.io)

The Unified Theory of Normative Ethics

Thane Ruthenis17 Jun 2022 19:55 UTC

8 points

0 comments6 min readLW link

1689: Uncovering the World New Institutionalism Created

Davis Kedrosky17 Jun 2022 19:32 UTC

7 points

0 comments9 min readLW link

(daviskedrosky.substack.com)

[Question] Is there an unified way to make sense of ai failure modes?

walking_mushroom17 Jun 2022 18:00 UTC

3 points

1 comment1 min readLW link

In defense of flailing, with foreword by Bill Burr

lc17 Jun 2022 16:40 UTC

88 points

6 comments4 min readLW link

An Approach to Land Value Taxation

harsimony17 Jun 2022 15:53 UTC

4 points

12 comments4 min readLW link

(harsimony.wordpress.com)

Value extrapolation vs Wireheading

Stuart_Armstrong17 Jun 2022 15:02 UTC

16 points

1 comment1 min readLW link

#SAT with Tensor Networks

Adam Jermyn17 Jun 2022 13:20 UTC

4 points

0 comments2 min readLW link

Announcing the Clearer Thinking Regrants program

spencerg17 Jun 2022 13:14 UTC

36 points

1 comment1 min readLW link

Singapore—Small casual dinner in Chinatown #3: DALL-E 2 edition

Joe Rocca17 Jun 2022 8:32 UTC

2 points

2 comments1 min readLW link

[Question] Is civilizational alignment on the table?

Aleksi Liimatainen17 Jun 2022 8:27 UTC

5 points

1 comment1 min readLW link

Apply to the Machine Learning For Good bootcamp in France

Alexandre Variengien17 Jun 2022 7:32 UTC

10 points

0 comments1 min readLW link

What’s it like to have sex with Duncan?

Duncan Sabien (Inactive)17 Jun 2022 2:32 UTC

54 points

19 comments17 min readLW link

wrapper-minds are the enemy

nostalgebraist17 Jun 2022 1:58 UTC

108 points

43 comments8 min readLW link

A Litany Missing from the Canon

benwr17 Jun 2022 1:39 UTC

39 points

3 comments1 min readLW link

(www.benwr.net)

[Question] Why did Russia invade Ukraine?

bohaska17 Jun 2022 1:36 UTC

0 points

5 comments1 min readLW link

A transparency and interpretability tech tree

evhub16 Jun 2022 23:44 UTC

163 points

11 comments18 min readLW link 1 review

BBC Future covers progress studies

jasoncrawford16 Jun 2022 22:44 UTC

21 points

6 comments3 min readLW link

(rootsofprogress.org)

Humans are very reliable agents

alyssavance16 Jun 2022 22:02 UTC

270 points

35 comments3 min readLW link

Towards Gears-Level Understanding of Agency

Thane Ruthenis16 Jun 2022 22:00 UTC

25 points

4 comments18 min readLW link

A possible AI-inoculation due to early “robot uprising”

Shmi16 Jun 2022 21:21 UTC

16 points

2 comments1 min readLW link

AI Risk, as Seen on Snapchat

dkirmani16 Jun 2022 19:31 UTC

23 points

8 comments1 min readLW link