All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 131415 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Evolution is a bad analogy for AGI: inner alignment

Quintin PopeAug 13, 2022, 10:15 PM

79 points

15 comments8 min readLW link

An Uncanny Prison

Nathan1123Aug 13, 2022, 9:40 PM

3 points

3 comments2 min readLW link

Florida Elections

DoubleAug 13, 2022, 8:10 PM

−3 points

8 comments1 min readLW link

Cultivating Valiance

Shoshannah TekofskyAug 13, 2022, 6:47 PM

35 points

4 comments4 min readLW link

An extended rocket alignment analogy

rememberAug 13, 2022, 6:22 PM

28 points

3 comments4 min readLW link

[Question] The OpenAI playground for GPT-3 is a terrible interface. Is there any great local (or web) app for exploring/learning with language models?

avivAug 13, 2022, 4:34 PM

3 points

1 comment1 min readLW link

[Question] What is an agent in reductionist materialism?

ValentineAug 13, 2022, 3:39 PM

7 points

17 comments1 min readLW link

Refine’s First Blog Post Day

adamShimiAug 13, 2022, 10:23 AM

55 points

3 comments1 min readLW link

The Dumbest Possible Gets There First

ArtaxerxesAug 13, 2022, 10:20 AM

44 points

7 comments2 min readLW link

I missed the crux of the alignment problem the whole time

zeshenAug 13, 2022, 10:11 AM

53 points

7 comments3 min readLW link

Shapes of Mind and Pluralism in Alignment

adamShimiAug 13, 2022, 10:01 AM

33 points

2 comments2 min readLW link

How I think about alignment

Linda LinseforsAug 13, 2022, 10:01 AM

31 points

11 comments5 min readLW link

Steelmining via Analogy

Paul BricmanAug 13, 2022, 9:59 AM

24 points

0 comments2 min readLW link

(paulbricman.com)

Appendix: Jargon Dictionary

CFAR!DuncanAug 13, 2022, 8:09 AM

34 points

5 comments21 min readLW link

Appendix: Hamming Questions

CFAR!DuncanAug 13, 2022, 8:07 AM

41 points

0 comments2 min readLW link

Building a Bugs List prompts

CFAR!DuncanAug 13, 2022, 8:00 AM

69 points

9 comments2 min readLW link

Cambridge LW Meetup: Constructive Complaining

Tony WangAug 13, 2022, 4:52 AM

2 points

0 comments1 min readLW link

Gradient descent doesn’t select for inner search

Ivan VendrovAug 13, 2022, 4:15 AM

47 points

23 comments4 min readLW link

[Question] How to bet against civilizational adequacy?

Wei DaiAug 12, 2022, 11:33 PM

54 points

20 comments1 min readLW link

Infant AI Scenario

Nathan1123Aug 12, 2022, 9:20 PM

1 point

0 comments3 min readLW link

DeepMind alignment team opinions on AGI ruin arguments

VikaAug 12, 2022, 9:06 PM

395 points

37 comments14 min readLW link 1 review

Dissolve: The Petty Crimes of Blaise Pascal

SebastianG Aug 12, 2022, 8:04 PM

17 points

4 comments6 min readLW link

The Host Minds of HBO’s Westworld.

NerretAug 12, 2022, 6:53 PM

1 point

0 comments3 min readLW link

What is estimational programming? Squiggle in context

QuinnAug 12, 2022, 6:39 PM

14 points

7 comments7 min readLW link

Oversight Misses 100% of Thoughts The AI Does Not Think

johnswentworthAug 12, 2022, 4:30 PM

110 points

49 comments1 min readLW link

Timelines explanation post part 1 of ?

Nathan Helm-BurgerAug 12, 2022, 4:13 PM

10 points

1 comment2 min readLW link

A little playing around with Blenderbot3

Nathan Helm-BurgerAug 12, 2022, 4:06 PM

9 points

0 comments1 min readLW link

Refining the Sharp Left Turn threat model, part 1: claims and mechanisms

Vika, Vikrant Varma, Ramana Kumar and Mary Phuong

Aug 12, 2022, 3:17 PM

86 points

4 comments3 min readLW link 1 review

(vkrakovna.wordpress.com)

Argument by Intellectual Ordeal

lcAug 12, 2022, 1:03 PM

26 points

5 comments5 min readLW link

Anti-squatted AI x-risk domains index

plexAug 12, 2022, 12:01 PM

59 points

6 comments1 min readLW link

[Question] Perfect Predictors

aditya malikAug 12, 2022, 11:51 AM

2 points

5 comments1 min readLW link

[Question] What are some good arguments against building new nuclear power plants?

RomanSAug 12, 2022, 7:32 AM

16 points

15 comments2 min readLW link

Seeking PCK (Pedagogical Content Knowledge)

CFAR!DuncanAug 12, 2022, 4:15 AM

62 points

11 comments5 min readLW link

Artificial intelligence wireheading

Big TonyAug 12, 2022, 3:06 AM

5 points

2 comments1 min readLW link

Dissected boxed AI

Nathan1123Aug 12, 2022, 2:37 AM

−8 points

2 comments1 min readLW link

Troll Timers

ScrewtapeAug 12, 2022, 12:55 AM

29 points

13 comments4 min readLW link

[Question] Seriously, what goes wrong with “reward the agent when it makes you smile”?

TurnTroutAug 11, 2022, 10:22 PM

87 points

43 comments2 min readLW link

Encultured AI Pre-planning, Part 2: Providing a Service

Andrew_Critch and Nick Hay

Aug 11, 2022, 8:11 PM

33 points

4 comments3 min readLW link

My summary of the alignment problem

Peter HroššoAug 11, 2022, 7:42 PM

15 points

3 comments2 min readLW link

(threadreaderapp.com)

Language models seem to be much better than humans at next-token prediction

Buck, Fabien Roger and LawrenceC

Aug 11, 2022, 5:45 PM

182 points

60 comments13 min readLW link 1 review

Introducing Pastcasting: A tool for forecasting practice

Sage FutureAug 11, 2022, 5:38 PM

95 points

10 comments2 min readLW link 2 reviews

Pendulums, Policy-Level Decisionmaking, Saving State

CFAR!DuncanAug 11, 2022, 4:47 PM

30 points

3 comments8 min readLW link

Covid 8/11/22: The End Is Never The End

ZviAug 11, 2022, 4:20 PM

28 points

11 comments16 min readLW link

(thezvi.wordpress.com)

Singapore—Small casual dinner in Chinatown #4

Joe RoccaAug 11, 2022, 12:30 PM

3 points

3 comments1 min readLW link

Thoughts on the good regulator theorem

JonasMossAug 11, 2022, 12:08 PM

12 points

0 comments4 min readLW link

How and why to turn everything into audio

KatWoods and AmberDawn

Aug 11, 2022, 8:55 AM

55 points

20 comments5 min readLW link

Shard Theory: An Overview

David UdellAug 11, 2022, 5:44 AM

166 points

34 comments10 min readLW link

[Question] Do advancements in Decision Theory point towards moral absolutism?

Nathan1123Aug 11, 2022, 12:59 AM

0 points

4 comments4 min readLW link

The alignment problem from a deep learning perspective

Richard_NgoAug 10, 2022, 10:46 PM

107 points

15 comments27 min readLW link 1 review

How much alignment data will we need in the long run?

Jacob_HiltonAug 10, 2022, 9:39 PM

37 points

15 comments4 min readLW link