All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 212223 24 25 26 27 28 29 30

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

elspoodJun 21, 2022, 11:55 PM

362 points

42 comments7 min readLW link 1 review

A Quick List of Some Problems in AI Alignment As A Field

Nicholas / Heather KrossJun 21, 2022, 11:23 PM

75 points

12 comments6 min readLW link

(www.thinkingmuchbetter.com)

[Question] What is the difference between AI misalignment and bad programming?

puzzleGuzzleJun 21, 2022, 9:52 PM

6 points

2 comments1 min readLW link

What I mean by the phrase “getting intimate with reality”

LuiseJun 21, 2022, 7:42 PM

6 points

0 comments2 min readLW link

(forum.effectivealtruism.org)

What I mean by the phrase “taking ideas seriously”

LuiseJun 21, 2022, 7:42 PM

5 points

2 comments1 min readLW link

(forum.effectivealtruism.org)

Hydrophobic Glasses Coating Review

jefftkJun 21, 2022, 6:00 PM

16 points

6 comments1 min readLW link

(www.jefftk.com)

Progress links and tweets, 2022-06-20

jasoncrawfordJun 21, 2022, 5:12 PM

12 points

2 comments1 min readLW link

(rootsofprogress.org)

Debating Whether AI is Conscious Is A Distraction from Real Problems

sidhe_theyJun 21, 2022, 4:56 PM

2 points

10 comments1 min readLW link

(techpolicy.press)

Mitigating the damage from unaligned ASI by cooperating with aliens that don’t exist yet

MSRayneJun 21, 2022, 4:12 PM

−8 points

7 comments6 min readLW link

The inordinately slow spread of good AGI conversations in ML

Rob BensingerJun 21, 2022, 4:09 PM

173 points

62 comments8 min readLW link

Getting from an unaligned AGI to an aligned AGI?

Tor Økland BarstadJun 21, 2022, 12:36 PM

13 points

7 comments9 min readLW link

Common but neglected risk factors that may let you get Paxlovid

DirectedEvolutionJun 21, 2022, 7:34 AM

29 points

8 comments4 min readLW link

Dagger of Detect Evil

lsusrJun 21, 2022, 6:23 AM

45 points

22 comments3 min readLW link

[Question] How easy/fast is it for a AGI to hack computers/a human brain?

Noosphere89Jun 21, 2022, 12:34 AM

0 points

1 comment1 min readLW link

[Question] What is the most probable AI?

Zeruel017Jun 20, 2022, 11:26 PM

−2 points

0 comments3 min readLW link

Evaluating a Corsi-Rosenthal Filter Cube

jefftkJun 20, 2022, 7:40 PM

13 points

4 comments1 min readLW link

(www.jefftk.com)

Survey re AIS/LTism office in NYC

RyanCareyJun 20, 2022, 7:21 PM

7 points

0 comments1 min readLW link

Is This Thing Sentient, Y/N?

Thane RuthenisJun 20, 2022, 6:37 PM

4 points

10 comments7 min readLW link

Steam

abramdemskiJun 20, 2022, 5:38 PM

149 points

13 comments5 min readLW link 1 review

Parable: The Bomb that doesn’t Explode

Lone PineJun 20, 2022, 4:41 PM

14 points

5 comments2 min readLW link

On corrigibility and its basin

Donald HobsonJun 20, 2022, 4:33 PM

16 points

3 comments2 min readLW link

Announcing the DWATV Discord

ZviJun 20, 2022, 3:50 PM

20 points

9 comments1 min readLW link

(thezvi.wordpress.com)

Key Papers in Language Model Safety

aogJun 20, 2022, 3:00 PM

40 points

1 comment22 min readLW link

Relationship Advice Repository

RubyJun 20, 2022, 2:39 PM

109 points

36 comments38 min readLW link

Adaptation Executors and the Telos Margin

PlinthistJun 20, 2022, 1:06 PM

2 points

8 comments5 min readLW link

Are we there yet?

theflowerpotJun 20, 2022, 11:19 AM

2 points

2 comments1 min readLW link

Causal confusion as an argument against the scaling hypothesis

RobertKirk and David Scott Krueger (formerly: capybaralet)

Jun 20, 2022, 10:54 AM

86 points

30 comments15 min readLW link

An AI defense-offense symmetry thesis

Chris van MerwijkJun 20, 2022, 10:01 AM

10 points

9 comments3 min readLW link

Let’s See You Write That Corrigibility Tag

Eliezer YudkowskyJun 19, 2022, 9:11 PM

125 points

70 comments1 min readLW link

Half-baked alignment idea: training to generalize

Aaron BergmanJun 19, 2022, 8:16 PM

10 points

2 comments4 min readLW link

Where I agree and disagree with Eliezer

paulfchristianoJun 19, 2022, 7:15 PM

901 points

224 comments18 min readLW link 2 reviews

[Question] AI misalignment risk from GPT-like systems?

fiso64Jun 19, 2022, 5:35 PM

10 points

8 comments1 min readLW link

[Link-post] On Deference and Yudkowsky’s AI Risk Estimates

bmgJun 19, 2022, 5:25 PM

29 points

8 comments1 min readLW link

Hebbian Learning Is More Common Than You Think

Aleksi LiimatainenJun 19, 2022, 3:57 PM

8 points

2 comments1 min readLW link

The Malthusian Trap: An Extremely Short Introduction

Davis KedroskyJun 19, 2022, 3:25 PM

5 points

0 comments6 min readLW link

(daviskedrosky.substack.com)

Parliaments without the Parties

Yair HalberstadtJun 19, 2022, 2:06 PM

18 points

18 comments2 min readLW link

Lamda is not an LLM

KevinJun 19, 2022, 11:13 AM

7 points

10 comments1 min readLW link

(www.wired.com)

Getting stuck in local minima

louis030195Jun 19, 2022, 8:50 AM

3 points

1 comment1 min readLW link

(brain.louis030195.com)

[Linkpost] The importance of stupidity in scientific research

PatternJun 19, 2022, 5:17 AM

17 points

1 comment1 min readLW link

(journals.biologists.com)

ETH is probably undervalued right now

mukashiJun 19, 2022, 2:20 AM

−7 points

22 comments1 min readLW link

Juneberry Cake

jefftkJun 19, 2022, 1:40 AM

29 points

0 comments1 min readLW link

(www.jefftk.com)

Agent level parallelism

Johannes C. MayerJun 18, 2022, 8:56 PM

5 points

5 comments1 min readLW link

What are our outs to play to?

HastingsJun 18, 2022, 7:32 PM

7 points

0 comments2 min readLW link

[Question] What’s the information value of government hearings?

KennyJun 18, 2022, 5:13 PM

6 points

4 comments2 min readLW link

The best ‘free solo’ (rock climbing) video

KennyJun 18, 2022, 3:29 PM

14 points

4 comments2 min readLW link

[Question] What’s the name of this fallacy/reasoning antipattern?

David GrossJun 18, 2022, 2:04 PM

9 points

6 comments1 min readLW link

“Brain enthusiasts” in AI Safety

Jan and Samuel Nellessen

Jun 18, 2022, 9:59 AM

63 points

5 comments10 min readLW link

(universalprior.substack.com)

To what extent have ideas and scientific discoveries gotten harder to find?

lsusrJun 18, 2022, 7:15 AM

33 points

10 comments6 min readLW link

[Question] What’s the goal in life?

Konstantin WeitzJun 18, 2022, 6:09 AM

5 points

6 comments1 min readLW link

Can DALL-E understand simple geometry?

Isaac KingJun 18, 2022, 4:37 AM

25 points

2 comments1 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer