All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

A first success story for Outer Alignment: InstructGPT

Noosphere89Nov 8, 2022, 10:52 PM

6 points

1 comment1 min readLW link

(openai.com)

Trying Mastodon

jefftkNov 8, 2022, 7:10 PM

12 points

4 comments1 min readLW link

(www.jefftk.com)

Inverse scaling can become U-shaped

Edouard HarrisNov 8, 2022, 7:04 PM

27 points

15 comments1 min readLW link

(arxiv.org)

People care about each other even though they have imperfect motivational pointers?

TurnTroutNov 8, 2022, 6:15 PM

33 points

25 comments7 min readLW link

Applying superintelligence without collusion

Eric DrexlerNov 8, 2022, 6:08 PM

109 points

63 comments4 min readLW link

[Question] Binance is buying FTX.com: How did it happen and what are the implications?

CaeruleanNov 8, 2022, 5:14 PM

16 points

6 comments1 min readLW link

Some advice on independent research

Marius HobbhahnNov 8, 2022, 2:46 PM

56 points

5 comments10 min readLW link

Mysteries of mode collapse

janusNov 8, 2022, 10:37 AM

284 points

57 comments14 min readLW link 1 review

[ASoT] Thoughts on GPT-N

Ulisse MiniNov 8, 2022, 7:14 AM

8 points

0 comments1 min readLW link

Michael Simm—Introducing Myself

Michael SimmNov 8, 2022, 5:45 AM

4 points

0 comments2 min readLW link

EA & LW Forums Weekly Summary (31st Oct − 6th Nov 22′)

Zoe WilliamsNov 8, 2022, 3:58 AM

12 points

1 comment LW link

[Question] Value of Querying 100+ People About Humanity’s Future

T431Nov 8, 2022, 12:41 AM

9 points

3 comments2 min readLW link

How could we know that an AGI system will have good consequences?

So8resNov 7, 2022, 10:42 PM

111 points

25 comments5 min readLW link

A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)

Neel NandaNov 7, 2022, 10:39 PM

30 points

15 comments3 min readLW link

(youtu.be)

Intercept article about lab accidents

ChristianKlNov 7, 2022, 9:10 PM

23 points

9 comments1 min readLW link

(theintercept.com)

The biological function of love for non-kin is to gain the trust of people we cannot deceive

chaosmageNov 7, 2022, 8:26 PM

43 points

3 comments8 min readLW link

Distillation Experiment: Chunk-Knitting

DirectedEvolutionNov 7, 2022, 7:56 PM

10 points

3 comments6 min readLW link

Thinking About Mastodon

jefftkNov 7, 2022, 7:40 PM

33 points

17 comments1 min readLW link

(www.jefftk.com)

[Question] Ideas for tiny research projects related to rationality?

FrejNov 7, 2022, 6:45 PM

3 points

1 comment1 min readLW link

Loss of control of AI is not a likely source of AI x-risk

squekNov 7, 2022, 6:44 PM

−6 points

0 comments5 min readLW link

AI Safety Unconference NeurIPS 2022

OrpheusNov 7, 2022, 3:39 PM

25 points

0 comments LW link

(aisafetyevents.org)

Hacker-AI – Does it already exist?

Erland WittkotterNov 7, 2022, 2:01 PM

3 points

12 comments11 min readLW link

What’s the Deal with Elon Musk and Twitter?

ZviNov 7, 2022, 1:50 PM

60 points

13 comments31 min readLW link

(thezvi.wordpress.com)

How to Make Easy Decisions

lynettebyeNov 7, 2022, 1:17 PM

17 points

3 comments2 min readLW link

Opportunities that surprised us during our Clearer Thinking Regrants program

spencergNov 7, 2022, 1:09 PM

20 points

0 comments LW link

4 Key Assumptions in AI Safety

PrometheusNov 7, 2022, 10:50 AM

20 points

5 comments7 min readLW link

Google Search as a Washed Up Service Dog: “I HALP!”

ShmiNov 7, 2022, 7:02 AM

20 points

8 comments1 min readLW link

[Book Review] “Station Eleven” by Emily St. John Mandel

lsusrNov 7, 2022, 5:56 AM

17 points

1 comment1 min readLW link

Counterfactability

Scott GarrabrantNov 7, 2022, 5:39 AM

40 points

5 comments11 min readLW link

2022 LessWrong Census?

SurfingOrcaNov 7, 2022, 5:16 AM

67 points

13 comments1 min readLW link

A philosopher’s critique of RLHF

TW123Nov 7, 2022, 2:42 AM

55 points

8 comments2 min readLW link

[Question] Is there any discussion on avoiding being Dutch-booked or otherwise taken advantage of one’s bounded rationality by refusing to engage?

ShmiNov 7, 2022, 2:36 AM

38 points

29 comments1 min readLW link

Exams-Only Universities

Mati_RoyNov 6, 2022, 10:05 PM

80 points

40 comments2 min readLW link

Democracy Is in Danger, but Not for the Reasons You Think

ExCephNov 6, 2022, 9:15 PM

−7 points

4 comments12 min readLW link

(ginnungagapfoundation.wordpress.com)

Playground Game: Monster

jefftkNov 6, 2022, 4:00 PM

14 points

4 comments1 min readLW link

(www.jefftk.com)

[Question] Has Pascal’s Mugging problem been completely solved yet?

EniScienNov 6, 2022, 12:52 PM

3 points

11 comments1 min readLW link

[Question] Should I Pursue a PhD?

DragonGodNov 6, 2022, 10:58 AM

8 points

8 comments2 min readLW link

You won’t solve alignment without agent foundations

Mikhail SaminNov 6, 2022, 8:07 AM

27 points

3 comments8 min readLW link

Word-Distance vs Idea-Distance: The Case for Lanoitaring

SableNov 6, 2022, 5:25 AM

7 points

7 comments7 min readLW link

(affablyevil.substack.com)

Apple Cider Syrup

jefftkNov 6, 2022, 2:10 AM

11 points

6 comments1 min readLW link

(www.jefftk.com)

What is epigenetics?

MetacelsusNov 6, 2022, 1:24 AM

78 points

4 comments6 min readLW link

(denovo.substack.com)

Response

Jarred FilmerNov 6, 2022, 1:03 AM

29 points

2 comments12 min readLW link

[Question] Has anyone increased their AGI timelines?

Darren McKeeNov 6, 2022, 12:03 AM

39 points

12 comments1 min readLW link

Takeaways from a survey on AI alignment resources

DanielFilanNov 5, 2022, 11:40 PM

73 points

10 comments6 min readLW link 1 review

(danielfilan.com)

Unpricable Information and Certificate Hell

eva_Nov 5, 2022, 10:56 PM

13 points

2 comments6 min readLW link

Recommend HAIST resources for assessing the value of RLHF-related alignment research

Sam Marks and Xander Davies

Nov 5, 2022, 8:58 PM

26 points

9 comments3 min readLW link

Instead of technical research, more people should focus on buying time

Orpheus16, OliviaJ and Thomas Larsen

Nov 5, 2022, 8:43 PM

100 points

45 comments14 min readLW link

Provably Honest—A First Step

Srijanak DeNov 5, 2022, 7:18 PM

10 points

2 comments8 min readLW link

Should AI focus on problem-solving or strategic planning? Why not both?

Oliver SiegelNov 5, 2022, 7:17 PM

−12 points

3 comments LW link

How to store human values on a computer

Oliver SiegelNov 5, 2022, 7:17 PM

−12 points

17 comments LW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer