Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
“Rudeness”, a useful coordination mechanic
Raemon
Nov 11, 2022, 10:27 PM
49
points
20
comments
2
min read
LW
link
Internalizing the damage of bad-acting partners creates incentives for due diligence
tailcalled
Nov 11, 2022, 8:57 PM
17
points
7
comments
1
min read
LW
link
Speculation on Current Opportunities for Unusually High Impact in Global Health
johnswentworth
Nov 11, 2022, 8:47 PM
114
points
31
comments
4
min read
LW
link
[Question]
Is acausal extortion possible?
sisyphus
Nov 11, 2022, 7:48 PM
−20
points
35
comments
3
min read
LW
link
Catharsis in Bb
jefftk
Nov 11, 2022, 5:40 PM
6
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Instrumental convergence is what makes general intelligence possible
tailcalled
Nov 11, 2022, 4:38 PM
105
points
11
comments
4
min read
LW
link
Weekly Roundup #5
Zvi
Nov 11, 2022, 4:20 PM
33
points
0
comments
6
min read
LW
link
(thezvi.wordpress.com)
Charging for the Dharma
jchan
Nov 11, 2022, 2:02 PM
32
points
18
comments
5
min read
LW
link
[Question]
EA (& AI Safety) has overestimated its projected funding — which decisions must be revised?
Cleo Nardo
Nov 11, 2022, 1:50 PM
22
points
7
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Where the logical fallacy is not (Generalization From Fictional Evidence)
banev
Nov 11, 2022, 10:41 AM
−12
points
14
comments
1
min read
LW
link
Why I’m Working On Model Agnostic Interpretability
Jessica Rumbelow
Nov 11, 2022, 9:24 AM
27
points
9
comments
2
min read
LW
link
How likely are malign priors over objectives? [aborted WIP]
David Johnston
Nov 11, 2022, 5:36 AM
−1
points
0
comments
8
min read
LW
link
Do Timeless Decision Theorists reject all blackmail from other Timeless Decision Theorists?
myren
Nov 11, 2022, 12:38 AM
7
points
8
comments
3
min read
LW
link
We must be very clear: fraud in the service of effective altruism is unacceptable
evhub
Nov 10, 2022, 11:31 PM
42
points
56
comments
LW
link
[simulation] 4chan user claiming to be the attorney hired by Google’s sentient chatbot LaMDA shares wild details of encounter
janus
Nov 10, 2022, 9:39 PM
19
points
1
comment
13
min read
LW
link
(generative.ink)
divine carrot
Alok Singh
Nov 10, 2022, 8:50 PM
18
points
2
comments
1
min read
LW
link
(alok.github.io)
Metaculus Announces The Million Predictions Hackathon
ChristianWilliams
Nov 10, 2022, 8:00 PM
7
points
0
comments
LW
link
The harnessing of complexity
geduardo
Nov 10, 2022, 6:44 PM
6
points
2
comments
3
min read
LW
link
[Question]
I there a demo of “You can’t fetch the coffee if you’re dead”?
Ram Rachum
Nov 10, 2022, 6:41 PM
8
points
9
comments
1
min read
LW
link
Mastodon Linking Norms
jefftk
Nov 10, 2022, 3:10 PM
9
points
9
comments
2
min read
LW
link
(www.jefftk.com)
Covid 11/10/22: Into the Background
Zvi
Nov 10, 2022, 1:40 PM
31
points
5
comments
4
min read
LW
link
(thezvi.wordpress.com)
LessWrong Poll on AGI
Niclas Kupper
Nov 10, 2022, 1:13 PM
12
points
6
comments
1
min read
LW
link
The optimal angle for a solar boiler is different than for a solar panel
Yair Halberstadt
Nov 10, 2022, 10:32 AM
42
points
4
comments
2
min read
LW
link
What it’s like to dissect a cadaver
Alok Singh
Nov 10, 2022, 6:40 AM
208
points
24
comments
5
min read
LW
link
(alok.github.io)
I Converted Book I of The Sequences Into A Zoomer-Readable Format
dkirmani
Nov 10, 2022, 2:59 AM
200
points
32
comments
2
min read
LW
link
Adversarial Priors: Not Paying People to Lie to You
eva_
Nov 10, 2022, 2:29 AM
22
points
9
comments
3
min read
LW
link
Is full self-driving an AGI-complete problem?
kraemahz
Nov 10, 2022, 2:04 AM
10
points
5
comments
1
min read
LW
link
[Question]
What are examples of problems that were caused by intelligence, that couldn’t be solved with intelligence?
Peter O'Malley
Nov 10, 2022, 2:04 AM
1
point
2
comments
1
min read
LW
link
Desiderata for an Adversarial Prior
Shmi
Nov 9, 2022, 11:45 PM
13
points
2
comments
1
min read
LW
link
Chord Notation
jefftk
Nov 9, 2022, 9:30 PM
12
points
5
comments
1
min read
LW
link
(www.jefftk.com)
[ASoT] Instrumental convergence is useful
Ulisse Mini
Nov 9, 2022, 8:20 PM
5
points
9
comments
1
min read
LW
link
Mesatranslation and Metatranslation
jdp
Nov 9, 2022, 6:46 PM
25
points
4
comments
11
min read
LW
link
Trying to Make a Treacherous Mesa-Optimizer
MadHatter
Nov 9, 2022, 6:07 PM
95
points
14
comments
4
min read
LW
link
(attentionspan.blog)
A caveat to the Orthogonality Thesis
Wuschel Schulz
Nov 9, 2022, 3:06 PM
38
points
10
comments
2
min read
LW
link
Wednesday South Bay Meetups, November 16
Leonard Zabarsky
Nov 9, 2022, 2:21 AM
1
point
0
comments
1
min read
LW
link
FTX will probably be sold at a steep discount. What we know and some forecasts on what will happen next
Nathan Young
Nov 9, 2022, 2:14 AM
60
points
21
comments
LW
link
A first success story for Outer Alignment: InstructGPT
Noosphere89
Nov 8, 2022, 10:52 PM
6
points
1
comment
1
min read
LW
link
(openai.com)
Trying Mastodon
jefftk
Nov 8, 2022, 7:10 PM
12
points
4
comments
1
min read
LW
link
(www.jefftk.com)
Inverse scaling can become U-shaped
Edouard Harris
Nov 8, 2022, 7:04 PM
27
points
15
comments
1
min read
LW
link
(arxiv.org)
People care about each other even though they have imperfect motivational pointers?
TurnTrout
Nov 8, 2022, 6:15 PM
33
points
25
comments
7
min read
LW
link
Applying superintelligence without collusion
Eric Drexler
Nov 8, 2022, 6:08 PM
109
points
63
comments
4
min read
LW
link
[Question]
Binance is buying FTX.com: How did it happen and what are the implications?
Caerulean
Nov 8, 2022, 5:14 PM
16
points
6
comments
1
min read
LW
link
Some advice on independent research
Marius Hobbhahn
Nov 8, 2022, 2:46 PM
56
points
5
comments
10
min read
LW
link
Mysteries of mode collapse
janus
Nov 8, 2022, 10:37 AM
284
points
57
comments
14
min read
LW
link
1
review
[ASoT] Thoughts on GPT-N
Ulisse Mini
Nov 8, 2022, 7:14 AM
8
points
0
comments
1
min read
LW
link
Michael Simm—Introducing Myself
Michael Simm
Nov 8, 2022, 5:45 AM
4
points
0
comments
2
min read
LW
link
EA & LW Forums Weekly Summary (31st Oct − 6th Nov 22′)
Zoe Williams
Nov 8, 2022, 3:58 AM
12
points
1
comment
LW
link
[Question]
Value of Querying 100+ People About Humanity’s Future
T431
Nov 8, 2022, 12:41 AM
9
points
3
comments
2
min read
LW
link
How could we know that an AGI system will have good consequences?
So8res
Nov 7, 2022, 10:42 PM
111
points
25
comments
5
min read
LW
link
A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)
Neel Nanda
Nov 7, 2022, 10:39 PM
30
points
15
comments
3
min read
LW
link
(youtu.be)
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel