Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
Desiderata for an Adversarial Prior
Shmi
Nov 9, 2022, 11:45 PM
13
points
2
comments
1
min read
LW
link
Chord Notation
jefftk
Nov 9, 2022, 9:30 PM
12
points
5
comments
1
min read
LW
link
(www.jefftk.com)
[ASoT] Instrumental convergence is useful
Ulisse Mini
Nov 9, 2022, 8:20 PM
5
points
9
comments
1
min read
LW
link
Mesatranslation and Metatranslation
jdp
Nov 9, 2022, 6:46 PM
25
points
4
comments
11
min read
LW
link
Trying to Make a Treacherous Mesa-Optimizer
MadHatter
Nov 9, 2022, 6:07 PM
95
points
14
comments
4
min read
LW
link
(attentionspan.blog)
A caveat to the Orthogonality Thesis
Wuschel Schulz
Nov 9, 2022, 3:06 PM
38
points
10
comments
2
min read
LW
link
Wednesday South Bay Meetups, November 16
Leonard Zabarsky
Nov 9, 2022, 2:21 AM
1
point
0
comments
1
min read
LW
link
FTX will probably be sold at a steep discount. What we know and some forecasts on what will happen next
Nathan Young
Nov 9, 2022, 2:14 AM
60
points
21
comments
LW
link
A first success story for Outer Alignment: InstructGPT
Noosphere89
Nov 8, 2022, 10:52 PM
6
points
1
comment
1
min read
LW
link
(openai.com)
Trying Mastodon
jefftk
Nov 8, 2022, 7:10 PM
12
points
4
comments
1
min read
LW
link
(www.jefftk.com)
Inverse scaling can become U-shaped
Edouard Harris
Nov 8, 2022, 7:04 PM
27
points
15
comments
1
min read
LW
link
(arxiv.org)
People care about each other even though they have imperfect motivational pointers?
TurnTrout
Nov 8, 2022, 6:15 PM
33
points
25
comments
7
min read
LW
link
Applying superintelligence without collusion
Eric Drexler
Nov 8, 2022, 6:08 PM
109
points
63
comments
4
min read
LW
link
[Question]
Binance is buying FTX.com: How did it happen and what are the implications?
Caerulean
Nov 8, 2022, 5:14 PM
16
points
6
comments
1
min read
LW
link
Some advice on independent research
Marius Hobbhahn
Nov 8, 2022, 2:46 PM
56
points
5
comments
10
min read
LW
link
Mysteries of mode collapse
janus
Nov 8, 2022, 10:37 AM
284
points
57
comments
14
min read
LW
link
1
review
[ASoT] Thoughts on GPT-N
Ulisse Mini
Nov 8, 2022, 7:14 AM
8
points
0
comments
1
min read
LW
link
Michael Simm—Introducing Myself
Michael Simm
Nov 8, 2022, 5:45 AM
4
points
0
comments
2
min read
LW
link
EA & LW Forums Weekly Summary (31st Oct − 6th Nov 22′)
Zoe Williams
Nov 8, 2022, 3:58 AM
12
points
1
comment
LW
link
[Question]
Value of Querying 100+ People About Humanity’s Future
T431
Nov 8, 2022, 12:41 AM
9
points
3
comments
2
min read
LW
link
How could we know that an AGI system will have good consequences?
So8res
Nov 7, 2022, 10:42 PM
111
points
25
comments
5
min read
LW
link
A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)
Neel Nanda
Nov 7, 2022, 10:39 PM
30
points
15
comments
3
min read
LW
link
(youtu.be)
Intercept article about lab accidents
ChristianKl
Nov 7, 2022, 9:10 PM
23
points
9
comments
1
min read
LW
link
(theintercept.com)
The biological function of love for non-kin is to gain the trust of people we cannot deceive
chaosmage
Nov 7, 2022, 8:26 PM
43
points
3
comments
8
min read
LW
link
Distillation Experiment: Chunk-Knitting
DirectedEvolution
Nov 7, 2022, 7:56 PM
10
points
3
comments
6
min read
LW
link
Thinking About Mastodon
jefftk
Nov 7, 2022, 7:40 PM
33
points
17
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
Ideas for tiny research projects related to rationality?
Frej
Nov 7, 2022, 6:45 PM
3
points
1
comment
1
min read
LW
link
Loss of control of AI is not a likely source of AI x-risk
squek
Nov 7, 2022, 6:44 PM
−6
points
0
comments
5
min read
LW
link
AI Safety Unconference NeurIPS 2022
Orpheus
Nov 7, 2022, 3:39 PM
25
points
0
comments
LW
link
(aisafetyevents.org)
Hacker-AI – Does it already exist?
Erland Wittkotter
Nov 7, 2022, 2:01 PM
3
points
12
comments
11
min read
LW
link
What’s the Deal with Elon Musk and Twitter?
Zvi
Nov 7, 2022, 1:50 PM
60
points
13
comments
31
min read
LW
link
(thezvi.wordpress.com)
How to Make Easy Decisions
lynettebye
Nov 7, 2022, 1:17 PM
17
points
3
comments
2
min read
LW
link
Opportunities that surprised us during our Clearer Thinking Regrants program
spencerg
Nov 7, 2022, 1:09 PM
20
points
0
comments
LW
link
4 Key Assumptions in AI Safety
Prometheus
Nov 7, 2022, 10:50 AM
20
points
5
comments
7
min read
LW
link
Google Search as a Washed Up Service Dog: “I HALP!”
Shmi
Nov 7, 2022, 7:02 AM
20
points
8
comments
1
min read
LW
link
[Book Review] “Station Eleven” by Emily St. John Mandel
lsusr
Nov 7, 2022, 5:56 AM
17
points
1
comment
1
min read
LW
link
Counterfactability
Scott Garrabrant
Nov 7, 2022, 5:39 AM
40
points
5
comments
11
min read
LW
link
2022 LessWrong Census?
SurfingOrca
Nov 7, 2022, 5:16 AM
67
points
13
comments
1
min read
LW
link
A philosopher’s critique of RLHF
TW123
Nov 7, 2022, 2:42 AM
55
points
8
comments
2
min read
LW
link
[Question]
Is there any discussion on avoiding being Dutch-booked or otherwise taken advantage of one’s bounded rationality by refusing to engage?
Shmi
Nov 7, 2022, 2:36 AM
38
points
29
comments
1
min read
LW
link
Exams-Only Universities
Mati_Roy
Nov 6, 2022, 10:05 PM
80
points
40
comments
2
min read
LW
link
Democracy Is in Danger, but Not for the Reasons You Think
ExCeph
Nov 6, 2022, 9:15 PM
−7
points
4
comments
12
min read
LW
link
(ginnungagapfoundation.wordpress.com)
Playground Game: Monster
jefftk
Nov 6, 2022, 4:00 PM
14
points
4
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
Has Pascal’s Mugging problem been completely solved yet?
EniScien
Nov 6, 2022, 12:52 PM
3
points
11
comments
1
min read
LW
link
[Question]
Should I Pursue a PhD?
DragonGod
Nov 6, 2022, 10:58 AM
8
points
8
comments
2
min read
LW
link
You won’t solve alignment without agent foundations
Mikhail Samin
Nov 6, 2022, 8:07 AM
29
points
3
comments
8
min read
LW
link
Word-Distance vs Idea-Distance: The Case for Lanoitaring
Sable
Nov 6, 2022, 5:25 AM
7
points
7
comments
7
min read
LW
link
(affablyevil.substack.com)
Apple Cider Syrup
jefftk
Nov 6, 2022, 2:10 AM
11
points
6
comments
1
min read
LW
link
(www.jefftk.com)
What is epigenetics?
Metacelsus
Nov 6, 2022, 1:24 AM
78
points
4
comments
6
min read
LW
link
(denovo.substack.com)
Response
Jarred Filmer
Nov 6, 2022, 1:03 AM
29
points
2
comments
12
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel