Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
The harnessing of complexity
geduardo
Nov 10, 2022, 6:44 PM
6
points
2
comments
3
min read
LW
link
[Question]
I there a demo of “You can’t fetch the coffee if you’re dead”?
Ram Rachum
Nov 10, 2022, 6:41 PM
8
points
9
comments
1
min read
LW
link
Mastodon Linking Norms
jefftk
Nov 10, 2022, 3:10 PM
9
points
9
comments
2
min read
LW
link
(www.jefftk.com)
Covid 11/10/22: Into the Background
Zvi
Nov 10, 2022, 1:40 PM
31
points
5
comments
4
min read
LW
link
(thezvi.wordpress.com)
LessWrong Poll on AGI
Niclas Kupper
Nov 10, 2022, 1:13 PM
12
points
6
comments
1
min read
LW
link
The optimal angle for a solar boiler is different than for a solar panel
Yair Halberstadt
Nov 10, 2022, 10:32 AM
42
points
4
comments
2
min read
LW
link
What it’s like to dissect a cadaver
Alok Singh
Nov 10, 2022, 6:40 AM
208
points
24
comments
5
min read
LW
link
(alok.github.io)
I Converted Book I of The Sequences Into A Zoomer-Readable Format
dkirmani
Nov 10, 2022, 2:59 AM
200
points
32
comments
2
min read
LW
link
Adversarial Priors: Not Paying People to Lie to You
eva_
Nov 10, 2022, 2:29 AM
22
points
9
comments
3
min read
LW
link
Is full self-driving an AGI-complete problem?
kraemahz
Nov 10, 2022, 2:04 AM
10
points
5
comments
1
min read
LW
link
[Question]
What are examples of problems that were caused by intelligence, that couldn’t be solved with intelligence?
Peter O'Malley
Nov 10, 2022, 2:04 AM
1
point
2
comments
1
min read
LW
link
Desiderata for an Adversarial Prior
Shmi
Nov 9, 2022, 11:45 PM
13
points
2
comments
1
min read
LW
link
Chord Notation
jefftk
Nov 9, 2022, 9:30 PM
12
points
5
comments
1
min read
LW
link
(www.jefftk.com)
[ASoT] Instrumental convergence is useful
Ulisse Mini
Nov 9, 2022, 8:20 PM
5
points
9
comments
1
min read
LW
link
Mesatranslation and Metatranslation
jdp
Nov 9, 2022, 6:46 PM
25
points
4
comments
11
min read
LW
link
Trying to Make a Treacherous Mesa-Optimizer
MadHatter
Nov 9, 2022, 6:07 PM
95
points
14
comments
4
min read
LW
link
(attentionspan.blog)
A caveat to the Orthogonality Thesis
Wuschel Schulz
Nov 9, 2022, 3:06 PM
38
points
10
comments
2
min read
LW
link
Wednesday South Bay Meetups, November 16
Leonard Zabarsky
Nov 9, 2022, 2:21 AM
1
point
0
comments
1
min read
LW
link
FTX will probably be sold at a steep discount. What we know and some forecasts on what will happen next
Nathan Young
Nov 9, 2022, 2:14 AM
60
points
21
comments
LW
link
A first success story for Outer Alignment: InstructGPT
Noosphere89
Nov 8, 2022, 10:52 PM
6
points
1
comment
1
min read
LW
link
(openai.com)
Trying Mastodon
jefftk
Nov 8, 2022, 7:10 PM
12
points
4
comments
1
min read
LW
link
(www.jefftk.com)
Inverse scaling can become U-shaped
Edouard Harris
Nov 8, 2022, 7:04 PM
27
points
15
comments
1
min read
LW
link
(arxiv.org)
People care about each other even though they have imperfect motivational pointers?
TurnTrout
Nov 8, 2022, 6:15 PM
33
points
25
comments
7
min read
LW
link
Applying superintelligence without collusion
Eric Drexler
Nov 8, 2022, 6:08 PM
109
points
63
comments
4
min read
LW
link
[Question]
Binance is buying FTX.com: How did it happen and what are the implications?
Caerulean
Nov 8, 2022, 5:14 PM
16
points
6
comments
1
min read
LW
link
Some advice on independent research
Marius Hobbhahn
Nov 8, 2022, 2:46 PM
56
points
5
comments
10
min read
LW
link
Mysteries of mode collapse
janus
Nov 8, 2022, 10:37 AM
284
points
57
comments
14
min read
LW
link
1
review
[ASoT] Thoughts on GPT-N
Ulisse Mini
Nov 8, 2022, 7:14 AM
8
points
0
comments
1
min read
LW
link
Michael Simm—Introducing Myself
Michael Simm
Nov 8, 2022, 5:45 AM
4
points
0
comments
2
min read
LW
link
EA & LW Forums Weekly Summary (31st Oct − 6th Nov 22′)
Zoe Williams
Nov 8, 2022, 3:58 AM
12
points
1
comment
LW
link
[Question]
Value of Querying 100+ People About Humanity’s Future
T431
Nov 8, 2022, 12:41 AM
9
points
3
comments
2
min read
LW
link
How could we know that an AGI system will have good consequences?
So8res
Nov 7, 2022, 10:42 PM
111
points
25
comments
5
min read
LW
link
A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)
Neel Nanda
Nov 7, 2022, 10:39 PM
30
points
15
comments
3
min read
LW
link
(youtu.be)
Intercept article about lab accidents
ChristianKl
Nov 7, 2022, 9:10 PM
23
points
9
comments
1
min read
LW
link
(theintercept.com)
The biological function of love for non-kin is to gain the trust of people we cannot deceive
chaosmage
Nov 7, 2022, 8:26 PM
43
points
3
comments
8
min read
LW
link
Distillation Experiment: Chunk-Knitting
DirectedEvolution
Nov 7, 2022, 7:56 PM
10
points
3
comments
6
min read
LW
link
Thinking About Mastodon
jefftk
Nov 7, 2022, 7:40 PM
33
points
17
comments
1
min read
LW
link
(www.jefftk.com)
[Question]
Ideas for tiny research projects related to rationality?
Frej
Nov 7, 2022, 6:45 PM
3
points
1
comment
1
min read
LW
link
Loss of control of AI is not a likely source of AI x-risk
squek
Nov 7, 2022, 6:44 PM
−6
points
0
comments
5
min read
LW
link
AI Safety Unconference NeurIPS 2022
Orpheus
Nov 7, 2022, 3:39 PM
25
points
0
comments
LW
link
(aisafetyevents.org)
Hacker-AI – Does it already exist?
Erland Wittkotter
Nov 7, 2022, 2:01 PM
3
points
12
comments
11
min read
LW
link
What’s the Deal with Elon Musk and Twitter?
Zvi
Nov 7, 2022, 1:50 PM
60
points
13
comments
31
min read
LW
link
(thezvi.wordpress.com)
How to Make Easy Decisions
lynettebye
Nov 7, 2022, 1:17 PM
17
points
3
comments
2
min read
LW
link
Opportunities that surprised us during our Clearer Thinking Regrants program
spencerg
Nov 7, 2022, 1:09 PM
20
points
0
comments
LW
link
4 Key Assumptions in AI Safety
Prometheus
Nov 7, 2022, 10:50 AM
20
points
5
comments
7
min read
LW
link
Google Search as a Washed Up Service Dog: “I HALP!”
Shmi
Nov 7, 2022, 7:02 AM
20
points
8
comments
1
min read
LW
link
[Book Review] “Station Eleven” by Emily St. John Mandel
lsusr
Nov 7, 2022, 5:56 AM
17
points
1
comment
1
min read
LW
link
Counterfactability
Scott Garrabrant
Nov 7, 2022, 5:39 AM
40
points
5
comments
11
min read
LW
link
2022 LessWrong Census?
SurfingOrca
Nov 7, 2022, 5:16 AM
67
points
13
comments
1
min read
LW
link
A philosopher’s critique of RLHF
TW123
Nov 7, 2022, 2:42 AM
55
points
8
comments
2
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel