Capital and inequality
NathanBarnard
Aug 15, 2022, 5:23 PM
7
points
2
comments
5
min read
LW
link
[Question]
Are there practical exercises for developing the Scout mindset?
ChristianKl
Aug 15, 2022, 5:23 PM
15
points
2
comments
1
min read
LW
link
[Question]
How do you get a job as a software developer?
lsusr
Aug 15, 2022, 2:45 PM
22
points
24
comments
1
min read
LW
link
The Parable of the Boy Who Cried 5% Chance of Wolf
KatWoods
Aug 15, 2022, 2:33 PM
140
points
24
comments
2
min read
LW
link
And the Revenues Are So Small
Zvi
Aug 15, 2022, 1:00 PM
19
points
5
comments
11
min read
LW
link
(thezvi.wordpress.com)
Extreme Security
lc
Aug 15, 2022, 12:11 PM
38
points
6
comments
5
min read
LW
link
No shortcuts to knowledge: Why AI needs to ease up on scaling and learn how to code
Yldedly
Aug 15, 2022, 8:42 AM
5
points
0
comments
1
min read
LW
link
(deoxyribose.github.io)
Seeking Interns/RAs for Mechanistic Interpretability Projects
Neel Nanda
Aug 15, 2022, 7:11 AM
61
points
0
comments
2
min read
LW
link
A Mechanistic Interpretability Analysis of Grokking
Neel Nanda
and
Tom Lieberum
Aug 15, 2022, 2:41 AM
373
points
48
comments
36
min read
LW
link
1
review
(colab.research.google.com)
[Question]
If a nuke is coming towards SF Bay can people bunker in BART tunnels?
Pee Doom
Aug 15, 2022, 1:56 AM
15
points
2
comments
1
min read
LW
link
[Question]
What is the probability that a superintelligent, sentient AGI is actually infeasible?
Nathan1123
Aug 14, 2022, 10:41 PM
−3
points
6
comments
1
min read
LW
link
Dealing With Delusions
adrusi
Aug 14, 2022, 9:11 PM
9
points
2
comments
1
min read
LW
link
All the posts I will never write
Alexander Gietelink Oldenziel
Aug 14, 2022, 6:29 PM
54
points
8
comments
8
min read
LW
link
Brain-like AGI project “aintelope”
Gunnar_Zarncke
Aug 14, 2022, 4:33 PM
54
points
2
comments
1
min read
LW
link
AI Transparency: Why it’s critical and how to obtain it.
Zohar Jackson
Aug 14, 2022, 10:31 AM
6
points
1
comment
5
min read
LW
link
A brief note on Simplicity Bias
carboniferous_umbraculum
Aug 14, 2022, 2:05 AM
20
points
0
comments
4
min read
LW
link
Evolution is a bad analogy for AGI: inner alignment
Quintin Pope
Aug 13, 2022, 10:15 PM
81
points
15
comments
8
min read
LW
link
An Uncanny Prison
Nathan1123
Aug 13, 2022, 9:40 PM
3
points
3
comments
2
min read
LW
link
Florida Elections
Double
Aug 13, 2022, 8:10 PM
−3
points
8
comments
1
min read
LW
link
Cultivating Valiance
Shoshannah Tekofsky
Aug 13, 2022, 6:47 PM
35
points
4
comments
4
min read
LW
link
An extended rocket alignment analogy
remember
Aug 13, 2022, 6:22 PM
28
points
3
comments
4
min read
LW
link
[Question]
The OpenAI playground for GPT-3 is a terrible interface. Is there any great local (or web) app for exploring/learning with language models?
aviv
Aug 13, 2022, 4:34 PM
3
points
1
comment
1
min read
LW
link
[Question]
What is an agent in reductionist materialism?
Valentine
Aug 13, 2022, 3:39 PM
7
points
17
comments
1
min read
LW
link
Refine’s First Blog Post Day
adamShimi
Aug 13, 2022, 10:23 AM
55
points
3
comments
1
min read
LW
link
The Dumbest Possible Gets There First
Artaxerxes
Aug 13, 2022, 10:20 AM
44
points
7
comments
2
min read
LW
link
I missed the crux of the alignment problem the whole time
zeshen
Aug 13, 2022, 10:11 AM
53
points
7
comments
3
min read
LW
link
Shapes of Mind and Pluralism in Alignment
adamShimi
Aug 13, 2022, 10:01 AM
33
points
2
comments
2
min read
LW
link
How I think about alignment
Linda Linsefors
Aug 13, 2022, 10:01 AM
31
points
11
comments
5
min read
LW
link
Steelmining via Analogy
Paul Bricman
Aug 13, 2022, 9:59 AM
24
points
0
comments
2
min read
LW
link
(paulbricman.com)
Appendix: Jargon Dictionary
CFAR!Duncan
Aug 13, 2022, 8:09 AM
34
points
5
comments
21
min read
LW
link
Appendix: Hamming Questions
CFAR!Duncan
Aug 13, 2022, 8:07 AM
41
points
0
comments
2
min read
LW
link
Building a Bugs List prompts
CFAR!Duncan
Aug 13, 2022, 8:00 AM
69
points
9
comments
2
min read
LW
link
Cambridge LW Meetup: Constructive Complaining
Tony Wang
Aug 13, 2022, 4:52 AM
2
points
0
comments
1
min read
LW
link
Gradient descent doesn’t select for inner search
Ivan Vendrov
Aug 13, 2022, 4:15 AM
47
points
23
comments
4
min read
LW
link
[Question]
How to bet against civilizational adequacy?
Wei Dai
Aug 12, 2022, 11:33 PM
54
points
20
comments
1
min read
LW
link
Infant AI Scenario
Nathan1123
Aug 12, 2022, 9:20 PM
1
point
0
comments
3
min read
LW
link
DeepMind alignment team opinions on AGI ruin arguments
Vika
Aug 12, 2022, 9:06 PM
395
points
37
comments
14
min read
LW
link
1
review
Dissolve: The Petty Crimes of Blaise Pascal
SebastianG
Aug 12, 2022, 8:04 PM
17
points
4
comments
6
min read
LW
link
The Host Minds of HBO’s Westworld.
Nerret
Aug 12, 2022, 6:53 PM
1
point
0
comments
3
min read
LW
link
What is estimational programming? Squiggle in context
Quinn
Aug 12, 2022, 6:39 PM
14
points
7
comments
7
min read
LW
link
Oversight Misses 100% of Thoughts The AI Does Not Think
johnswentworth
Aug 12, 2022, 4:30 PM
111
points
49
comments
1
min read
LW
link
Timelines explanation post part 1 of ?
Nathan Helm-Burger
Aug 12, 2022, 4:13 PM
10
points
1
comment
2
min read
LW
link
A little playing around with Blenderbot3
Nathan Helm-Burger
Aug 12, 2022, 4:06 PM
9
points
0
comments
1
min read
LW
link
Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Vika
,
Vikrant Varma
,
Ramana Kumar
and
Mary Phuong
Aug 12, 2022, 3:17 PM
86
points
4
comments
3
min read
LW
link
1
review
(vkrakovna.wordpress.com)
Argument by Intellectual Ordeal
lc
Aug 12, 2022, 1:03 PM
26
points
5
comments
5
min read
LW
link
Anti-squatted AI x-risk domains index
plex
Aug 12, 2022, 12:01 PM
59
points
6
comments
1
min read
LW
link
[Question]
Perfect Predictors
aditya malik
Aug 12, 2022, 11:51 AM
2
points
5
comments
1
min read
LW
link
[Question]
What are some good arguments against building new nuclear power plants?
RomanS
Aug 12, 2022, 7:32 AM
16
points
15
comments
2
min read
LW
link
Seeking PCK (Pedagogical Content Knowledge)
CFAR!Duncan
Aug 12, 2022, 4:15 AM
62
points
11
comments
5
min read
LW
link
Artificial intelligence wireheading
Big Tony
Aug 12, 2022, 3:06 AM
5
points
2
comments
1
min read
LW
link