Murphyjitsu: an Inner Simulator algorithm
CFAR!Duncan
Jun 30, 2022, 9:50 PM
67
points
24
comments
11
min read
LW
link
2
reviews
GPT-3 Catching Fish in Morse Code
Megan Kinniment
Jun 30, 2022, 9:22 PM
117
points
27
comments
8
min read
LW
link
Metacognition in the Rat
Jacob Falkovich
Jun 30, 2022, 8:53 PM
19
points
0
comments
6
min read
LW
link
On viewquakes
Dalton Mabery
Jun 30, 2022, 8:08 PM
8
points
0
comments
2
min read
LW
link
The Track Record of Futurists Seems … Fine
HoldenKarnofsky
Jun 30, 2022, 7:40 PM
91
points
25
comments
12
min read
LW
link
(www.cold-takes.com)
Quick survey on AI alignment resources
frances_lorenz
Jun 30, 2022, 7:09 PM
14
points
0
comments
1
min read
LW
link
[Linkpost] Solving Quantitative Reasoning Problems with Language Models
Yitz
Jun 30, 2022, 6:58 PM
76
points
15
comments
2
min read
LW
link
(storage.googleapis.com)
Failing to fix a dangerous intersection
alyssavance
Jun 30, 2022, 6:09 PM
110
points
17
comments
2
min read
LW
link
Most Functions Have Undesirable Global Extrema
En Kepeig
Jun 30, 2022, 5:10 PM
8
points
5
comments
3
min read
LW
link
Hedonistic Isotopes:
Trozxzr
Jun 30, 2022, 4:49 PM
1
point
0
comments
1
min read
LW
link
Abadarian Trades
David Udell
Jun 30, 2022, 4:41 PM
17
points
22
comments
2
min read
LW
link
Covid 6/30/22: Vaccine Update Update
Zvi
Jun 30, 2022, 2:00 PM
32
points
6
comments
12
min read
LW
link
(thezvi.wordpress.com)
[Question]
How should I talk about optimal but not subgame-optimal play?
JamesFaville
Jun 30, 2022, 1:58 PM
5
points
1
comment
3
min read
LW
link
Formal Philosophy and Alignment Possible Projects
Daniel Herrmann
Jun 30, 2022, 10:42 AM
34
points
5
comments
8
min read
LW
link
Bangalore LW/ACX Meetup in person
Aditya
Jun 30, 2022, 7:21 AM
5
points
2
comments
1
min read
LW
link
Cultivating And Destroying Agency
hath
Jun 30, 2022, 3:59 AM
104
points
11
comments
9
min read
LW
link
$500 bounty for alignment contest ideas
Orpheus16
Jun 30, 2022, 1:56 AM
29
points
5
comments
2
min read
LW
link
any good rationalist guides to nutrition / healthy eating?
Ben A
Jun 30, 2022, 12:50 AM
7
points
15
comments
1
min read
LW
link
A summary of every Replacing Guilt post
Orpheus16
Jun 30, 2022, 12:46 AM
35
points
3
comments
10
min read
LW
link
(forum.effectivealtruism.org)
Gradient hacking: definitions and examples
Richard_Ngo
Jun 29, 2022, 9:35 PM
38
points
2
comments
5
min read
LW
link
Progress links and tweets, 2022-06-29
jasoncrawford
Jun 29, 2022, 9:33 PM
9
points
0
comments
1
min read
LW
link
(rootsofprogress.org)
[Question]
Correcting human error vs doing exactly what you’re told—is there literature on this in context of general system design?
Jan Czechowski
Jun 29, 2022, 9:30 PM
6
points
0
comments
1
min read
LW
link
Latent Adversarial Training
Adam Jermyn
Jun 29, 2022, 8:04 PM
52
points
13
comments
5
min read
LW
link
Game Review: This Merchant Life
Zvi
Jun 29, 2022, 6:30 PM
20
points
0
comments
13
min read
LW
link
(thezvi.wordpress.com)
Limits to Legibility
Jan_Kulveit
Jun 29, 2022, 5:42 PM
157
points
11
comments
5
min read
LW
link
1
review
Will Capabilities Generalise More?
Ramana Kumar
Jun 29, 2022, 5:12 PM
133
points
39
comments
4
min read
LW
link
Kevin Kelly’s “103 Bits of Advice,” Expanded
Dalton Mabery
Jun 29, 2022, 1:36 PM
19
points
0
comments
5
min read
LW
link
The table of different sampling assumptions in anthropics
avturchin
Jun 29, 2022, 10:41 AM
39
points
5
comments
12
min read
LW
link
Can We Align AI by Having It Learn Human Preferences? I’m Scared (summary of last third of Human Compatible)
apollonianblues
Jun 29, 2022, 4:09 AM
19
points
3
comments
6
min read
LW
link
Kurzgesagt – The Last Human (Youtube)
habryka
Jun 29, 2022, 3:28 AM
54
points
7
comments
1
min read
LW
link
(www.youtube.com)
[Question]
Literature on How to Maximize Preferences
josh
Jun 28, 2022, 10:41 PM
1
point
0
comments
1
min read
LW
link
Challenge: A Much More Alien Message
kman
Jun 28, 2022, 9:50 PM
24
points
7
comments
1
min read
LW
link
It’s Probably Not Lithium
Natália
Jun 28, 2022, 9:24 PM
442
points
187
comments
28
min read
LW
link
1
review
Reflections on Living in “Guess Culture”
Dalton Mabery
Jun 28, 2022, 9:00 PM
13
points
1
comment
3
min read
LW
link
[Question]
What is the LessWrong Logo(?) Supposed to Represent?
DragonGod
Jun 28, 2022, 8:20 PM
8
points
6
comments
1
min read
LW
link
What Are You Tracking In Your Head?
johnswentworth
Jun 28, 2022, 7:30 PM
289
points
83
comments
4
min read
LW
link
1
review
Why is so much political commentary misleading?
contrarianbrit
Jun 28, 2022, 5:10 PM
−2
points
5
comments
6
min read
LW
link
(thomasprosser.substack.com)
CFAR Handbook: Introduction
CFAR!Duncan
Jun 28, 2022, 4:53 PM
116
points
12
comments
1
min read
LW
link
Units of Exchange
CFAR!Duncan
Jun 28, 2022, 4:53 PM
99
points
28
comments
11
min read
LW
link
Scott Aaronson and Steven Pinker Debate AI Scaling
Liron
Jun 28, 2022, 4:04 PM
37
points
7
comments
1
min read
LW
link
(scottaaronson.blog)
A physicist’s approach to Origins of Life
pchvykov
Jun 28, 2022, 3:23 PM
12
points
6
comments
16
min read
LW
link
What success looks like
Marius Hobbhahn, MaxRa, JasperGeh and Yannick_Muehlhaeuser
Jun 28, 2022, 2:38 PM
19
points
4
comments
1
min read
LW
link
(forum.effectivealtruism.org)
Four reasons I find AI safety emotionally compelling
KatWoods and AmberDawn
Jun 28, 2022, 2:10 PM
39
points
3
comments
4
min read
LW
link
Some alternative AI safety research projects
Michele Campolo
Jun 28, 2022, 2:09 PM
9
points
0
comments
3
min read
LW
link
Doom doubts—is inner alignment a likely problem?
Crissman
Jun 28, 2022, 12:42 PM
6
points
7
comments
1
min read
LW
link
Low-Friction MBTA Predictions
jefftk
Jun 28, 2022, 12:30 PM
15
points
0
comments
1
min read
LW
link
(www.jefftk.com)
What Diet Books Don’t Teach: A book review and a request for more reading
Lone Pine
Jun 28, 2022, 12:27 PM
22
points
34
comments
4
min read
LW
link
Assessing AlephAlphas Multimodal Model
p.b.
28 Jun 2022 9:28 UTC
30
points
5
comments
3
min read
LW
link
[Question]
Is there any way someone could post about public policy relating to abortion access (or another sensitive subject) on LessWrong without getting super downvoted?
Evan_Gaensbauer
28 Jun 2022 5:45 UTC
18
points
20
comments
1
min read
LW
link
[Test Post Please Ignore] Testing polling features
Lone Pine
28 Jun 2022 4:35 UTC
7
points
5
comments
1
min read
LW
link