Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
[Question]
Which LessWrong content would you like recorded into audio/podcast form?
Ruby
Sep 13, 2022, 1:20 AM
29
points
11
comments
1
min read
LW
link
How To Actually Succeed
Jordan Arel
Sep 13, 2022, 12:21 AM
3
points
1
comment
5
min read
LW
link
EA & LW Forums Weekly Summary (5 − 11 Sep 22′)
Zoe Williams
Sep 12, 2022, 11:24 PM
24
points
0
comments
13
min read
LW
link
Time is not the bottleneck (on making progress thinking about difficult things)
kman
Sep 12, 2022, 8:45 PM
30
points
10
comments
1
min read
LW
link
[Linkpost] A survey on over 300 works about interpretability in deep networks
scasper
Sep 12, 2022, 7:07 PM
97
points
7
comments
2
min read
LW
link
(arxiv.org)
[Question]
Why do People Think Intelligence Will be “Easy”?
DragonGod
Sep 12, 2022, 5:32 PM
15
points
32
comments
2
min read
LW
link
Alignment via prosocial brain algorithms
Cameron Berg
Sep 12, 2022, 1:48 PM
45
points
30
comments
6
min read
LW
link
I’ve written a Fantasy Novel to Promote Effective Altruism
Timothy Underwood
Sep 12, 2022, 12:14 PM
23
points
21
comments
13
min read
LW
link
Ideological Inference Engines: Making Deontology Differentiable*
Paul Bricman
Sep 12, 2022, 12:00 PM
6
points
0
comments
14
min read
LW
link
Freeloading?
jefftk
Sep 12, 2022, 11:20 AM
28
points
24
comments
3
min read
LW
link
(www.jefftk.com)
Can you force a neural network to keep generalizing?
Q Home
Sep 12, 2022, 10:14 AM
2
points
10
comments
5
min read
LW
link
Black Box Investigation Research Hackathon
Esben Kran
and
Jonas Hallgren
Sep 12, 2022, 7:20 AM
9
points
4
comments
2
min read
LW
link
Argument against 20% GDP growth from AI within 10 years [Linkpost]
aog
Sep 12, 2022, 4:08 AM
59
points
20
comments
5
min read
LW
link
(twitter.com)
AI Safety field-building projects I’d like to see
Orpheus16
Sep 11, 2022, 11:43 PM
46
points
8
comments
6
min read
LW
link
Fermi Paradox: Iron Age Milky Way
Rofel Wodring
Sep 11, 2022, 8:32 PM
−10
points
9
comments
3
min read
LW
link
You Don’t Have To Click The Links
Simon Berens
Sep 11, 2022, 6:13 PM
25
points
7
comments
1
min read
LW
link
The Ultimate Step-by-Step Hiring Playbook
intellectronica
Sep 11, 2022, 2:39 PM
8
points
2
comments
4
min read
LW
link
(www.intellectronica.net)
[Question]
In forecasting, how do accuracy, calibration and reliability relate to each other?
amarai
Sep 11, 2022, 12:04 PM
3
points
4
comments
1
min read
LW
link
Briefly thinking through some analogs of debate
Eli Tyre
Sep 11, 2022, 12:02 PM
20
points
3
comments
4
min read
LW
link
Making a New Table Leaf
jefftk
Sep 11, 2022, 11:40 AM
19
points
0
comments
1
min read
LW
link
(www.jefftk.com)
AI Risk Intro 1: Advanced AI Might Be Very Bad
CallumMcDougall
and
L Rudolf L
Sep 11, 2022, 10:57 AM
46
points
13
comments
30
min read
LW
link
A Pin and a Balloon: Anthropic Fragility Increases Chances of Runaway Global Warming
avturchin
Sep 11, 2022, 10:25 AM
33
points
23
comments
52
min read
LW
link
[Question]
Is there an Ultimate text editor?
Johannes C. Mayer
Sep 11, 2022, 9:19 AM
4
points
10
comments
1
min read
LW
link
Pascal: The Greatness and Littleness of Man, A Thinking Reed
NoBadCake
Sep 10, 2022, 8:05 PM
9
points
0
comments
1
min read
LW
link
[Job] Project Manager: Community Health (CEA)
Xodarap
Sep 10, 2022, 6:40 PM
3
points
0
comments
1
min read
LW
link
(www.centreforeffectivealtruism.org)
Unbounded utility functions and precommitment
MichaelStJules
Sep 10, 2022, 4:16 PM
4
points
3
comments
1
min read
LW
link
[Question]
What is the “Less Wrong” approved acronym for 1984-risk?
Logan Zoellner
Sep 10, 2022, 2:38 PM
5
points
8
comments
1
min read
LW
link
Find out how utilitarian you are—a mega thread of philosophy polls
spencerg
Sep 10, 2022, 2:05 PM
8
points
3
comments
1
min read
LW
link
(twitter.com)
Put Dirty Dishes in the Dishwasher
jefftk
Sep 10, 2022, 1:10 PM
37
points
16
comments
1
min read
LW
link
(www.jefftk.com)
Join ASAP! (AI Safety Accountability Programme) 🚀
CallumMcDougall
Sep 10, 2022, 11:15 AM
19
points
0
comments
3
min read
LW
link
Quintin’s alignment papers roundup—week 1
Quintin Pope
Sep 10, 2022, 6:39 AM
120
points
6
comments
9
min read
LW
link
Path dependence in ML inductive biases
Vivek Hebbar
and
evhub
Sep 10, 2022, 1:38 AM
68
points
13
comments
10
min read
LW
link
Keeping Time in Epoch Seconds
Gordon Seidoh Worley
Sep 10, 2022, 12:28 AM
11
points
2
comments
2
min read
LW
link
Ought will host a factored cognition “Lab Meeting”
jungofthewon
and
stuhlmueller
Sep 9, 2022, 11:46 PM
35
points
1
comment
1
min read
LW
link
Web4/Heaven—The Simulation
Dunning K.
Sep 9, 2022, 10:58 PM
26
points
2
comments
1
min read
LW
link
Evaluations project @ ARC is hiring a researcher and a webdev/engineer
Beth Barnes
Sep 9, 2022, 10:46 PM
99
points
7
comments
10
min read
LW
link
Swap and Scale
Stephen Fowler
Sep 9, 2022, 10:41 PM
17
points
3
comments
1
min read
LW
link
My emotional reaction to the current funding situation
Sam F. Brown
Sep 9, 2022, 10:02 PM
105
points
36
comments
5
min read
LW
link
(sambrown.eu)
AlexaTM − 20 Billion Parameter Model With Impressive Performance
MrThink
Sep 9, 2022, 9:46 PM
5
points
0
comments
1
min read
LW
link
[Fun][Link] Alignment SMBC Comic
Gunnar_Zarncke
Sep 9, 2022, 9:38 PM
7
points
2
comments
1
min read
LW
link
(www.smbc-comics.com)
Gatekeeper Victory: AI Box Reflection
Double
and
DaemonicSigil
Sep 9, 2022, 9:38 PM
6
points
6
comments
9
min read
LW
link
Interpreting Affordable Housing
jefftk
Sep 9, 2022, 7:40 PM
16
points
0
comments
1
min read
LW
link
(www.jefftk.com)
London Rationalish Meetup 2022-09-11
calmiguana
Sep 9, 2022, 6:39 PM
1
point
0
comments
1
min read
LW
link
AI alignment with humans… but with which humans?
geoffreymiller
Sep 9, 2022, 6:21 PM
12
points
33
comments
3
min read
LW
link
[Question]
Should you refrain from having children because of the risk posed by artificial intelligence?
Mientras
Sep 9, 2022, 5:39 PM
17
points
31
comments
1
min read
LW
link
Notes on Resolve
David Gross
Sep 9, 2022, 4:42 PM
10
points
3
comments
31
min read
LW
link
Oversight Leagues: The Training Game as a Feature
Paul Bricman
9 Sep 2022 10:08 UTC
20
points
6
comments
10
min read
LW
link
Understanding and avoiding value drift
TurnTrout
9 Sep 2022 4:16 UTC
48
points
14
comments
6
min read
LW
link
Samotsvety’s AI risk forecasts
elifland
9 Sep 2022 4:01 UTC
44
points
0
comments
4
min read
LW
link
Most People Start With The Same Few Bad Ideas
johnswentworth
9 Sep 2022 0:29 UTC
165
points
30
comments
3
min read
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel