Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page
2
Current themes in mechanistic interpretability research
Lee Sharkey
,
Sid Black
and
beren
Nov 16, 2022, 2:14 PM
89
points
2
comments
12
min read
LW
link
Unpacking “Shard Theory” as Hunch, Question, Theory, and Insight
Jacy Reese Anthis
Nov 16, 2022, 1:54 PM
31
points
9
comments
2
min read
LW
link
Miracles and why not to believe them
mruwnik
Nov 16, 2022, 12:07 PM
4
points
0
comments
2
min read
LW
link
[Question]
How do people do remote research collaborations effectively?
Krieger
Nov 16, 2022, 11:51 AM
8
points
0
comments
1
min read
LW
link
Method of statements: an alternative to taboo
Q Home
Nov 16, 2022, 10:57 AM
7
points
0
comments
41
min read
LW
link
The two conceptions of Active Inference: an intelligence architecture and a theory of agency
Roman Leventov
Nov 16, 2022, 9:30 AM
17
points
0
comments
4
min read
LW
link
Developer experience for the motivation
Adam Zerner
Nov 16, 2022, 7:12 AM
49
points
7
comments
4
min read
LW
link
Progress links and tweets, 2022-11-15
jasoncrawford
Nov 16, 2022, 3:21 AM
9
points
0
comments
2
min read
LW
link
(rootsofprogress.org)
EA & LW Forums Weekly Summary (7th Nov − 13th Nov 22′)
Zoe Williams
Nov 16, 2022, 3:04 AM
19
points
0
comments
LW
link
The FTX Saga—Simplified
Annapurna
Nov 16, 2022, 2:42 AM
44
points
10
comments
7
min read
LW
link
(jorgevelez.substack.com)
Utilitarianism and the idea of a “rational agent” are fundamentally inconsistent with reality
banev
Nov 16, 2022, 12:19 AM
−4
points
1
comment
1
min read
LW
link
[Question]
Is the speed of training large models going to increase significantly in the near future due to Cerebras Andromeda?
Amal
Nov 15, 2022, 10:50 PM
13
points
11
comments
1
min read
LW
link
[Question]
What is our current best infohazard policy for AGI (safety) research?
Roman Leventov
Nov 15, 2022, 10:33 PM
12
points
2
comments
1
min read
LW
link
ACX/SSC Meetup 1 pm Sunday Nov 20
svfritz
Nov 15, 2022, 8:39 PM
2
points
0
comments
1
min read
LW
link
SBF x LoL
Nicholas / Heather Kross
Nov 15, 2022, 8:24 PM
17
points
6
comments
LW
link
Some research ideas in forecasting
Jsevillamol
Nov 15, 2022, 7:47 PM
35
points
2
comments
LW
link
Strategy of Inner Conflict
Jonathan Moregård
Nov 15, 2022, 7:38 PM
9
points
4
comments
6
min read
LW
link
(honestliving.substack.com)
The limited upside of interpretability
Peter S. Park
Nov 15, 2022, 6:46 PM
13
points
11
comments
LW
link
Why bet Kelly?
AlexMennen
Nov 15, 2022, 6:12 PM
32
points
14
comments
5
min read
LW
link
Entropy Scaling And Intrinsic Memory
Alexander Gietelink Oldenziel
and
Adam Shai
Nov 15, 2022, 6:11 PM
20
points
5
comments
5
min read
LW
link
[Question]
Will nanotech/biotech be what leads to AI doom?
tailcalled
Nov 15, 2022, 5:38 PM
4
points
9
comments
2
min read
LW
link
Value Formation: An Overarching Model
Thane Ruthenis
Nov 15, 2022, 5:16 PM
34
points
20
comments
34
min read
LW
link
Internal communication framework
rosehadshar
and
Nora_Ammann
Nov 15, 2022, 12:41 PM
38
points
13
comments
12
min read
LW
link
Better Mastodon Aliases
jefftk
Nov 15, 2022, 12:10 PM
14
points
3
comments
1
min read
LW
link
(www.jefftk.com)
The economy as an analogy for advanced AI systems
rosehadshar
and
particlemania
Nov 15, 2022, 11:16 AM
28
points
0
comments
5
min read
LW
link
We need better prediction markets
eigen
Nov 15, 2022, 4:54 AM
9
points
8
comments
1
min read
LW
link
Preventing, reversing, and addressing data leakage: some thoughts
VipulNaik
Nov 15, 2022, 2:09 AM
14
points
4
comments
25
min read
LW
link
Winners of the AI Safety Nudge Competition
Marc Carauleanu
Nov 15, 2022, 1:06 AM
4
points
0
comments
LW
link
Lying to Save Humanity
cebsuvx
Nov 14, 2022, 11:04 PM
−1
points
4
comments
1
min read
LW
link
Moral contagion heuristic
Mvolz
Nov 14, 2022, 9:17 PM
14
points
3
comments
2
min read
LW
link
Will we run out of ML data? Evidence from projecting dataset size trends
Pablo Villalobos
Nov 14, 2022, 4:42 PM
75
points
12
comments
2
min read
LW
link
(epochai.org)
I (with the help of a few more people) am planning to create an introduction to AI Safety that a smart teenager can understand. What am I missing?
Tapatakt
Nov 14, 2022, 4:12 PM
3
points
5
comments
1
min read
LW
link
Two New Newcomb Variants
eva_
Nov 14, 2022, 2:01 PM
26
points
24
comments
3
min read
LW
link
Improving Emergency Vehicle Utilization
jefftk
Nov 14, 2022, 2:00 PM
15
points
10
comments
1
min read
LW
link
(www.jefftk.com)
X-risk Mitigation Does Actually Require Longtermism
DragonGod
Nov 14, 2022, 12:54 PM
6
points
1
comment
LW
link
[Question]
Why don’t we have self driving cars yet?
Linda Linsefors
Nov 14, 2022, 12:19 PM
22
points
16
comments
1
min read
LW
link
Eigenvalues for Distance from The Buddhist Precepts And The Ten Commandments
benjamin.j.campbell
Nov 14, 2022, 5:50 AM
−3
points
2
comments
1
min read
LW
link
AI Safety Microgrant Round
Chris_Leong
Nov 14, 2022, 4:25 AM
22
points
1
comment
LW
link
Estimating the probability that FTX Future Fund grant money gets clawed back
spencerg
Nov 14, 2022, 3:33 AM
28
points
6
comments
LW
link
Rational overconfidence in the tens of billions: recent example
banev
Nov 13, 2022, 10:48 PM
−20
points
3
comments
2
min read
LW
link
In Defence of Temporal Discounting in Longtermist Ethics
DragonGod
Nov 13, 2022, 9:54 PM
25
points
4
comments
LW
link
Announcing Nonlinear Emergency Funding
KatWoods
Nov 13, 2022, 7:02 PM
54
points
0
comments
LW
link
The Alignment Community Is Culturally Broken
sudo
Nov 13, 2022, 6:53 PM
136
points
68
comments
2
min read
LW
link
The Futility of Status and Signalling
Ape in the coat
Nov 13, 2022, 5:14 PM
19
points
4
comments
3
min read
LW
link
A short critique of Vanessa Kosoy’s PreDCA
Martín Soto
Nov 13, 2022, 4:00 PM
28
points
8
comments
4
min read
LW
link
What’s the Alternative to Independence?
jefftk
Nov 13, 2022, 3:30 PM
50
points
3
comments
1
min read
LW
link
(www.jefftk.com)
Decision making under model ambiguity, moral uncertainty, and other agents with free will?
Jobst Heitzig
Nov 13, 2022, 12:50 PM
4
points
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
The sky is not blue (pardon the obviousness)
banev
Nov 13, 2022, 10:49 AM
−13
points
6
comments
1
min read
LW
link
Characterizing Intrinsic Compositionality in Transformers with Tree Projections
Ulisse Mini
Nov 13, 2022, 9:46 AM
12
points
2
comments
1
min read
LW
link
(arxiv.org)
Noting an unsubstantiated communal belief about the FTX disaster
Yitz
Nov 13, 2022, 5:37 AM
50
points
52
comments
LW
link
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel