Refine: what helped me write more?
Alexander Gietelink Oldenziel
Oct 25, 2022, 2:44 PM
12
points
0
comments
2
min read
LW
link
Logical Decision Theories: Our final failsafe?
Noosphere89
Oct 25, 2022, 12:51 PM
−7
points
8
comments
1
min read
LW
link
(www.lesswrong.com)
What will the scaled up GATO look like? (Updated with questions)
Amal
Oct 25, 2022, 12:44 PM
34
points
22
comments
1
min read
LW
link
Mechanism Design for AI Safety—Reading Group Curriculum
Rubi J. Hudson
Oct 25, 2022, 3:54 AM
15
points
3
comments
LW
link
Furry Rationalists & Effective Anthropomorphism both exist
agentydragon
Oct 25, 2022, 3:37 AM
42
points
3
comments
1
min read
LW
link
EA & LW Forums Weekly Summary (17 − 23 Oct 22′)
Zoe Williams
Oct 25, 2022, 2:57 AM
10
points
0
comments
LW
link
Dance Weekends: Tests not Masks
jefftk
Oct 25, 2022, 2:10 AM
12
points
0
comments
2
min read
LW
link
(www.jefftk.com)
[Question]
What is good Cyber Security Advice?
Gunnar_Zarncke
Oct 24, 2022, 11:27 PM
30
points
12
comments
2
min read
LW
link
Connections between Mind-Body Problem & Civilizations
oblivion
Oct 24, 2022, 9:55 PM
−3
points
1
comment
1
min read
LW
link
[Question]
Rationalism and money
David K
Oct 24, 2022, 9:22 PM
−5
points
2
comments
1
min read
LW
link
[Question]
Game semantics
David K
Oct 24, 2022, 9:22 PM
2
points
2
comments
1
min read
LW
link
A Good Future (rough draft)
Michael Soareverix
Oct 24, 2022, 8:45 PM
10
points
5
comments
3
min read
LW
link
A Barebones Guide to Mechanistic Interpretability Prerequisites
Neel Nanda
Oct 24, 2022, 8:45 PM
64
points
12
comments
3
min read
LW
link
(neelnanda.io)
POWERplay: An open-source toolchain to study AI power-seeking
Edouard Harris
Oct 24, 2022, 8:03 PM
29
points
0
comments
1
min read
LW
link
(github.com)
Consider trying Vivek Hebbar’s alignment exercises
Orpheus16
Oct 24, 2022, 7:46 PM
38
points
1
comment
4
min read
LW
link
[Question]
Education not meant for mass-consumption
Tolo
Oct 24, 2022, 7:45 PM
7
points
5
comments
2
min read
LW
link
Realizations in Regards to Masculinity
nmc
Oct 24, 2022, 7:42 PM
−2
points
2
comments
2
min read
LW
link
The Futility of Religion
nmc
Oct 24, 2022, 7:42 PM
−1
points
5
comments
3
min read
LW
link
The optimal timing of spending on AGI safety work; why we should probably be spending more now
Tristan Cook
Oct 24, 2022, 5:42 PM
62
points
0
comments
LW
link
AGI in our lifetimes is wishful thinking
niknoble
Oct 24, 2022, 11:53 AM
1
point
25
comments
8
min read
LW
link
DeepMind on Stratego, an imperfect information game
sanxiyn
Oct 24, 2022, 5:57 AM
15
points
9
comments
1
min read
LW
link
(arxiv.org)
[Question]
TOMT: Post from 1-2 years ago talking about a paper on social networks
Simon Berens
Oct 24, 2022, 1:29 AM
5
points
1
comment
1
min read
LW
link
AI researchers announce NeuroAI agenda
Cameron Berg
Oct 24, 2022, 12:14 AM
37
points
12
comments
6
min read
LW
link
(arxiv.org)
Empowerment is (almost) All We Need
jacob_cannell
Oct 23, 2022, 9:48 PM
61
points
44
comments
17
min read
LW
link
“Originality is nothing but judicious imitation”—Voltaire
Vestozia
Oct 23, 2022, 7:00 PM
0
points
0
comments
13
min read
LW
link
Mid-Peninsula ACX/LW Meetup [CANCELLED]
moshezadka
Oct 23, 2022, 5:37 PM
1
point
0
comments
1
min read
LW
link
I am a Memoryless System
Nicholas / Heather Kross
Oct 23, 2022, 5:34 PM
26
points
2
comments
9
min read
LW
link
(www.thinkingmuchbetter.com)
Accountability Buddies: Why you might want one.
Samuel Nellessen
Oct 23, 2022, 4:25 PM
10
points
3
comments
LW
link
How to get past Haidt’s elephant and listen
Astynax
Oct 23, 2022, 4:06 PM
13
points
4
comments
2
min read
LW
link
Writing Russian and Ukrainian words in Latin script
Viliam
Oct 23, 2022, 3:25 PM
19
points
22
comments
6
min read
LW
link
[Question]
Have you noticed any ways that rationalists differ? [Brainstorming session]
tailcalled
Oct 23, 2022, 11:32 AM
23
points
22
comments
1
min read
LW
link
Mnestics
Jarred Filmer
Oct 23, 2022, 12:30 AM
122
points
6
comments
4
min read
LW
link
Telic intuitions across the sciences
mrcbarbier
Oct 22, 2022, 9:31 PM
4
points
0
comments
17
min read
LW
link
A basic lexicon of telic concepts
mrcbarbier
Oct 22, 2022, 9:28 PM
2
points
0
comments
3
min read
LW
link
Do we have the right kind of math for roles, goals and meaning?
mrcbarbier
Oct 22, 2022, 9:28 PM
13
points
5
comments
7
min read
LW
link
[Question]
The Last Year - is there an existing novel about the last year before AI doom?
Luca Petrolati
Oct 22, 2022, 8:44 PM
4
points
4
comments
1
min read
LW
link
The highest-probability outcome can be out of distribution
tailcalled
Oct 22, 2022, 8:00 PM
14
points
5
comments
1
min read
LW
link
Newsletter for Alignment Research: The ML Safety Updates
Esben Kran
Oct 22, 2022, 4:17 PM
25
points
0
comments
LW
link
Crypto loves impact markets: Notes from Schelling Point Bogotá
Rachel Shu
Oct 22, 2022, 3:58 PM
17
points
2
comments
LW
link
[Question]
When trying to define general intelligence is ability to achieve goals the best metric?
jmh
Oct 22, 2022, 3:09 AM
5
points
0
comments
1
min read
LW
link
[Question]
Simple question about corrigibility and values in AI.
jmh
Oct 22, 2022, 2:59 AM
6
points
1
comment
1
min read
LW
link
Moorean Statements
David Udell
Oct 22, 2022, 12:50 AM
11
points
11
comments
1
min read
LW
link
Wisdom Cannot Be Unzipped
Sable
Oct 22, 2022, 12:28 AM
74
points
17
comments
7
min read
LW
link
1
review
(affablyevil.substack.com)
A framework and open questions for game theoretic shard modeling
Garrett Baker
Oct 21, 2022, 9:40 PM
11
points
4
comments
4
min read
LW
link
Cooperators are more powerful than agents
Ivan Vendrov
Oct 21, 2022, 8:02 PM
29
points
7
comments
3
min read
LW
link
Intelligent behaviour across systems, scales and substrates
Nora_Ammann
Oct 21, 2022, 5:09 PM
11
points
0
comments
10
min read
LW
link
Deepfake(?) Phishing
jefftk
Oct 21, 2022, 2:30 PM
37
points
9
comments
1
min read
LW
link
(www.jefftk.com)
acronyms ftw
Emrik
Oct 21, 2022, 1:36 PM
−2
points
5
comments
2
min read
LW
link
Crossword puzzle: LessWrong Halloween 2022
jchan
Oct 21, 2022, 12:41 PM
11
points
11
comments
1
min read
LW
link
Weekly Roundup #2
Zvi
Oct 21, 2022, 12:10 PM
37
points
2
comments
11
min read
LW
link
(thezvi.wordpress.com)