mikes
Karma: 214
Breaking Circuit Breakers
mikes and tbenthompson | Jul 14, 2024, 6:57 PM | 53 points | 13 comments | 1 min read | link (confirmlabs.org)
Fluent dreaming for language models (AI interpretability method)
tbenthompson, mikes, and Zygi Straznickas | Feb 6, 2024, 6:02 AM | 46 points | 5 comments | 1 min read | link (arxiv.org)
Takeaways from the NeurIPS 2023 Trojan Detection Competition
mikes | Jan 13, 2024, 12:35 PM | 20 points | 2 comments | 1 min read | link (confirmlabs.org)
[Question]
The literature on aluminum adjuvants is very suspicious. Small IQ tax is plausible—can any experts help me estimate it?
mikes | Jul 4, 2023, 9:33 AM | 61 points | 39 comments | 3 min read