Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Eric Winsor
Karma:
513
All
Posts
Comments
New
Top
Old
Toward Safety Case Inspired Basic Research
Lucas Teixeira
,
Lauren Greenspan
,
Dmitry Vaintrob
and
Eric Winsor
Oct 31, 2024, 11:06 PM
55
points
3
comments
13
min read
LW
link
A Universal Emergent Decomposition of Retrieval Tasks in Language Models
Alexandre Variengien
and
Eric Winsor
Dec 19, 2023, 11:52 AM
84
points
3
comments
10
min read
LW
link
(arxiv.org)
Basic Facts about Language Model Internals
beren
and
Eric Winsor
Jan 4, 2023, 1:01 PM
130
points
19
comments
9
min read
LW
link
Re-Examining LayerNorm
Eric Winsor
Dec 1, 2022, 10:20 PM
127
points
12
comments
5
min read
LW
link
Interpreting Neural Networks through the Polytope Lens
Sid Black
,
Lee Sharkey
,
Connor Leahy
,
beren
,
CRG
,
merizian
,
Eric Winsor
and
Dan Braun
Sep 23, 2022, 5:58 PM
144
points
29
comments
33
min read
LW
link
Back to top
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel