Eric Winsor
Karma: 503
Toward Safety Case Inspired Basic Research
Lucas Teixeira, Lauren Greenspan, Dmitry Vaintrob and Eric Winsor
31 Oct 2024 23:06 UTC, 47 points, 2 comments, 13 min read
A Universal Emergent Decomposition of Retrieval Tasks in Language Models
Alexandre Variengien and Eric Winsor
19 Dec 2023 11:52 UTC, 84 points, 3 comments, 10 min read
(arxiv.org)
Basic Facts about Language Model Internals
beren and Eric Winsor
4 Jan 2023 13:01 UTC, 130 points, 19 comments, 9 min read
Re-Examining LayerNorm
Eric Winsor
1 Dec 2022 22:20 UTC, 125 points, 12 comments, 5 min read
Interpreting Neural Networks through the Polytope Lens
Sid Black, Lee Sharkey, Connor Leahy, beren, CRG, merizian, Eric Winsor and Dan Braun
23 Sep 2022 17:58 UTC, 144 points, 29 comments, 33 min read