Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Dan Braun
Karma:
755
Lead engineer at Apollo Research
All
Posts
Comments
New
Top
Old
Apollo Research 1-year update
Marius Hobbhahn
,
Lee Sharkey
,
Lucius Bushnaq
,
Dan Braun
,
Mikita Balesni
,
Jérémy Scheurer
,
Nicholas Goldowsky-Dill
,
StefanHex
,
jake_mendel
,
AlexMeinke
and
rusheb
29 May 2024 17:44 UTC
85
points
0
comments
7
min read
LW
link
The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks
Lucius Bushnaq
,
jake_mendel
,
Dan Braun
,
StefanHex
,
Nicholas Goldowsky-Dill
,
Kaarel
,
Avery
,
Joern Stoehler
,
debrevitatevitae
,
Magdalena Wache
and
Marius Hobbhahn
20 May 2024 17:53 UTC
101
points
2
comments
3
min read
LW
link
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning
Dan Braun
,
Jordan Taylor
,
Nicholas Goldowsky-Dill
and
Lee Sharkey
17 May 2024 16:25 UTC
57
points
2
comments
4
min read
LW
link
(arxiv.org)
Understanding strategic deception and deceptive alignment
Marius Hobbhahn
,
Mikita Balesni
,
Jérémy Scheurer
and
Dan Braun
25 Sep 2023 16:27 UTC
59
points
16
comments
7
min read
LW
link
(www.apolloresearch.ai)
Announcing Apollo Research
Marius Hobbhahn
,
beren
,
Lee Sharkey
,
Lucius Bushnaq
,
Dan Braun
,
Mikita Balesni
and
Jérémy Scheurer
30 May 2023 16:17 UTC
215
points
11
comments
8
min read
LW
link
A small update to the Sparse Coding interim research report
Lee Sharkey
,
Dan Braun
and
beren
30 Apr 2023 19:54 UTC
61
points
5
comments
1
min read
LW
link
Navigating public AI x-risk hype while pursuing technical solutions
Dan Braun
19 Feb 2023 12:22 UTC
18
points
0
comments
2
min read
LW
link
[Interim research report] Taking features out of superposition with sparse autoencoders
Lee Sharkey
,
Dan Braun
and
beren
13 Dec 2022 15:41 UTC
142
points
22
comments
22
min read
LW
link
2
reviews
Interpreting Neural Networks through the Polytope Lens
Sid Black
,
Lee Sharkey
,
Connor Leahy
,
beren
,
CRG
,
merizian
,
Eric Winsor
and
Dan Braun
23 Sep 2022 17:58 UTC
136
points
29
comments
33
min read
LW
link
Back to top