Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Avery
Karma:
174
All
Posts
Comments
New
Top
Old
Modifying LLM Beliefs with Synthetic Document Finetuning
RowanWang
,
Johannes Treutlein
,
Avery
,
Ethan Perez
,
Fabien Roger
and
Sam Marks
Apr 24, 2025, 9:15 PM
70
points
12
comments
2
min read
LW
link
(alignment.anthropic.com)
The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks
Lucius Bushnaq
,
jake_mendel
,
Dan Braun
,
StefanHex
,
Nicholas Goldowsky-Dill
,
Kaarel
,
Avery
,
Joern Stoehler
,
debrevitatevitae
,
Magdalena Wache
and
Marius Hobbhahn
May 20, 2024, 5:53 PM
107
points
4
comments
3
min read
LW
link
Basin broadness depends on the size and number of orthogonal features
CallumMcDougall
,
Avery
and
Lucius Bushnaq
Aug 27, 2022, 5:29 PM
36
points
21
comments
6
min read
LW
link
What Is The True Name of Modularity?
CallumMcDougall
,
Lucius Bushnaq
and
Avery
Jul 1, 2022, 2:55 PM
39
points
10
comments
12
min read
LW
link
Ten experiments in modularity, which we’d like you to run!
CallumMcDougall
,
Lucius Bushnaq
and
Avery
Jun 16, 2022, 9:17 AM
62
points
3
comments
9
min read
LW
link
Project Intro: Selection Theorems for Modularity
CallumMcDougall
,
Avery
and
Lucius Bushnaq
Apr 4, 2022, 12:59 PM
73
points
20
comments
16
min read
LW
link
Theories of Modularity in the Biological Literature
CallumMcDougall
,
Avery
and
Lucius Bushnaq
Apr 4, 2022, 12:48 PM
51
points
13
comments
7
min read
LW
link
Back to top
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel