Lucius Bushnaq

Karma: 2,486

AI notkilleveryoneism researcher, focused on interpretability.

Personal account, opinions are my own.

I have signed no contracts or agreements whose existence I cannot mention.

Circuits in Superposition: Compressing many small neural networks into one

14 Oct 2024 13:06 UTC
126 points
8 comments · 13 min read · LW link

The Hessian rank bounds the learning coefficient

Lucius Bushnaq · 8 Aug 2024 20:55 UTC
68 points
9 comments · 4 min read · LW link

A List of 45+ Mech Interp Project Ideas from Apollo Research’s Interpretability Team

18 Jul 2024 14:15 UTC
117 points
18 comments · 18 min read · LW link

Lucius Bushnaq’s Shortform

Lucius Bushnaq · 6 Jul 2024 9:08 UTC
6 points
58 comments · 1 min read · LW link

Apollo Research 1-year update

29 May 2024 17:44 UTC
93 points
0 comments · 7 min read · LW link

Interpretability: Integrated Gradients is a decent attribution method

20 May 2024 17:55 UTC
22 points
7 comments · 6 min read · LW link

The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks

20 May 2024 17:53 UTC
105 points
4 comments · 3 min read · LW link

Charbel-Raphaël and Lucius discuss interpretability

30 Oct 2023 5:50 UTC
105 points
7 comments · 21 min read · LW link

Announcing Apollo Research

30 May 2023 16:17 UTC
215 points
11 comments · 8 min read · LW link

Basin broadness depends on the size and number of orthogonal features

27 Aug 2022 17:29 UTC
36 points
21 comments · 6 min read · LW link

What Is The True Name of Modularity?

1 Jul 2022 14:55 UTC
39 points
10 comments · 12 min read · LW link

Ten experiments in modularity, which we’d like you to run!

16 Jun 2022 9:17 UTC
62 points
3 comments · 9 min read · LW link

Project Intro: Selection Theorems for Modularity

4 Apr 2022 12:59 UTC
73 points
20 comments · 16 min read · LW link

Theories of Modularity in the Biological Literature

4 Apr 2022 12:48 UTC
51 points
13 comments · 7 min read · LW link

Welcome to the SSC Dublin Meetup

Lucius Bushnaq · 30 Jul 2020 18:56 UTC
3 points
2 comments · 1 min read · LW link