Logan Riggs

Karma: 3,021

Veo-2 Can Produce Realistic Ads

Logan Riggs
Jan 21, 2025, 7:13 PM
14 points
0 comments
1 min read

[Exercise] Four Examples of Noticing Confusion

Logan Riggs
Jan 18, 2025, 3:29 PM
8 points
8 comments
3 min read

How do you deal w/ Super Stimuli?

Logan Riggs
Jan 14, 2025, 3:14 PM
100 points
25 comments
3 min read

When AI 10x’s AI R&D, What Do We Do?

Logan Riggs
Dec 21, 2024, 11:56 PM
72 points
16 comments
4 min read

Logan Riggs’s Shortform

Logan Riggs
Dec 4, 2024, 2:52 PM
7 points
13 comments
1 min read

Book a Time to Chat about Interp Research

Logan Riggs
Dec 3, 2024, 5:27 PM
47 points
3 comments
1 min read

Evaluating Sparse Autoencoders with Board Game Models

Aug 2, 2024, 7:50 PM
38 points
1 comment
9 min read

Interpreting Preference Models w/ Sparse Autoencoders

Jul 1, 2024, 9:35 PM
74 points
12 comments
9 min read

Was Releasing Claude-3 Net-Negative?

Logan Riggs
Mar 27, 2024, 5:41 PM
52 points
5 comments
4 min read

Improving SAE’s by Sqrt()-ing L1 & Removing Lowest Activating Features

Mar 15, 2024, 4:30 PM
26 points
5 comments
4 min read

Finding Sparse Linear Connections between Features in LLMs

Dec 9, 2023, 2:27 AM
70 points
5 comments
10 min read

Sparse Autoencoders: Future Work

Sep 21, 2023, 3:30 PM
35 points
5 comments
6 min read

Sparse Autoencoders Find Highly Interpretable Directions in Language Models

Sep 21, 2023, 3:30 PM
159 points
8 comments
5 min read

Really Strong Features Found in Residual Stream

Logan Riggs
Jul 8, 2023, 7:40 PM
69 points
6 comments
2 min read

(tentatively) Found 600+ Monosemantic Features in a Small LM Using Sparse Autoencoders

Logan Riggs
Jul 5, 2023, 4:49 PM
60 points
1 comment
7 min read

[Replication] Conjecture’s Sparse Coding in Small Transformers

Jun 16, 2023, 6:02 PM
52 points
0 comments
5 min read

[Replication] Conjecture’s Sparse Coding in Toy Models

Jun 2, 2023, 5:34 PM
24 points
0 comments
1 min read

[Simulators seminar sequence] #2 Semiotic physics - revamped

Feb 27, 2023, 12:25 AM
24 points
23 comments
13 min read

Making Implied Standards Explicit

Logan Riggs
Feb 25, 2023, 8:02 PM
22 points
0 comments
4 min read

Proposal for Inducing Steganography in LMs

Logan Riggs
Jan 12, 2023, 10:15 PM
22 points
3 comments
2 min read