Logan Riggs

Karma: 3,021

Veo-2 Can Produce Realistic Ads

Logan RiggsJan 21, 2025, 7:13 PM

14 points

0 comments1 min readLW link

[Exercise] Four Examples of Noticing Confusion

Logan RiggsJan 18, 2025, 3:29 PM

8 points

8 comments3 min readLW link

How do you deal w/ Super Stimuli?

Logan RiggsJan 14, 2025, 3:14 PM

100 points

25 comments3 min readLW link

When AI 10x’s AI R&D, What Do We Do?

Logan RiggsDec 21, 2024, 11:56 PM

72 points

16 comments4 min readLW link

Logan Riggs’s Shortform

Logan RiggsDec 4, 2024, 2:52 PM

7 points

13 comments1 min readLW link

Book a Time to Chat about Interp Research

Logan RiggsDec 3, 2024, 5:27 PM

47 points

3 comments1 min readLW link

Evaluating Sparse Autoencoders with Board Game Models

Adam Karvonen, Sam Marks, Can, Benjamin Wright, Jannik Brinkmann, Logan Riggs and Rico Angell

Aug 2, 2024, 7:50 PM

38 points

1 comment9 min readLW link

Interpreting Preference Models w/ Sparse Autoencoders

Logan Riggs and Jannik Brinkmann

Jul 1, 2024, 9:35 PM

74 points

12 comments9 min readLW link

Was Releasing Claude-3 Net-Negative?

Logan RiggsMar 27, 2024, 5:41 PM

52 points

5 comments4 min readLW link

Improving SAE’s by Sqrt()-ing L1 & Removing Lowest Activating Features

Logan Riggs and Jannik Brinkmann

Mar 15, 2024, 4:30 PM

26 points

5 comments4 min readLW link

Finding Sparse Linear Connections between Features in LLMs

Logan Riggs, Sam Mitchell and Adam Kaufman

Dec 9, 2023, 2:27 AM

70 points

5 comments10 min readLW link

Sparse Autoencoders: Future Work

Logan Riggs and Aidan Ewart

Sep 21, 2023, 3:30 PM

35 points

5 comments6 min readLW link

Sparse Autoencoders Find Highly Interpretable Directions in Language Models

Logan Riggs, Hoagy, Aidan Ewart and Robert_AIZI

Sep 21, 2023, 3:30 PM

159 points

8 comments5 min readLW link

Really Strong Features Found in Residual Stream

Logan RiggsJul 8, 2023, 7:40 PM

69 points

6 comments2 min readLW link

(tentatively) Found 600+ Monosemantic Features in a Small LM Using Sparse Autoencoders

Logan RiggsJul 5, 2023, 4:49 PM

60 points

1 comment7 min readLW link

[Replication] Conjecture’s Sparse Coding in Small Transformers

Hoagy and Logan Riggs

Jun 16, 2023, 6:02 PM

52 points

0 comments5 min readLW link

[Replication] Conjecture’s Sparse Coding in Toy Models

Hoagy and Logan Riggs

Jun 2, 2023, 5:34 PM

24 points

0 comments1 min readLW link

[Simulators seminar sequence] #2 Semiotic physics—revamped

Jan, Charlie Steiner, Logan Riggs, janus, jacquesthibs, metasemi, Michael Oesterle, Lucas Teixeira, peligrietzer and remember

Feb 27, 2023, 12:25 AM

24 points

23 comments13 min readLW link

Making Implied Standards Explicit

Logan RiggsFeb 25, 2023, 8:02 PM

22 points

0 comments4 min readLW link

Proposal for Inducing Steganography in LMs

Logan RiggsJan 12, 2023, 10:15 PM

22 points

3 comments2 min readLW link