Alexandre Variengien

Karma: 634

My guess at Conjecture’s vision: triggering a narrative bifurcation

Alexandre Variengien

6 Feb 2024 19:10 UTC

75 points

12 comments, 16 min read, LW link

The case for training frontier AIs on Sumerian-only corpus

Alexandre Variengien, Charbel-Raphaël and Jonathan Claybrough

15 Jan 2024 16:40 UTC

130 points

15 comments, 3 min read, LW link

A Universal Emergent Decomposition of Retrieval Tasks in Language Models

Alexandre Variengien and Eric Winsor

19 Dec 2023 11:52 UTC

84 points

3 comments, 10 min read, LW link

(arxiv.org)

Capture the Flag Mechanistic Interpretability Challenges

Alejandro Acelas and Alexandre Variengien

8 Sep 2023 23:00 UTC

24 points

0 comments, 7 min read, LW link

Input Swap Graphs: Discovering the role of neural network components at scale

Alexandre Variengien

12 May 2023 9:41 UTC

92 points

0 comments, 33 min read, LW link

An introduction to language model interpretability

Alexandre Variengien

20 Apr 2023 22:22 UTC

14 points

0 comments, 9 min read, LW link

Some common confusion about induction heads

Alexandre Variengien

28 Mar 2023 21:51 UTC

64 points

4 comments, 5 min read, LW link

Gliders in Language Models

Alexandre Variengien

25 Nov 2022 0:38 UTC

30 points

11 comments, 10 min read, LW link

Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small

KevinRoWang, Alexandre Variengien, Arthur Conmy, Buck and jsteinhardt

28 Oct 2022 23:55 UTC

101 points

9 comments, 9 min read, LW link, 2 reviews

(arxiv.org)

Apply to the Machine Learning For Good bootcamp in France

Alexandre Variengien

17 Jun 2022 7:32 UTC

10 points

0 comments, 1 min read, LW link

Croesus, Cerberus, and the magpies: a gentle introduction to Eliciting Latent Knowledge

Alexandre Variengien

27 May 2022 17:58 UTC

17 points

0 comments, 16 min read, LW link