
Has Diagram

Last edit: 29 Apr 2023 22:52 UTC by Gunnar_Zarncke

This tag indicates that a post contains diagrams. It can be used to find such posts quickly, or to exclude them, for example by readers who are visually impaired.

What are the results of more parental supervision and less outdoor play?

juliawise · 25 Nov 2023 12:52 UTC
221 points · 31 comments · 5 min read · LW link

Using axis lines for good or evil

dynomight · 6 Mar 2024 14:47 UTC
150 points · 39 comments · 4 min read · LW link (dynomight.net)

Neural Categories

Eliezer Yudkowsky · 10 Feb 2008 0:33 UTC
61 points · 17 comments · 4 min read · LW link

The lattice of partial updatelessness

Martín Soto · 10 Feb 2024 17:34 UTC
21 points · 5 comments · 5 min read · LW link

Demystifying “Alignment” through a Comic

milanrosko · 9 Jun 2024 8:24 UTC
106 points · 19 comments · 1 min read · LW link

Shard Theory—is it true for humans?

Rishika · 14 Jun 2024 19:21 UTC
68 points · 7 comments · 15 min read · LW link

Towards a Less Bullshit Model of Semantics

17 Jun 2024 15:51 UTC
94 points · 44 comments · 21 min read · LW link

How good are LLMs at doing ML on an unknown dataset?

Håvard Tveit Ihle · 1 Jul 2024 9:04 UTC
33 points · 4 comments · 13 min read · LW link

[Intro to brain-like-AGI safety] 4. The “short-term predictor”

Steven Byrnes · 16 Feb 2022 13:12 UTC
64 points · 11 comments · 13 min read · LW link

An Introduction To The Mandelbrot Set That Doesn’t Mention Complex Numbers

Yitz · 17 Jan 2024 9:48 UTC
82 points · 11 comments · 9 min read · LW link

An Illustrated Proof of the No Free Lunch Theorem

lifelonglearner · 8 Jun 2020 1:54 UTC
19 points · 0 comments · 1 min read · LW link (mlu.red)

Corrigibility, Much more detail than anyone wants to Read

Logan Zoellner · 7 May 2023 1:02 UTC
26 points · 2 comments · 7 min read · LW link

How much do you believe your results?

Eric Neyman · 6 May 2023 20:31 UTC
467 points · 14 comments · 15 min read · LW link (ericneyman.wordpress.com)

Residual stream norms grow exponentially over the forward pass

7 May 2023 0:46 UTC
76 points · 24 comments · 11 min read · LW link

Being the (Pareto) Best in the World

johnswentworth · 24 Jun 2019 18:36 UTC
444 points · 59 comments · 3 min read · LW link · 3 reviews

Hyperpolation

Gunnar_Zarncke · 15 Sep 2024 21:37 UTC
22 points · 6 comments · 1 min read · LW link (arxiv.org)

The case for a negative alignment tax

18 Sep 2024 18:33 UTC
79 points · 20 comments · 7 min read · LW link

Machine Learning Analogy for Meditation (illustrated)

abramdemski · 28 Jun 2018 22:51 UTC
100 points · 48 comments · 1 min read · LW link

How might we solve the alignment problem? (Part 1: Intro, summary, ontology)

Joe Carlsmith · 28 Oct 2024 21:57 UTC
51 points · 5 comments · 32 min read · LW link

I turned decision theory problems into memes about trolleys

Tapatakt · 30 Oct 2024 20:13 UTC
103 points · 20 comments · 1 min read · LW link

The Cartoon Guide to Löb’s Theorem

Eliezer Yudkowsky · 17 Aug 2008 20:35 UTC
44 points · 104 comments · 1 min read · LW link

[Intro to brain-like-AGI safety] 10. The alignment problem

Steven Byrnes · 30 Mar 2022 13:24 UTC
48 points · 7 comments · 19 min read · LW link

[Intro to brain-like-AGI safety] 12. Two paths forward: “Controlled AGI” and “Social-instinct AGI”

Steven Byrnes · 20 Apr 2022 12:58 UTC
44 points · 10 comments · 15 min read · LW link

Drawing Less Wrong: Technical Skill

Raemon · 5 Dec 2011 5:12 UTC
37 points · 36 comments · 9 min read · LW link

All images from the WaitButWhy sequence on AI

trevor · 8 Apr 2023 7:36 UTC
73 points · 5 comments · 2 min read · LW link

The Natural Abstraction Hypothesis: Implications and Evidence

CallumMcDougall · 14 Dec 2021 23:14 UTC
39 points · 9 comments · 19 min read · LW link

Testing The Natural Abstraction Hypothesis: Project Update

johnswentworth · 20 Sep 2021 3:44 UTC
87 points · 17 comments · 8 min read · LW link · 1 review

Open technical problem: A Quinean proof of Löb’s theorem, for an easier cartoon guide

Andrew_Critch · 24 Nov 2022 21:16 UTC
58 points · 35 comments · 3 min read · LW link · 1 review

[Intro to brain-like-AGI safety] 5. The “long-term predictor”, and TD learning

Steven Byrnes · 23 Feb 2022 14:44 UTC
52 points · 27 comments · 20 min read · LW link

[Intro to brain-like-AGI safety] 6. Big picture of motivation, decision-making, and RL

Steven Byrnes · 2 Mar 2022 15:26 UTC
68 points · 17 comments · 16 min read · LW link

[Intro to brain-like-AGI safety] 7. From hardcoded drives to foresighted plans: A worked example

Steven Byrnes · 9 Mar 2022 14:28 UTC
78 points · 0 comments · 10 min read · LW link

[Intro to brain-like-AGI safety] 8. Takeaways from neuro 1/2: On AGI development

Steven Byrnes · 16 Mar 2022 13:59 UTC
57 points · 2 comments · 14 min read · LW link

[Intro to brain-like-AGI safety] 9. Takeaways from neuro 2/2: On AGI motivation

Steven Byrnes · 23 Mar 2022 12:48 UTC
44 points · 11 comments · 22 min read · LW link

[Intro to brain-like-AGI safety] 13. Symbol grounding & human social instincts

Steven Byrnes · 27 Apr 2022 13:30 UTC
69 points · 15 comments · 15 min read · LW link

[Intro to brain-like-AGI safety] 14. Controlled AGI

Steven Byrnes · 11 May 2022 13:17 UTC
41 points · 25 comments · 20 min read · LW link

[Intro to brain-like-AGI safety] 1. What’s the problem & Why work on it now?

Steven Byrnes · 26 Jan 2022 15:23 UTC
155 points · 19 comments · 26 min read · LW link

[Intro to brain-like-AGI safety] 2. “Learning from scratch” in the brain

Steven Byrnes · 2 Feb 2022 13:22 UTC
59 points · 12 comments · 25 min read · LW link

[Intro to brain-like-AGI safety] 3. Two subsystems: Learning & Steering

Steven Byrnes · 9 Feb 2022 13:09 UTC
95 points · 3 comments · 25 min read · LW link

[Valence series] 4. Valence & Social Status (deprecated)

Steven Byrnes · 15 Dec 2023 14:24 UTC
35 points · 19 comments · 11 min read · LW link

Bayes’ Theorem Illustrated (My Way)

komponisto · 3 Jun 2010 4:40 UTC
171 points · 195 comments · 9 min read · LW link

Induction heads—illustrated

CallumMcDougall · 2 Jan 2023 15:35 UTC
111 points · 9 comments · 3 min read · LW link

Levels of goals and alignment

zeshen · 16 Sep 2022 16:44 UTC
27 points · 4 comments · 6 min read · LW link

A newcomer’s guide to the technical AI safety field

zeshen · 4 Nov 2022 14:29 UTC
42 points · 3 comments · 10 min read · LW link

Embedding safety in ML development

zeshen · 31 Oct 2022 12:27 UTC
24 points · 1 comment · 18 min read · LW link