Contra Common Knowledge

abramdemski · 4 Jan 2023 22:50 UTC
52 points
31 comments · 16 min read · LW link

Additional space complexity isn’t always a useful metric

Brendan Long · 4 Jan 2023 21:53 UTC
4 points
3 comments · 3 min read · LW link
(www.brendanlong.com)

List of links for getting into AI safety

zef · 4 Jan 2023 19:45 UTC
6 points
0 comments · 1 min read · LW link

Opening Facebook Links Externally

jefftk · 4 Jan 2023 19:00 UTC
12 points
3 comments · 1 min read · LW link
(www.jefftk.com)

Conversational canyons

Henrik Karlsson · 4 Jan 2023 18:55 UTC
59 points
4 comments · 7 min read · LW link
(escapingflatland.substack.com)

Progress links and tweets, 2023-01-04

jasoncrawford · 4 Jan 2023 18:23 UTC
15 points
0 comments · 1 min read · LW link
(rootsofprogress.org)

200 COP in MI: Analysing Training Dynamics

Neel Nanda · 4 Jan 2023 16:08 UTC
16 points
0 comments · 14 min read · LW link

What’s up with ChatGPT and the Turing Test?

4 Jan 2023 15:37 UTC
13 points
19 comments · 3 min read · LW link

2022 was the year AGI arrived (Just don’t call it that)

Logan Zoellner · 4 Jan 2023 15:19 UTC
102 points
60 comments · 3 min read · LW link

From Simon’s ant to machine learning, a parable

Bill Benzon · 4 Jan 2023 14:37 UTC
6 points
5 comments · 2 min read · LW link

Basic Facts about Language Model Internals

4 Jan 2023 13:01 UTC
130 points
19 comments · 9 min read · LW link

Ritual as the only tool for overwriting values and goals

mrcbarbier · 4 Jan 2023 11:11 UTC
40 points
24 comments · 32 min read · LW link

Normalcy bias and Base rate neglect: Bias in Evaluating AGI X-Risks

Remmelt · 4 Jan 2023 3:16 UTC
−16 points
0 comments · 1 min read · LW link

Causal representation learning as a technique to prevent goal misgeneralization

PabloAMC · 4 Jan 2023 0:07 UTC
19 points
0 comments · 8 min read · LW link

What makes a probability question “well-defined”? (Part II: Bertrand’s Paradox)

Noah Topper · 3 Jan 2023 22:39 UTC
7 points
3 comments · 9 min read · LW link
(naivebayes.substack.com)

“AI” is an indexical

TW123 · 3 Jan 2023 22:00 UTC
10 points
0 comments · 6 min read · LW link
(aiwatchtower.substack.com)

An ML interpretation of Shard Theory

beren · 3 Jan 2023 20:30 UTC
39 points
5 comments · 4 min read · LW link

Talking to God

abramdemski · 3 Jan 2023 20:14 UTC
30 points
7 comments · 2 min read · LW link

My Advice for Incoming SERI MATS Scholars

Johannes C. Mayer · 3 Jan 2023 19:25 UTC
58 points
6 comments · 4 min read · LW link

Touch reality as soon as possible (when doing machine learning research)

LawrenceC · 3 Jan 2023 19:11 UTC
112 points
8 comments · 8 min read · LW link

Kolb’s: an approach to consciously get better at anything

jacquesthibs · 3 Jan 2023 18:16 UTC
12 points
1 comment · 6 min read · LW link

[Question] {M|Im|Am}oral Mazes—any large-scale counterexamples?

Dagon · 3 Jan 2023 16:43 UTC
24 points
4 comments · 1 min read · LW link

Effectively self-studying over the Internet

libai · 3 Jan 2023 16:23 UTC
4 points
0 comments · 4 min read · LW link

Set-like mathematics in type theory

Thomas Kehrenberg · 3 Jan 2023 14:33 UTC
4 points
1 comment · 13 min read · LW link

Monthly Roundup #2

Zvi · 3 Jan 2023 12:50 UTC
23 points
3 comments · 23 min read · LW link
(thezvi.wordpress.com)

Whisper’s Wild Implications

Ollie J · 3 Jan 2023 12:17 UTC
19 points
6 comments · 5 min read · LW link

How to eat potato chips while typing

KatjaGrace · 3 Jan 2023 11:50 UTC
45 points
12 comments · 1 min read · LW link
(worldspiritsockpuppet.com)

[Question] I have thousands of copies of HPMOR in Russian. How to use them with the most impact?

Mikhail Samin · 3 Jan 2023 10:21 UTC
26 points
3 comments · 1 min read · LW link

Is recursive self-alignment possible?

No77e · 3 Jan 2023 9:15 UTC
5 points
5 comments · 1 min read · LW link

On the naturalistic study of the linguistic behavior of artificial intelligence

Bill Benzon · 3 Jan 2023 9:06 UTC
1 point
0 comments · 4 min read · LW link

SF Severe Weather Warning

stavros · 3 Jan 2023 6:04 UTC
3 points
3 comments · 1 min read · LW link
(news.ycombinator.com)

Status quo bias; System justification: Bias in Evaluating AGI X-Risks

3 Jan 2023 2:50 UTC
−11 points
0 comments · 1 min read · LW link

200 COP in MI: Exploring Polysemanticity and Superposition

Neel Nanda · 3 Jan 2023 1:52 UTC
34 points
6 comments · 16 min read · LW link

The need for speed in web frameworks?

Adam Zerner · 3 Jan 2023 0:06 UTC
19 points
2 comments · 8 min read · LW link

[Simulators seminar sequence] #1 Background & shared assumptions

2 Jan 2023 23:48 UTC
50 points
4 comments · 3 min read · LW link

Linear Algebra Done Right, Axler

David Udell · 2 Jan 2023 22:54 UTC
56 points
6 comments · 9 min read · LW link

MacArthur BART (Filk)

Gordon Seidoh Worley · 2 Jan 2023 22:50 UTC
10 points
1 comment · 1 min read · LW link

Knottiness

abramdemski · 2 Jan 2023 22:13 UTC
43 points
4 comments · 2 min read · LW link

[Question] Default Sort for Shortforms is Very Bad; How Do I Change It?

DragonGod · 2 Jan 2023 21:50 UTC
15 points
0 comments · 1 min read · LW link

MAKE IT BETTER (a poetic demonstration of the banality of GPT-3)

rogersbacon · 2 Jan 2023 20:47 UTC
7 points
2 comments · 5 min read · LW link

Review of “Make People Better”

Metacelsus · 2 Jan 2023 20:30 UTC
10 points
0 comments · 3 min read · LW link
(denovo.substack.com)

Preparing for Less Privacy

jefftk · 2 Jan 2023 20:30 UTC
23 points
1 comment · 2 min read · LW link
(www.jefftk.com)

Large language models can provide “normative assumptions” for learning human preferences

Stuart_Armstrong · 2 Jan 2023 19:39 UTC
29 points
12 comments · 3 min read · LW link

On the Importance of Open Sourcing Reward Models

elandgre · 2 Jan 2023 19:01 UTC
18 points
5 comments · 6 min read · LW link

Prediction Markets for Science

Vaniver · 2 Jan 2023 17:55 UTC
27 points
7 comments · 5 min read · LW link

Why don’t Rationalists use bidets?

Lakin · 2 Jan 2023 17:42 UTC
31 points
33 comments · 2 min read · LW link

Soft optimization makes the value target bigger

Jeremy Gillen · 2 Jan 2023 16:06 UTC
117 points
20 comments · 12 min read · LW link

Results from the AI testing hackathon

Esben Kran · 2 Jan 2023 15:46 UTC
13 points
0 comments · 1 min read · LW link

Induction heads—illustrated

CallumMcDougall · 2 Jan 2023 15:35 UTC
111 points
9 comments · 3 min read · LW link

Opportunity Cost Blackmail

adamShimi · 2 Jan 2023 13:48 UTC
70 points
11 comments · 2 min read · LW link
(epistemologicalvigilance.substack.com)