All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 131415 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Out of the Box

jesseduffield13 Nov 2023 23:43 UTC

5 points

1 comment7 min readLW link

Loudly Give Up, Don’t Quietly Fade

Screwtape13 Nov 2023 23:30 UTC

144 points

11 comments6 min readLW link

Great Empathy and Great Response Ability

positivesum13 Nov 2023 23:04 UTC

16 points

0 comments3 min readLW link

(tryingtruly.substack.com)

Theories of Change for AI Auditing

Lee Sharkey, beren and Marius Hobbhahn

13 Nov 2023 19:33 UTC

54 points

0 comments18 min readLW link

(www.apolloresearch.ai)

They are made of repeating patterns

quetzal_rainbow13 Nov 2023 18:17 UTC

50 points

4 comments2 min readLW link

How to Upload a Mind (In Three Not-So-Easy Steps)

aggliu and Writer

13 Nov 2023 18:13 UTC

26 points

0 comments7 min readLW link

(youtu.be)

Non-myopia stories

lberglund13 Nov 2023 17:52 UTC

29 points

10 comments7 min readLW link

It’s OK to eat shrimp: EAs Make Invalid Inferences About Fish Qualia and Moral Patienthood

Mikhail Samin13 Nov 2023 16:51 UTC

0 points

17 comments1 min readLW link

Suggestions for chess puzzles

Zane13 Nov 2023 15:39 UTC

13 points

1 comment1 min readLW link

Why small phenomenons are relevant to morality

Ryo 13 Nov 2023 15:25 UTC

1 point

0 comments3 min readLW link

Optionality approach to ethics

Ryo 13 Nov 2023 15:23 UTC

7 points

2 comments3 min readLW link

Redirecting one’s own taxes as an effective altruism method

David Gross13 Nov 2023 15:17 UTC

2 points

34 comments16 min readLW link

AISC Project: Benchmarks for Stable Reflectivity

jacquesthibs13 Nov 2023 14:51 UTC

17 points

0 comments8 min readLW link

AISC Project: Modelling Trajectories of Language Models

NickyP13 Nov 2023 14:33 UTC

27 points

0 comments12 min readLW link

Bostrom Goes Unheard

Zvi13 Nov 2023 14:11 UTC

81 points

9 comments18 min readLW link

November hangout in Warsaw

ntoxeg13 Nov 2023 13:20 UTC

1 point

1 comment1 min readLW link

The Science Algorithm AISC Project

Johannes C. Mayer13 Nov 2023 12:52 UTC

12 points

0 comments1 min readLW link

(docs.google.com)

You can just spontaneously call people you haven’t met in years

lc13 Nov 2023 5:21 UTC

165 points

21 comments1 min readLW link

Zvi’s Manifold Markets House Rules

Zvi13 Nov 2023 0:28 UTC

53 points

6 comments3 min readLW link

[Question] What’s your best utilitarian model for risking your best kidneys?

Ilio12 Nov 2023 23:01 UTC

−3 points

4 comments1 min readLW link

Helpful examples to get a sense of modern automated manipulation

trevor12 Nov 2023 20:49 UTC

33 points

4 comments9 min readLW link

The Snuggle/Date/Slap Protocol

MadHatter12 Nov 2023 20:44 UTC

−21 points

4 comments2 min readLW link

Two children’s stories

Optimization Process12 Nov 2023 20:29 UTC

11 points

1 comment7 min readLW link

The Fundamental Theorem for measurable factor spaces

Matthias G. Mayer12 Nov 2023 19:25 UTC

38 points

2 comments2 min readLW link

How accurate are standard Dark Triad personality scales?

jamesbill12 Nov 2023 8:21 UTC

0 points

2 comments2 min readLW link

[Question] What ML gears do you like?

Ulisse Mini11 Nov 2023 19:10 UTC

25 points

4 comments1 min readLW link

Smart Sessions—Finally a (kinda) window-centric session manager

Eli Tyre11 Nov 2023 18:54 UTC

14 points

3 comments5 min readLW link

AISC project: SatisfIA – AI that satisfies without overdoing it

Jobst Heitzig11 Nov 2023 18:22 UTC

12 points

0 comments1 min readLW link

(docs.google.com)

Control Symmetry: why we might want to start investigating asymmetric alignment interventions

domenicrosati11 Nov 2023 17:27 UTC

25 points

1 comment2 min readLW link

Game Theory without Argmax [Part 2]

Cleo Nardo11 Nov 2023 16:02 UTC

31 points

14 comments13 min readLW link

Game Theory without Argmax [Part 1]

Cleo Nardo11 Nov 2023 15:59 UTC

70 points

18 comments19 min readLW link

It’s OK to be biased towards humans

dr_s11 Nov 2023 11:59 UTC

55 points

69 comments6 min readLW link

The Top AI Safety Bets for 2023: GiveWiki’s Latest Recommendations

Dawn Drescher11 Nov 2023 9:04 UTC

3 points

2 comments1 min readLW link

Artificial General Horsiness

robotelvis11 Nov 2023 5:15 UTC

4 points

0 comments5 min readLW link

(messyprogress.substack.com)

Palisade is hiring Research Engineers

Charlie Rogers-Smith and Jeffrey Ladish

11 Nov 2023 3:09 UTC

23 points

0 comments3 min readLW link

Open Phil releases RFPs on LLM Benchmarks and Forecasting

LawrenceC11 Nov 2023 3:01 UTC

53 points

0 comments2 min readLW link

(www.openphilanthropy.org)

Memo on some neglected topics

Lukas Finnveden11 Nov 2023 2:01 UTC

28 points

2 comments1 min readLW link

(open.substack.com)

Who is Sam Bankman-Fried (SBF) really, and how could he have done what he did? - three theories and a lot of evidence

spencerg11 Nov 2023 1:04 UTC

36 points

28 comments1 min readLW link

(www.spencergreenberg.com)

Survey on the acceleration risks of our new RFPs to study LLM capabilities

Ajeya Cotra10 Nov 2023 23:59 UTC

27 points

1 comment1 min readLW link

Rat Fest 2024

LoganChipkin10 Nov 2023 23:25 UTC

7 points

6 comments1 min readLW link

How I Think, Part Three: Weighing Cryonics

Richard Henage10 Nov 2023 22:21 UTC

4 points

1 comment2 min readLW link

Linear encoding of character-level information in GPT-J token embeddings

mwatkins and Joseph Bloom

10 Nov 2023 22:19 UTC

34 points

4 comments28 min readLW link

Follow-up survey: inositol

Elizabeth10 Nov 2023 19:30 UTC

13 points

1 comment1 min readLW link

(acesounderglass.com)

We have promising alignment plans with low taxes

Seth Herd10 Nov 2023 18:51 UTC

40 points

9 comments5 min readLW link

[Question] Vector search on a large dataset?

camsdixon10 Nov 2023 18:43 UTC

−1 points

2 comments1 min readLW link

About Me

Abe Dillon10 Nov 2023 18:32 UTC

3 points

0 comments1 min readLW link

Metaculus Introduces AI-Powered Community Insights to Reveal Factors Driving User Forecasts

ChristianWilliams10 Nov 2023 17:57 UTC

6 points

0 comments1 min readLW link

(www.metaculus.com)

Joy in the Here and Real

Screwtape10 Nov 2023 17:22 UTC

18 points

0 comments2 min readLW link

Artefacts generated by mode collapse in GPT-4 Turbo serve as adversarial attacks.

Sohaib Imran10 Nov 2023 15:23 UTC

11 points

0 comments2 min readLW link

Wastewater RNA Read Lengths

jefftk10 Nov 2023 15:20 UTC

13 points

0 comments4 min readLW link

(www.jefftk.com)