All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 141516 17 18 19 20 21 22 23 24 25 26 27 28 29 30

SIA Is Just Being a Bayesian About the Fact That One Exists

omnizoid14 Nov 2023 22:55 UTC

3 points

5 comments4 min readLW link

AI Alignment [progress] this Week (11/12/2023)

Logan Zoellner14 Nov 2023 22:21 UTC

6 points

0 comments2 min readLW link

(midwitalignment.substack.com)

[Question] When did Eliezer Yudkowsky change his mind about neural networks?

[deactivated]14 Nov 2023 21:24 UTC

31 points

15 comments1 min readLW link

Betting on what is un-falsifiable and un-verifiable

Abhimanyu Pallavi Sudhir14 Nov 2023 21:11 UTC

13 points

0 comments15 min readLW link

Facebook is Paying Me to Post

jefftk14 Nov 2023 19:10 UTC

26 points

5 comments1 min readLW link

(www.jefftk.com)

Feelings, Nothing More than Feelings, About AI

PaulBecon14 Nov 2023 18:50 UTC

7 points

0 comments3 min readLW link

Kids or No kids

Kids or no kids14 Nov 2023 18:37 UTC

95 points

10 comments13 min readLW link

Raemon’s Deliberate (“Purposeful?”) Practice Club

Raemon, Elizabeth, lynettebye and Alex_Altair

14 Nov 2023 18:24 UTC

60 points

11 comments22 min readLW link

More metal less ore

Logan Kieller14 Nov 2023 16:59 UTC

6 points

3 comments2 min readLW link

(logankieller.substack.com)

Monthly Roundup #12: November 2023

Zvi14 Nov 2023 15:20 UTC

34 points

5 comments33 min readLW link

(thezvi.wordpress.com)

Do you want a first-principled preparedness guide to prepare yourself and loved ones for potential catastrophes?

Ulrik Horn14 Nov 2023 12:13 UTC

16 points

5 comments15 min readLW link

[Question] Is there Work on Embedded Agency in Cellular Automata Toy Models?

Johannes C. Mayer14 Nov 2023 9:08 UTC

10 points

0 comments1 min readLW link

[Question] Would this be Progress in Solving Embedded Agency?

Johannes C. Mayer14 Nov 2023 9:08 UTC

9 points

2 comments2 min readLW link

Is Interpretability All We Need?

RogerDearnaley14 Nov 2023 5:31 UTC

1 point

1 comment1 min readLW link

What is wisdom?

TsviBT14 Nov 2023 2:13 UTC

37 points

3 comments13 min readLW link

Festival Stats 2023

jefftk14 Nov 2023 1:20 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

Out of the Box

jesseduffield13 Nov 2023 23:43 UTC

5 points

1 comment7 min readLW link

Loudly Give Up, Don’t Quietly Fade

Screwtape13 Nov 2023 23:30 UTC

144 points

11 comments6 min readLW link

Great Empathy and Great Response Ability

positivesum13 Nov 2023 23:04 UTC

16 points

0 comments3 min readLW link

(tryingtruly.substack.com)

Theories of Change for AI Auditing

Lee Sharkey, beren and Marius Hobbhahn

13 Nov 2023 19:33 UTC

54 points

0 comments18 min readLW link

(www.apolloresearch.ai)

They are made of repeating patterns

quetzal_rainbow13 Nov 2023 18:17 UTC

50 points

4 comments2 min readLW link

How to Upload a Mind (In Three Not-So-Easy Steps)

aggliu and Writer

13 Nov 2023 18:13 UTC

26 points

0 comments7 min readLW link

(youtu.be)

Non-myopia stories

lberglund13 Nov 2023 17:52 UTC

29 points

10 comments7 min readLW link

It’s OK to eat shrimp: EAs Make Invalid Inferences About Fish Qualia and Moral Patienthood

Mikhail Samin13 Nov 2023 16:51 UTC

0 points

17 comments1 min readLW link

Suggestions for chess puzzles

Zane13 Nov 2023 15:39 UTC

13 points

1 comment1 min readLW link

Why small phenomenons are relevant to morality

Ryo 13 Nov 2023 15:25 UTC

1 point

0 comments3 min readLW link

Optionality approach to ethics

Ryo 13 Nov 2023 15:23 UTC

7 points

2 comments3 min readLW link

Redirecting one’s own taxes as an effective altruism method

David Gross13 Nov 2023 15:17 UTC

2 points

34 comments16 min readLW link

AISC Project: Benchmarks for Stable Reflectivity

jacquesthibs13 Nov 2023 14:51 UTC

17 points

0 comments8 min readLW link

AISC Project: Modelling Trajectories of Language Models

NickyP13 Nov 2023 14:33 UTC

27 points

0 comments12 min readLW link

Bostrom Goes Unheard

Zvi13 Nov 2023 14:11 UTC

81 points

9 comments18 min readLW link

November hangout in Warsaw

ntoxeg13 Nov 2023 13:20 UTC

1 point

1 comment1 min readLW link

The Science Algorithm AISC Project

Johannes C. Mayer13 Nov 2023 12:52 UTC

12 points

0 comments1 min readLW link

(docs.google.com)

You can just spontaneously call people you haven’t met in years

lc13 Nov 2023 5:21 UTC

165 points

21 comments1 min readLW link

Zvi’s Manifold Markets House Rules

Zvi13 Nov 2023 0:28 UTC

53 points

6 comments3 min readLW link

[Question] What’s your best utilitarian model for risking your best kidneys?

Ilio12 Nov 2023 23:01 UTC

−3 points

4 comments1 min readLW link

Helpful examples to get a sense of modern automated manipulation

trevor12 Nov 2023 20:49 UTC

33 points

4 comments9 min readLW link

The Snuggle/Date/Slap Protocol

MadHatter12 Nov 2023 20:44 UTC

−21 points

4 comments2 min readLW link

Two children’s stories

Optimization Process12 Nov 2023 20:29 UTC

11 points

1 comment7 min readLW link

The Fundamental Theorem for measurable factor spaces

Matthias G. Mayer12 Nov 2023 19:25 UTC

38 points

2 comments2 min readLW link

How accurate are standard Dark Triad personality scales?

jamesbill12 Nov 2023 8:21 UTC

0 points

2 comments2 min readLW link

[Question] What ML gears do you like?

Ulisse Mini11 Nov 2023 19:10 UTC

25 points

4 comments1 min readLW link

Smart Sessions—Finally a (kinda) window-centric session manager

Eli Tyre11 Nov 2023 18:54 UTC

14 points

3 comments5 min readLW link

AISC project: SatisfIA – AI that satisfies without overdoing it

Jobst Heitzig11 Nov 2023 18:22 UTC

12 points

0 comments1 min readLW link

(docs.google.com)

Control Symmetry: why we might want to start investigating asymmetric alignment interventions

domenicrosati11 Nov 2023 17:27 UTC

25 points

1 comment2 min readLW link

Game Theory without Argmax [Part 2]

Cleo Nardo11 Nov 2023 16:02 UTC

31 points

14 comments13 min readLW link

Game Theory without Argmax [Part 1]

Cleo Nardo11 Nov 2023 15:59 UTC

70 points

18 comments19 min readLW link

It’s OK to be biased towards humans

dr_s11 Nov 2023 11:59 UTC

55 points

69 comments6 min readLW link

The Top AI Safety Bets for 2023: GiveWiki’s Latest Recommendations

Dawn Drescher11 Nov 2023 9:04 UTC

3 points

2 comments1 min readLW link

Artificial General Horsiness

robotelvis11 Nov 2023 5:15 UTC

4 points

0 comments5 min readLW link

(messyprogress.substack.com)