An upcoming US Supreme Court case may impede AI governance efforts

NickGabs

16 Jul 2023 23:51 UTC

57 points

17 comments · 2 min read · LW link

Weak Evidence is Common

dkl9

16 Jul 2023 23:37 UTC

7 points

5 comments · 1 min read · LW link

(dkl9.net)

Even briefer summary of ai-plans.com

Iknownothing

16 Jul 2023 23:25 UTC

10 points

6 comments · 2 min read · LW link

(www.ai-plans.com)

Mech Interp Puzzle 1: Suspiciously Similar Embeddings in GPT-Neo

Neel Nanda

16 Jul 2023 22:02 UTC

66 points

15 comments · 1 min read · LW link

A Technology of Everything – Part 1: A Magical Science Experiment

aiuisensei

16 Jul 2023 22:01 UTC

−3 points

0 comments · 7 min read · LW link

(www.aiui.cloud)

Scaling and Sustaining Standards: A Case Study on the Basel Accords

Conrad K.

16 Jul 2023 22:01 UTC

8 points

1 comment · 7 min read · LW link

(docs.google.com)

AI, Consciousness, and the problem of Moral Considerability

stultus

16 Jul 2023 19:56 UTC

1 point

0 comments · 2 min read · LW link

Narrative Theory. Part 3. Simplest to succeed

Eris

16 Jul 2023 14:41 UTC

4 points

0 comments · 1 min read · LW link

Runaway Optimizers in Mind Space

silentbob

16 Jul 2023 14:26 UTC

16 points

0 comments · 12 min read · LW link

[Question] Is Adam Elga’s proof for thirdism in Sleeping Beauty still considered to be sound?

Ape in the coat

16 Jul 2023 14:11 UTC

8 points

25 comments · 1 min read · LW link

A simple way of exploiting AI’s coming economic impact may be highly-impactful

kuira

16 Jul 2023 9:33 UTC

11 points

2 comments · 2 min read · LW link

Activation adding experiments with llama-7b

Nina Panickssery

16 Jul 2023 4:17 UTC

51 points

1 comment · 3 min read · LW link

Introducción al Riesgo Existencial de Inteligencia Artificial

david.friva

15 Jul 2023 20:37 UTC

4 points

2 comments · 4 min read · LW link

(youtu.be)

The housing crisis, explained using game theory

Johnstone

15 Jul 2023 20:27 UTC

4 points

2 comments · 8 min read · LW link

Only a hack can solve the shutdown problem

dp

15 Jul 2023 20:26 UTC

5 points

0 comments · 8 min read · LW link

Robustness of Model-Graded Evaluations and Automated Interpretability

Simon Lermen and viluon

15 Jul 2023 19:12 UTC

47 points

5 comments · 9 min read · LW link

[Question] How to deal with fear of failure?

TeaTieAndHat

15 Jul 2023 18:57 UTC

1 point

2 comments · 1 min read · LW link

Simplified bio-anchors for upper bounds on AI timelines

Fabien Roger

15 Jul 2023 18:15 UTC

21 points

4 comments · 5 min read · LW link

A Hill of Validity in Defense of Meaning

Zack_M_Davis

15 Jul 2023 17:57 UTC

8 points

118 comments · 75 min read · LW link

(unremediatedgender.space)

What is a cognitive bias?

Lionel

15 Jul 2023 13:01 UTC

1 point

0 comments · 2 min read · LW link

(lionelpage.substack.com)

[Question] When people say robots will steal jobs, what kinds of jobs are never implied?

Mary Chernyshenko

15 Jul 2023 10:50 UTC

5 points

12 comments · 1 min read · LW link

Narrative Theory. Part 2. A new way of doing the same thing

Eris

15 Jul 2023 10:37 UTC

2 points

0 comments · 1 min read · LW link

How to use ChatGPT to get better book & movie recommendations

KatWoods

15 Jul 2023 8:55 UTC

29 points

3 comments · 1 min read · LW link

[Question] Would you take a job making humanoid robots for an AGI?

Super AGI

15 Jul 2023 5:26 UTC

−1 points

2 comments · 1 min read · LW link

Rationality, Pedagogy, and “Vibes”: Quick Thoughts

Nicholas / Heather Kross

15 Jul 2023 2:09 UTC

14 points

1 comment · 4 min read · LW link

(redacted) Anomalous tokens might disproportionately affect complex language tasks

Nikola Jurkovic

15 Jul 2023 0:48 UTC

4 points

0 comments · 7 min read · LW link

Why was the AI Alignment community so unprepared for this moment?

Ras1513

15 Jul 2023 0:26 UTC

121 points

65 comments · 2 min read · LW link

Physics is Ultimately Subjective

Gordon Seidoh Worley

14 Jul 2023 22:19 UTC

5 points

34 comments · 3 min read · LW link

[Question] How should a rational agent construct their utility function when faced with existence?

Aman Rusia

14 Jul 2023 19:48 UTC

−2 points

1 comment · 1 min read · LW link

AI Risk and Survivorship Bias—How Andreessen and LeCun got it wrong

Štěpán Los

14 Jul 2023 17:43 UTC

13 points

2 comments · 6 min read · LW link

Unsafe AI as Dynamical Systems

Robert_AIZI

14 Jul 2023 15:31 UTC

11 points

0 comments · 3 min read · LW link

(aizi.substack.com)

A Short Summary of “Focus Your Uncertainty”

Stephen James

14 Jul 2023 11:18 UTC

2 points

0 comments · 1 min read · LW link

Do the change you want to see in the world

TeaTieAndHat

14 Jul 2023 10:19 UTC

7 points

0 comments · 1 min read · LW link

Gearing Up for Long Timelines in a Hard World

Dalcy

14 Jul 2023 6:11 UTC

15 points

0 comments · 4 min read · LW link

When Someone Tells You They’re Lying, Believe Them

ymeskhout

14 Jul 2023 0:31 UTC

95 points

3 comments · 3 min read · LW link

Activation adding experiments with FLAN-T5

Nina Panickssery

13 Jul 2023 23:32 UTC

21 points

5 comments · 7 min read · LW link

[Question] What criterion would you use to select companies likely to cause AI doom?

momom2

13 Jul 2023 20:31 UTC

8 points

4 comments · 1 min read · LW link

Newcomb II: Newer and Comb-ier

Nathaniel Monson

13 Jul 2023 18:49 UTC

0 points

11 comments · 3 min read · LW link

Jailbreaking GPT-4’s code interpreter

Nikola Jurkovic

13 Jul 2023 18:43 UTC

160 points

22 comments · 7 min read · LW link

An attempt to steelman OpenAI’s alignment plan

Nathan Helm-Burger

13 Jul 2023 18:25 UTC

22 points

0 comments · 4 min read · LW link

Instrumental Convergence to Complexity Preservation

Macro Flaneur

13 Jul 2023 17:40 UTC

2 points

2 comments · 3 min read · LW link

Unabridged History of Global Parenting

CrimsonChin

13 Jul 2023 16:49 UTC

0 points

2 comments · 7 min read · LW link

The Goddess of Everything Else—The Animation

Writer

13 Jul 2023 16:26 UTC

142 points

4 comments · 1 min read · LW link

(youtu.be)

Winners of AI Alignment Awards Research Contest

Akash and OliviaJ

13 Jul 2023 16:14 UTC

115 points

4 comments · 12 min read · LW link

(alignmentawards.com)

Accidentally Load Bearing

jefftk

13 Jul 2023 16:10 UTC

280 points

17 comments · 1 min read · LW link · 1 review

(www.jefftk.com)

AI #20: Code Interpreter and Claude 2.0 for Everyone

Zvi

13 Jul 2023 14:00 UTC

60 points

9 comments · 56 min read · LW link

(thezvi.wordpress.com)

[Question] How can I get help becoming a better rationalist?

TeaTieAndHat

13 Jul 2023 13:41 UTC

31 points

19 comments · 1 min read · LW link

i love eating trash

Ace Delgado

13 Jul 2023 11:23 UTC

−15 points

0 comments · 1 min read · LW link

Elon Musk announces xAI

Jan_Kulveit

13 Jul 2023 9:01 UTC

75 points

35 comments · 1 min read · LW link

(www.ft.com)

The intelligence-sentience orthogonality thesis

Ben Smith

13 Jul 2023 6:55 UTC

19 points

9 comments · 9 min read · LW link