8 Jul 2024 22:24 UTC

106 points

28 comments · 5 min read · LW link

Robin Hanson & Liron Shapira Debate AI X-Risk

Liron

8 Jul 2024 21:45 UTC

34 points

4 comments · 1 min read · LW link

(www.youtube.com)

“The Singularity Is Nearer” by Ray Kurzweil—Review

Lavender

8 Jul 2024 21:32 UTC

22 points

0 comments · 4 min read · LW link

Sample Prevalence vs Global Prevalence

jefftk

8 Jul 2024 21:00 UTC

11 points

0 comments · 2 min read · LW link

(www.jefftk.com)

Advice to junior AI governance researchers

Akash

8 Jul 2024 19:19 UTC

65 points

1 comment · 5 min read · LW link

Pantheon Interface

NicholasKees and Sofia Vanhanen

8 Jul 2024 19:03 UTC

126 points

22 comments · 6 min read · LW link

Launching the AI Forecasting Benchmark Series Q3 | $30k in Prizes

ChristianWilliams

8 Jul 2024 17:20 UTC

5 points

0 comments · 1 min read · LW link

(www.metaculus.com)

The Golden Mean of Scientific Virtues

adamShimi

8 Jul 2024 17:16 UTC

12 points

4 comments · 8 min read · LW link

(epistemologicalfascinations.substack.com)

Massapequa (Long Island), New York, USA – ACX Meetup

Gabriel Weil

8 Jul 2024 17:01 UTC

2 points

0 comments · 1 min read · LW link

Dialogue introduction to Singular Learning Theory

Olli Järviniemi

8 Jul 2024 16:58 UTC

97 points

14 comments · 8 min read · LW link

Announcing The Techno-Humanist Manifesto: A new philosophy of progress for the 21st century

jasoncrawford

8 Jul 2024 16:33 UTC

18 points

4 comments · 5 min read · LW link

(blog.rootsofprogress.org)

Response to Dileep George: AGI safety warrants planning ahead

Steven Byrnes

8 Jul 2024 15:27 UTC

27 points

7 comments · 27 min read · LW link

Why not parliamentarianism? [book by Tiago Ribeiro dos Santos]

Arturo Macias

8 Jul 2024 14:57 UTC

2 points

1 comment · 4 min read · LW link

Games of My Childhood: The Troops

Kaj_Sotala

8 Jul 2024 11:20 UTC

18 points

0 comments · 5 min read · LW link

(kajsotala.fi)

Towards shutdownable agents via stochastic choice

EJT, alexr, christosi and LAThomson

8 Jul 2024 10:14 UTC

59 points

12 comments · 23 min read · LW link

(arxiv.org)

On scalable oversight with weak LLMs judging strong LLMs

zac_kenton, Noah Siegel, janos, Jonah Brown-Cohen, Samuel Albanie, David Lindner and Rohin Shah

8 Jul 2024 8:59 UTC

49 points

18 comments · 7 min read · LW link

(arxiv.org)

Poker is a bad game for teaching epistemics. Figgie is a better one.

rossry

8 Jul 2024 6:05 UTC

104 points

47 comments · 11 min read · LW link

(blog.rossry.net)

Controlled Creative Destruction

Martin Sustrik

8 Jul 2024 4:36 UTC

11 points

0 comments · 2 min read · LW link

On saying “Thank you” instead of “I’m Sorry”

Michael Cohn

8 Jul 2024 3:13 UTC

132 points

16 comments · 3 min read · LW link

How can I get over my fear of becoming an emulated consciousness?

James Dowdell

7 Jul 2024 22:02 UTC

6 points

8 comments · 5 min read · LW link

An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2

Neel Nanda

7 Jul 2024 17:39 UTC

134 points

15 comments · 25 min read · LW link

Joint mandatory donation as a way to increase the number of donations

Crazy philosopher

7 Jul 2024 10:56 UTC

3 points

3 comments · 2 min read · LW link

Rationality vs Alignment

Donatas Lučiūnas

7 Jul 2024 10:12 UTC

−14 points

14 comments · 2 min read · LW link

Beyond Biomarkers: Understanding Multiscale Causality

Matěj Nekoranec

7 Jul 2024 9:56 UTC

13 points

0 comments · 7 min read · LW link

Goodhart’s Law and Emotions

Zero Contradictions

7 Jul 2024 8:32 UTC

1 point

5 comments · 1 min read · LW link

(expandingrationality.substack.com)

Reflections on Less Online

Error

7 Jul 2024 3:49 UTC

85 points

15 comments · 18 min read · LW link

LK-99 in retrospect

bhauth

7 Jul 2024 2:06 UTC

72 points

21 comments · 3 min read · LW link

(www.bhauth.com)

NYU Debate Training Update: Methods, Baselines, Preliminary Results

samarnesen

6 Jul 2024 18:28 UTC

9 points

0 comments · 20 min read · LW link

Scalable oversight as a quantitative rather than qualitative problem

Buck

6 Jul 2024 17:42 UTC

85 points

11 comments · 3 min read · LW link

An AI Manhattan Project is Not Inevitable

Maxwell Tabarrok

6 Jul 2024 16:42 UTC

38 points

25 comments · 4 min read · LW link

(www.maximum-progress.com)

[Linkpost] A Case for AI Consciousness

cdkg and Simon Goldstein

6 Jul 2024 14:52 UTC

19 points

2 comments · 1 min read · LW link

(philpapers.org)

[Question] Can agents coordinate on randomness without outside sources?

Mikhail Samin

6 Jul 2024 13:43 UTC

6 points

16 comments · 1 min read · LW link

AI Alignment Research Engineer Accelerator (ARENA): Call for applicants v4.0

James Fox, Chloe Li, JamesH, Gracie Green and CallumMcDougall

6 Jul 2024 11:34 UTC

57 points

7 comments · 6 min read · LW link

Links and brief musings for June

Kaj_Sotala

6 Jul 2024 10:10 UTC

26 points

0 comments · 10 min read · LW link

(kajsotala.fi)

Indecision and internalized authority figures

Kaj_Sotala

6 Jul 2024 10:10 UTC

68 points

1 comment · 2 min read · LW link

(kajsotala.fi)

Free Will, Determinism, And Choice

Zero Contradictions

6 Jul 2024 6:34 UTC

7 points

3 comments · 1 min read · LW link

(thewaywardaxolotl.blogspot.com)

Travel Buffer

jefftk

6 Jul 2024 2:20 UTC

17 points

3 comments · 1 min read · LW link

(www.jefftk.com)

[Question] What progress have we made on automated auditing?

LawrenceC

6 Jul 2024 1:49 UTC

38 points

1 comment · 1 min read · LW link

A “Bitter Lesson” Approach to Aligning AGI and ASI

RogerDearnaley

6 Jul 2024 1:23 UTC

58 points

39 comments · 24 min read · LW link

D&D.Sci: Whom Shall You Call?

abstractapplic

5 Jul 2024 20:53 UTC

38 points

6 comments · 2 min read · LW link

[Interim research report] Activation plateaus & sensitive directions in GPT2

StefanHex and jake_mendel

5 Jul 2024 17:05 UTC

65 points

2 comments · 5 min read · LW link

Minimalist And Maximalist Type Systems

adamShimi

5 Jul 2024 16:25 UTC

17 points

6 comments · 3 min read · LW link

(epistemologicalfascinations.substack.com)

ML4Good Summer Bootcamps—Applications Open [deadline extended]

YM

5 Jul 2024 13:59 UTC

12 points

0 comments · 1 min read · LW link

[Question] Are there any plans to launch a paperback version of “Rationality: From AI to Zombies”?

m_arj

5 Jul 2024 11:14 UTC

2 points

1 comment · 1 min read · LW link

Doomsday Argument and the False Dilemma of Anthropic Reasoning

Ape in the coat

5 Jul 2024 5:38 UTC

36 points

55 comments · 7 min read · LW link

Finding the Wisdom to Build Safe AI

Gordon Seidoh Worley

4 Jul 2024 19:04 UTC

36 points

10 comments · 9 min read · LW link

Libs vs Frameworks, Middle-Level Regularities vs Theories

adamShimi

4 Jul 2024 19:01 UTC

23 points

0 comments · 2 min read · LW link

(epistemologicalfascinations.substack.com)

The Potential Impossibility of Subjective Death

VictorLJZ

4 Jul 2024 18:17 UTC

3 points

34 comments · 1 min read · LW link

Consider the humble rock (or: why the dumb thing kills you)

pleiotroth

4 Jul 2024 13:54 UTC

60 points

11 comments · 4 min read · LW link

AI #71: Farewell to Chevron

Zvi

4 Jul 2024 13:40 UTC

53 points

9 comments · 36 min read · LW link

(thezvi.wordpress.com)