All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 161718 19 20 21 22 23 24 25 26 27 28 29 30 31

Book Launch: “The Carving of Reality,” Best of LessWrong vol. III

Raemon16 Aug 2023 23:52 UTC

131 points

22 comments5 min readLW link

One example of how LLM propaganda attacks can hack the brain

trevor16 Aug 2023 21:41 UTC

24 points

8 comments4 min readLW link

If we had known the atmosphere would ignite

Jeffs16 Aug 2023 20:28 UTC

56 points

63 comments2 min readLW link

Stampy’s AI Safety Info—New Distillations #4 [July 2023]

markov16 Aug 2023 19:03 UTC

22 points

10 comments1 min readLW link

(aisafety.info)

A Proof of Löb’s Theorem using Computability Theory

jessicata16 Aug 2023 18:57 UTC

71 points

0 comments17 min readLW link

(unstableontology.com)

Summary of and Thoughts on the Hotz/Yudkowsky Debate

Zvi16 Aug 2023 16:50 UTC

105 points

47 comments9 min readLW link

(thezvi.wordpress.com)

Red Pill vs Blue Pill, Bayes style

ErickBall16 Aug 2023 15:23 UTC

28 points

33 comments1 min readLW link

What does it mean to “trust science”?

jasoncrawford16 Aug 2023 14:56 UTC

34 points

9 comments1 min readLW link

(rootsofprogress.org)

Jason Crawford / The Roots of Progress in Bangalore, August 21 to September 8

jasoncrawford16 Aug 2023 13:36 UTC

13 points

1 comment1 min readLW link

(rootsofprogress.org)

Gaining knowledge at a price

DavidMadsen16 Aug 2023 10:21 UTC

−4 points

5 comments1 min readLW link

Understanding and visualizing sycophancy datasets

Nina Panickssery16 Aug 2023 5:34 UTC

45 points

0 comments6 min readLW link

George Hotz vs Eliezer Yudkowsky AI Safety Debate—link and brief discussion

Gerald Monroe16 Aug 2023 4:31 UTC

11 points

26 comments2 min readLW link

(www.youtube.com)

[Question] How to take advanage of the market’s irrationality regarding AGI?

GeneSmith16 Aug 2023 3:30 UTC

23 points

6 comments2 min readLW link

Infinite Ethics: Infinite Problems

omnizoid16 Aug 2023 2:44 UTC

−2 points

25 comments23 min readLW link

Private Biostasis & Cryonics Social

Mati_Roy16 Aug 2023 2:34 UTC

11 points

0 comments1 min readLW link

Some thoughts on George Hotz vs Eliezer Yudkowsky

TristanTrim15 Aug 2023 23:33 UTC

10 points

3 comments2 min readLW link

Understanding the Information Flow inside Large Language Models

Felix Hofstätter and cozyfractal

15 Aug 2023 21:13 UTC

19 points

0 comments17 min readLW link

[Question] Any research in “probe-tuning” of LLMs?

Roman Leventov15 Aug 2023 21:01 UTC

20 points

3 comments1 min readLW link

Can AI Transform the Electorate into a Citizen’s Assembly

RoscoHunter15 Aug 2023 17:52 UTC

−3 points

5 comments3 min readLW link

Ten Thousand Years of Solitude

agp15 Aug 2023 17:45 UTC

136 points

19 comments4 min readLW link

(www.discovermagazine.com)

AISN #19: US-China Competition on AI Chips, Measuring Language Agent Developments, Economic Analysis of Language Model Propaganda, and White House AI Cyber Challenge

aogara and Dan H

15 Aug 2023 16:10 UTC

21 points

0 comments5 min readLW link

(newsletter.safe.ai)

[Question] What is the most effective anti-tyranny charity?

lc15 Aug 2023 15:26 UTC

20 points

10 comments1 min readLW link

My checklist for publishing a blog post

Steven Byrnes15 Aug 2023 15:04 UTC

84 points

6 comments3 min readLW link

The Dunbar Playbook: A CRM system for your friends

Severin T. Seehrich15 Aug 2023 8:44 UTC

33 points

16 comments5 min readLW link

(amoretlicentia.substack.com)

Optical Illusions are Out of Distribution Errors

vitaliya15 Aug 2023 2:23 UTC

30 points

8 comments2 min readLW link

A short calculation about a Twitter poll

Ege Erdil14 Aug 2023 19:48 UTC

64 points

64 comments11 min readLW link

Decomposing independent generalizations in neural networks via Hessian analysis

Dmitry Vaintrob and Nina Panickssery

14 Aug 2023 17:04 UTC

83 points

4 comments1 min readLW link

Memetic Judo #2: Incorporal Switches and Levers Compendium

Max TK14 Aug 2023 16:53 UTC

19 points

6 comments17 min readLW link

Existentially relevant thought experiment: To kill or not to kill, a sniper, a man and a button.

AlexFromSafeTransition14 Aug 2023 10:53 UTC

−18 points

6 comments4 min readLW link

Stepping down as moderator on LW

Kaj_Sotala14 Aug 2023 10:46 UTC

82 points

1 comment1 min readLW link

Announcing Manifest 2023 (Sep 22-24 in Berkeley)

Saul Munn and Austin Chen

14 Aug 2023 5:13 UTC

31 points

0 comments2 min readLW link

Coherence Therapy with LLMs—quick demo

Chipmonk14 Aug 2023 3:34 UTC

19 points

11 comments1 min readLW link

Listen For What You Don’t Hear: The Case for Contrarianism

Yashvardhan Sharma14 Aug 2023 2:53 UTC

1 point

1 comment5 min readLW link

Recipe: Hessian eigenvector computation for PyTorch models

Nina Panickssery14 Aug 2023 2:48 UTC

32 points

5 comments5 min readLW link

[Question] Assuming LK99 or similar: how to accelerate commercialization?

ryan_b13 Aug 2023 21:34 UTC

7 points

5 comments1 min readLW link

Twin Cities ACX Meetup September 2023

Timothy M.13 Aug 2023 20:10 UTC

1 point

4 comments1 min readLW link

Fundamental Uncertainty: Chapter 1 - How can we know what’s true?

Gordon Seidoh Worley13 Aug 2023 18:55 UTC

17 points

4 comments12 min readLW link

We Should Prepare for a Larger Representation of Academia in AI Safety

Leon Lang13 Aug 2023 18:03 UTC

90 points

13 comments5 min readLW link

AGI is easier than robotaxis

Daniel Kokotajlo13 Aug 2023 17:00 UTC

41 points

30 comments4 min readLW link

[Question] If we’re alive in 5 years, do you think the funding situation will be much better by then? (With large amounts of government funding, for example)

kuira13 Aug 2023 16:32 UTC

−2 points

6 comments1 min readLW link

Abstract Theories of Everything

Philosophistry13 Aug 2023 6:06 UTC

−17 points

0 comments1 min readLW link

[Linkpost] Personal and Psychological Dimensions of AI Researchers Confronting AI Catastrophic Risks

Bogdan Ionut Cirstea12 Aug 2023 22:02 UTC

42 points

0 comments1 min readLW link

The Empathy Engine: A Deconstruction of the Societal Metamorphosis through Technological Empathy Augmentation

bigdickproblems12 Aug 2023 18:23 UTC

−30 points

3 comments2 min readLW link

The Benevolent Ruler’s Handbook (Part 2): Morality Rules

FCCC12 Aug 2023 14:25 UTC

5 points

0 comments4 min readLW link

Learning as you play: anthropic shadow in deadly games

dr_s12 Aug 2023 7:34 UTC

37 points

28 comments35 min readLW link

Biological Anchors: The Trick that Might or Might Not Work

Scott Alexander12 Aug 2023 0:53 UTC

91 points

3 comments33 min readLW link

(astralcodexten.substack.com)

Simulate the CEO

robotelvis12 Aug 2023 0:09 UTC

23 points

5 comments5 min readLW link

(messyprogress.substack.com)

How to decide under low-stakes uncertainty

dkl911 Aug 2023 18:07 UTC

11 points

4 comments1 min readLW link

(dkl9.net)

The Pandemic is Only Beginning: The Long COVID Disaster

salvatore mattera11 Aug 2023 17:36 UTC

−6 points

15 comments8 min readLW link

When discussing AI risks, talk about capabilities, not intelligence

Vika11 Aug 2023 13:38 UTC

116 points

7 comments3 min readLW link

(vkrakovna.wordpress.com)