All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Anthropic is further accelerating the Arms Race?

sapphire6 Apr 2023 23:29 UTC

82 points

22 comments1 min readLW link

(techcrunch.com)

Suggestion for safe AI structure (Curated Transparent Decisions)

Kane Gregory6 Apr 2023 22:00 UTC

5 points

6 comments3 min readLW link

10 reasons why lists of 10 reasons might be a winning strategy

trevor6 Apr 2023 21:24 UTC

110 points

7 comments1 min readLW link

A Defense of Utilitarianism

Pareto Optimal6 Apr 2023 21:09 UTC

−3 points

2 comments5 min readLW link

(paretooptimal.substack.com)

One Does Not Simply Replace the Humans

JerkyTreats6 Apr 2023 20:56 UTC

9 points

3 comments4 min readLW link

(www.lesswrong.com)

[Question] Where to begin in ML/AI?

Jake the Student6 Apr 2023 20:45 UTC

9 points

4 comments1 min readLW link

Misgeneralization as a misnomer

So8res6 Apr 2023 20:43 UTC

129 points

22 comments4 min readLW link

You can use GPT-4 to create prompt injections against GPT-4

WitchBOT6 Apr 2023 20:39 UTC

87 points

8 comments2 min readLW link

AI scares and changing public beliefs

Seth Herd6 Apr 2023 18:51 UTC

45 points

21 comments6 min readLW link

AISafety.world is a map of the AIS ecosystem

Hamish Doodles6 Apr 2023 18:37 UTC

80 points

0 comments1 min readLW link

I asked my senator to slow AI

Omid6 Apr 2023 18:18 UTC

21 points

5 comments2 min readLW link

Pause AI Development?

PeterMcCluskey6 Apr 2023 17:23 UTC

11 points

0 comments2 min readLW link

(bayesianinvestor.com)

Use these three heuristic imperatives to solve alignment

G6 Apr 2023 16:20 UTC

−17 points

4 comments1 min readLW link

Eliezer on The Lunar Society podcast

Max H6 Apr 2023 16:18 UTC

40 points

5 comments1 min readLW link

(www.dwarkeshpatel.com)

Do we get better or worse at adapting to change?

jasoncrawford6 Apr 2023 14:42 UTC

12 points

2 comments3 min readLW link

(rootsofprogress.org)

Is it true that only a chatbot encouraged a man to commit suicide?

Jeroen De Ryck6 Apr 2023 14:10 UTC

6 points

0 comments4 min readLW link

(www.vrt.be)

A Fresh FAQ on GiveWiki and Impact Markets Generally

Dawn Drescher6 Apr 2023 14:02 UTC

−1 points

0 comments1 min readLW link

(impactmarkets.substack.com)

AI #6: Agents of Change

Zvi6 Apr 2023 14:00 UTC

79 points

13 comments47 min readLW link

(thezvi.wordpress.com)

Stupid Questions—April 2023

ChristianKl6 Apr 2023 13:07 UTC

17 points

46 comments1 min readLW link

(Yet Another) Map for AI Risk Discussion

chronolitus6 Apr 2023 11:55 UTC

1 point

0 comments2 min readLW link

The Computational Anatomy of Human Values

beren6 Apr 2023 10:33 UTC

72 points

30 comments30 min readLW link

[Question] Is “Recursive Self-Improvement” Relevant in the Deep Learning Paradigm?

DragonGod6 Apr 2023 7:13 UTC

32 points

36 comments7 min readLW link

Revisiting the Horizon Length Hypothesis

Pablo Villalobos6 Apr 2023 6:39 UTC

23 points

4 comments3 min readLW link

Monthly Shorts 3/23

Celer6 Apr 2023 6:20 UTC

7 points

1 comment4 min readLW link

(keller.substack.com)

Dual-Useness is a Ratio

jimrandomh6 Apr 2023 5:46 UTC

35 points

2 comments1 min readLW link

[Question] What’s the deal with Effective Accelerationism (e/acc)?

RomanHauksson6 Apr 2023 4:03 UTC

23 points

9 comments2 min readLW link

No Summer Harvest: Why AI Development Won’t Pause

Stephen Fowler6 Apr 2023 3:53 UTC

14 points

17 comments12 min readLW link

Yoshua Bengio: “Slowing down development of AI systems passing the Turing test”

Roman Leventov6 Apr 2023 3:31 UTC

49 points

2 comments5 min readLW link

(yoshuabengio.org)

Unaligned stable loops emerge at scale

Michael Tontchev6 Apr 2023 2:15 UTC

9 points

8 comments4 min readLW link

Someone already tried “Chaos-GPT”

robert-cronin6 Apr 2023 2:15 UTC

17 points

4 comments1 min readLW link

[Question] Daisy-chaining epsilon-step verifiers

Decaeneus6 Apr 2023 2:07 UTC

2 points

1 comment1 min readLW link

Auto-GPT: Open-sourced disaster?

awg5 Apr 2023 22:46 UTC

23 points

18 comments1 min readLW link

(github.com)

The Orthogonality Thesis is Not Obviously True

omnizoid5 Apr 2023 21:06 UTC

3 points

79 comments9 min readLW link

Williams-Beuren Syndrome: Frendly Mutations

Takk5 Apr 2023 20:59 UTC

−1 points

1 comment1 min readLW link

OpenAI: Our approach to AI safety

Jacob G-W5 Apr 2023 20:26 UTC

1 point

1 comment1 min readLW link

(openai.com)

Why Are Maximum Entropy Distributions So Ubiquitous?

johnswentworth5 Apr 2023 20:12 UTC

68 points

6 comments9 min readLW link

“On Living in an Atomic Age”, by C.S. Lewis (1948)

tjaffee5 Apr 2023 18:34 UTC

17 points

3 comments8 min readLW link

(hebrew-streams.org)

Eliezer Yudkowsky’s Letter in Time Magazine

Zvi5 Apr 2023 18:00 UTC

212 points

86 comments14 min readLW link

(thezvi.wordpress.com)

Dark Artificial Intelligence

FrankAI5 Apr 2023 17:37 UTC

0 points

0 comments4 min readLW link

[Question] Best arguments against instrumental convergence?

lfrymire5 Apr 2023 17:06 UTC

5 points

7 comments1 min readLW link

Progress links and tweets, 2023-04-05

jasoncrawford5 Apr 2023 16:18 UTC

20 points

0 comments2 min readLW link

(rootsofprogress.org)

Universality and Hidden Information in Concept Bottleneck Models

Hoagy5 Apr 2023 14:00 UTC

23 points

0 comments11 min readLW link

AI safety and the security mindset: user interface design, red-teams, formal verification

Allison Duettmann5 Apr 2023 11:33 UTC

34 points

0 comments8 min readLW link

ICA Simulacra

Ozyrus5 Apr 2023 6:41 UTC

26 points

2 comments7 min readLW link

AGI deployment as an act of aggression

dr_s5 Apr 2023 6:39 UTC

28 points

30 comments13 min readLW link

A Brief Introduction to Algorithmic Common Intelligence, ACI . 1

Akira Pyinya5 Apr 2023 5:43 UTC

−2 points

1 comment2 min readLW link

46% of US adults at least “somewhat concerned” about AI extinction risk.

Foyle5 Apr 2023 5:25 UTC

1 point

0 comments1 min readLW link

[Question] Has anyone thought about how to proceed now that AI notkilleveryoneism is becoming more relevant/is approaching the Overton window?

metachirality5 Apr 2023 3:06 UTC

11 points

8 comments1 min readLW link

Empathy bandaid for immediate AI catastrophe

installgentoo5 Apr 2023 2:12 UTC

1 point

2 comments1 min readLW link

“Corrigibility at some small length” by dath ilan

Christopher King5 Apr 2023 1:47 UTC

32 points

3 comments9 min readLW link

(www.glowfic.com)