AI x-risk, approximately ordered by embarrassment

Alex Lawsen 12 Apr 2023 23:01 UTC
151 points
7 comments 19 min read LW link

AXRP Episode 20 - ‘Reform’ AI Alignment with Scott Aaronson

DanielFilan 12 Apr 2023 21:30 UTC
22 points
2 comments 68 min read LW link

Apply to >30 AI safety funders in one application with the Nonlinear Network

12 Apr 2023 21:23 UTC
65 points
12 comments 2 min read LW link

AGI goal space is big, but narrowing might not be as hard as it seems.

Jacy Reese Anthis 12 Apr 2023 19:03 UTC
15 points
0 comments 3 min read LW link

Natural language alignment

Jacy Reese Anthis 12 Apr 2023 19:02 UTC
31 points
2 comments 2 min read LW link

Repugnant levels of violins

Solenoid_Entity 12 Apr 2023 17:11 UTC
72 points
10 comments 12 min read LW link

Progress links and tweets, 2023-04-12

jasoncrawford 12 Apr 2023 16:52 UTC
8 points
2 comments 1 min read LW link
(rootsofprogress.org)

A basic mathematical structure of intelligence

Golol 12 Apr 2023 16:49 UTC
4 points
6 comments 4 min read LW link

[Question] Should AutoGPT update us towards researching IDA?

Michaël Trazzi 12 Apr 2023 16:41 UTC
15 points
5 comments 1 min read LW link

Boxing lessons

yakimoff 12 Apr 2023 16:19 UTC
1 point
0 comments 1 min read LW link

Dazed and confused: Good olde’ walk around the Marin Headlands

yakimoff 12 Apr 2023 16:09 UTC
1 point
0 comments 1 min read LW link

Towards a solution to the alignment problem via objective detection and evaluation

Paul Colognese 12 Apr 2023 15:39 UTC
9 points
7 comments 12 min read LW link

Artificial Intelligence as exit strategy from the age of acute existential risk

Arturo Macias 12 Apr 2023 14:48 UTC
−7 points
15 comments 7 min read LW link

The UBI dystopia: a glimpse into the future via present-day abuses

Solenoid_Entity 12 Apr 2023 14:44 UTC
50 points
73 comments 4 min read LW link

[Question] Goals of model vs. goals of simulacra?

dr_s 12 Apr 2023 13:02 UTC
5 points
7 comments 1 min read LW link

Alignment of AutoGPT agents

Ozyrus 12 Apr 2023 12:54 UTC
14 points
1 comment 4 min read LW link

Boundaries-based security and AI safety approaches

Allison Duettmann 12 Apr 2023 12:36 UTC
43 points
2 comments 6 min read LW link

Scaffolded LLMs as natural language computers

beren 12 Apr 2023 10:47 UTC
94 points
10 comments 11 min read LW link

LW is probably not the place for “I asked this LLM (x) and here’s what it said!”, but where is?

lillybaeum 12 Apr 2023 10:12 UTC
21 points
3 comments 1 min read LW link

No convincing evidence for gradient descent in activation space

Blaine 12 Apr 2023 4:48 UTC
82 points
9 comments 20 min read LW link

A Brief Introduction to ACI, 2: An Event-Centric View

Akira Pyinya 12 Apr 2023 3:23 UTC
3 points
0 comments 2 min read LW link

Boston Social Dance Covid Requirements

jefftk 12 Apr 2023 2:30 UTC
7 points
2 comments 1 min read LW link
(www.jefftk.com)

[Link] Sarah Constantin: “Why I am Not An AI Doomer”

lbThingrb 12 Apr 2023 1:52 UTC
61 points
13 comments 1 min read LW link
(sarahconstantin.substack.com)

[Question] Rationalist position towards lying?

WilliamTerry 12 Apr 2023 1:21 UTC
−2 points
4 comments 1 min read LW link

National Telecommunications and Information Administration: AI Accountability Policy Request for Comment

sanxiyn 11 Apr 2023 22:59 UTC
9 points
0 comments 1 min read LW link
(ntia.gov)

Binaristic Bifurcation: How Reality Splits Into Two Separate Binaries

Thoth Hermes 11 Apr 2023 21:19 UTC
−25 points
0 comments 3 min read LW link
(thothhermes.substack.com)

Bryan Bishop AMA on the Progress Forum

jasoncrawford 11 Apr 2023 21:05 UTC
8 points
0 comments 1 min read LW link
(rootsofprogress.org)

AI Risk US Presidential Candidate

Simon Berens 11 Apr 2023 19:31 UTC
5 points
3 comments 1 min read LW link

Evolution provides no evidence for the sharp left turn

Quintin Pope 11 Apr 2023 18:43 UTC
206 points
62 comments 15 min read LW link

On “aiming for convergence on truth”

gjm 11 Apr 2023 18:19 UTC
67 points
55 comments 13 min read LW link

In favor of accelerating problems you’re trying to solve

Christopher King 11 Apr 2023 18:15 UTC
2 points
2 comments 4 min read LW link

[Interview w/ Jeffrey Ladish] Applying the ‘security mindset’ to AI and x-risk

fowlertm 11 Apr 2023 18:14 UTC
12 points
0 comments 1 min read LW link

Request to AGI organizations: Share your views on pausing AI progress

11 Apr 2023 17:30 UTC
141 points
11 comments 1 min read LW link

Core of AI projections from first principles: Attempt 1

tailcalled 11 Apr 2023 17:24 UTC
21 points
3 comments 3 min read LW link

What Jason has been reading, April 2023

jasoncrawford 11 Apr 2023 16:29 UTC
18 points
0 comments 5 min read LW link
(rootsofprogress.org)

What about an AI that’s SUPPOSED to kill us (not ChaosGPT; only on paper)?

False Name 11 Apr 2023 16:09 UTC
−13 points
1 comment 3 min read LW link

Contra-Berkeley

False Name 11 Apr 2023 16:06 UTC
0 points
0 comments 4 min read LW link

Contra-Wittgenstein; no postmodernism

False Name 11 Apr 2023 16:05 UTC
−17 points
1 comment 5 min read LW link

[MLSN #9] Verifying large training runs, security risks from LLM access to APIs, why natural selection may favor AIs over humans

11 Apr 2023 16:03 UTC
11 points
0 comments 6 min read LW link
(newsletter.mlsafety.org)

Where’s the foom?

Fergus Fettes 11 Apr 2023 15:50 UTC
34 points
27 comments 2 min read LW link

“The Need for Long-term Research”—Seeds of Science call for reviewers

rogersbacon 11 Apr 2023 15:37 UTC
15 points
0 comments 1 min read LW link

NTIA—AI Accountability Announcement

samshap 11 Apr 2023 15:03 UTC
7 points
0 comments 1 min read LW link
(www.ntia.doc.gov)

A couple of questions about Conjecture’s Cognitive Emulation proposal

Igor Ivanov 11 Apr 2023 14:05 UTC
30 points
1 comment 3 min read LW link

Childhood Roundup #2

Zvi 11 Apr 2023 13:50 UTC
31 points
4 comments 19 min read LW link
(thezvi.wordpress.com)

Measuring artificial intelligence on human benchmarks is naive

Anomalous 11 Apr 2023 11:34 UTC
11 points
4 comments 1 min read LW link
(forum.effectivealtruism.org)

Killing Socrates

Duncan Sabien (Deactivated) 11 Apr 2023 10:28 UTC
186 points
144 comments 8 min read LW link

Cyberspace Administration of China: Draft of “Regulation for Generative Artificial Intelligence Services” is open for comments

sanxiyn 11 Apr 2023 9:32 UTC
7 points
2 comments 1 min read LW link
(archive.is)

[Question] Is “Strong Coherence” Anti-Natural?

DragonGod 11 Apr 2023 6:22 UTC
23 points
25 comments 2 min read LW link

Four mindset disagreements behind existential risk disagreements in ML

Rob Bensinger 11 Apr 2023 4:53 UTC
136 points
12 comments 1 min read LW link

Alignment vs capabilities

Adam Zerner 11 Apr 2023 4:35 UTC
13 points
2 comments 4 min read LW link