All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 171819 20 21 22 23 24 25 26 27 28 29 30

AI Impacts Quarterly Newsletter, Jan-Mar 2023

Harlan17 Apr 2023 22:10 UTC

5 points

0 comments3 min readLW link

(blog.aiimpacts.org)

[Question] How do you align your emotions through updates and existential uncertainty?

VojtaKovarik17 Apr 2023 20:46 UTC

4 points

10 comments1 min readLW link

AI Alignment Research Engineer Accelerator (ARENA): call for applicants

CallumMcDougall17 Apr 2023 20:30 UTC

100 points

9 comments7 min readLW link

AI policy ideas: Reading list

Zach Stein-Perlman17 Apr 2023 19:00 UTC

23 points

7 comments4 min readLW link

NYT: The Surprising Thing A.I. Engineers Will Tell You if You Let Them

Sodium17 Apr 2023 18:59 UTC

11 points

2 comments1 min readLW link

(www.nytimes.com)

But why would the AI kill us?

So8res17 Apr 2023 18:42 UTC

131 points

95 comments2 min readLW link

Sama Says the Age of Giant AI Models is Already Over

Algon17 Apr 2023 18:36 UTC

49 points

12 comments1 min readLW link

(www.wired.com)

Meetup Tip: Conversation Starters

Screwtape17 Apr 2023 18:25 UTC

20 points

1 comment2 min readLW link

Critiques of prominent AI safety labs: Redwood Research

Omega.17 Apr 2023 18:20 UTC

2 points

0 comments22 min readLW link

(forum.effectivealtruism.org)

How Large Language Models Nuke our Naive Notions of Truth and Reality

Sean Lee17 Apr 2023 18:08 UTC

0 points

23 comments11 min readLW link

An alternative of PPO towards alignment

ml hkust17 Apr 2023 17:58 UTC

2 points

2 comments4 min readLW link

What I learned at the AI Safety Europe Retreat

skaisg17 Apr 2023 17:40 UTC

28 points

0 comments10 min readLW link

(skaisg.eu)

What is your timelines for ADI (artificial disempowering intelligence)?

Christopher King17 Apr 2023 17:01 UTC

3 points

3 comments2 min readLW link

[Question] Can we get around Godel’s Incompleteness theorems and Turing undecidable problems via infinite computers?

Noosphere8917 Apr 2023 15:14 UTC

−11 points

12 comments1 min readLW link

La Crosse, WI Rationality Meetup

Daniel Uebele17 Apr 2023 15:13 UTC

1 point

0 comments1 min readLW link

Slowing AI: Foundations

Zach Stein-Perlman17 Apr 2023 14:30 UTC

45 points

11 comments17 min readLW link

Slowing AI: Reading list

Zach Stein-Perlman17 Apr 2023 14:30 UTC

45 points

3 comments4 min readLW link

Goodhart’s Law inside the human mind

Kaj_Sotala17 Apr 2023 13:48 UTC

117 points

13 comments16 min readLW link

Prediction: any uncontrollable AI will turn earth into a giant computer

Karl von Wendt17 Apr 2023 12:30 UTC

11 points

8 comments3 min readLW link

AutoBound on neural network can achieve OOMs lower training loss

Maybe_a17 Apr 2023 5:20 UTC

10 points

9 comments1 min readLW link

(ai.googleblog.com)

Making Booking.Com less out to get you

Elizabeth17 Apr 2023 4:04 UTC

21 points

0 comments1 min readLW link

(www.alexcharlton.co)

grey goo is unlikely

bhauth17 Apr 2023 1:59 UTC

157 points

120 comments9 min readLW link 1 review

(bhauth.com)

AGI Clinics: A Safe Haven for Humanity’s First Encounters with Superintelligence

portr.17 Apr 2023 1:52 UTC

−5 points

1 comment1 min readLW link

Summaries of top forum posts (27th March to 16th April)

Zoe Williams17 Apr 2023 0:28 UTC

14 points

1 comment1 min readLW link

AI Takeover Scenario with Scaled LLMs

simeon_c16 Apr 2023 23:28 UTC

42 points

15 comments8 min readLW link

My experience getting funding for my biological research

Metacelsus16 Apr 2023 22:53 UTC

75 points

10 comments5 min readLW link

(denovo.substack.com)

Top lesson from GPT: we will probably destroy humanity “for the lulz” as soon as we are able.

Shmi16 Apr 2023 20:27 UTC

63 points

28 comments1 min readLW link

On urgency, priority and collective reaction to AI-Risks: Part I

Denreik16 Apr 2023 19:14 UTC

−10 points

15 comments5 min readLW link

Efficient Learning: Memorization

Alvin Ånestrand16 Apr 2023 17:58 UTC

4 points

2 comments5 min readLW link

(forum.effectivealtruism.org)

Mechanistically interpreting time in GPT-2 small

rgould, Elizabeth Ho and Arthur Conmy

16 Apr 2023 17:57 UTC

68 points

6 comments21 min readLW link

La Crosse, WI Rationality Meetup

Daniel Uebele16 Apr 2023 17:33 UTC

1 point

0 comments1 min readLW link

The Soul of the Writer (on LLMs, the psychology of writers, and the nature of intelligence)

rogersbacon16 Apr 2023 16:02 UTC

11 points

1 comment3 min readLW link

(www.secretorum.life)

Possibilizing vs. actualizing

TsviBT16 Apr 2023 15:55 UTC

31 points

2 comments5 min readLW link

Human Extinction by AI through economic power

ChristianKl16 Apr 2023 12:15 UTC

8 points

1 comment8 min readLW link

Bit Flip

Charlie Sanders16 Apr 2023 7:30 UTC

−2 points

11 comments11 min readLW link

Double-negation as framing

Stuart Johnson16 Apr 2023 6:59 UTC

25 points

9 comments6 min readLW link

[Link/crosspost] [US] NTIA: AI Accountability Policy Request for Comment

Kyle J. Lucchese16 Apr 2023 6:57 UTC

8 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

[Question] Who is testing AI Safety public outreach messaging?

yanni kyriacos16 Apr 2023 6:57 UTC

13 points

2 comments1 min readLW link

Features of Emacs that I only recently discovered

EmacsScrub16 Apr 2023 6:57 UTC

12 points

5 comments3 min readLW link

ACX meetup in Prague (16th of May)

Jiří Nádvorník16 Apr 2023 6:25 UTC

4 points

0 comments1 min readLW link

SmartyHeaderCode: anomalous tokens for GPT3.5 and GPT-4

AdamYedidia15 Apr 2023 22:35 UTC

71 points

18 comments6 min readLW link

Open-source LLMs may prove Bostrom’s vulnerable world hypothesis

Roope Ahvenharju15 Apr 2023 19:16 UTC

1 point

1 comment1 min readLW link

[linkpost] Elon Musk plans AI start-up to rival OpenAI

Hatfield15 Apr 2023 19:06 UTC

11 points

11 comments1 min readLW link

(www.ft.com)

FLI report: Policymaking in the Pause

Zach Stein-Perlman15 Apr 2023 17:01 UTC

9 points

3 comments1 min readLW link

(futureoflife.org)

Reflective journal entries using GPT-4 and Obsidian that demand less willpower.

Solenoid_Entity15 Apr 2023 12:45 UTC

56 points

24 comments7 min readLW link

An example elevator pitch for AI doom

laserfiche15 Apr 2023 12:29 UTC

2 points

5 comments1 min readLW link

AI as Contact with our Collective Unconscious

Scott Broock15 Apr 2023 2:11 UTC

−4 points

6 comments4 min readLW link

The Truth About False

Thoth Hermes15 Apr 2023 1:01 UTC

−21 points

4 comments17 min readLW link

(thothhermes.substack.com)

The ‘ petertodd’ phenomenon

mwatkins15 Apr 2023 0:59 UTC

192 points

49 comments38 min readLW link

[Question] Concave Utility Question

Scott Garrabrant15 Apr 2023 0:14 UTC

55 points

36 comments2 min readLW link