An­thropic is fur­ther ac­cel­er­at­ing the Arms Race?

sapphire6 Apr 2023 23:29 UTC
82 points
22 comments1 min readLW link
(techcrunch.com)

Sugges­tion for safe AI struc­ture (Cu­rated Trans­par­ent De­ci­sions)

Kane Gregory6 Apr 2023 22:00 UTC
5 points
6 comments3 min readLW link

10 rea­sons why lists of 10 rea­sons might be a win­ning strategy

trevor6 Apr 2023 21:24 UTC
101 points
7 comments1 min readLW link

A Defense of Utilitarianism

Pareto Optimal6 Apr 2023 21:09 UTC
−3 points
2 comments5 min readLW link
(paretooptimal.substack.com)

One Does Not Sim­ply Re­place the Hu­mans

JerkyTreats6 Apr 2023 20:56 UTC
9 points
3 comments4 min readLW link
(www.lesswrong.com)

[Question] Where to be­gin in ML/​AI?

Jake the Student6 Apr 2023 20:45 UTC
9 points
4 comments1 min readLW link

Mis­gen­er­al­iza­tion as a misnomer

So8res6 Apr 2023 20:43 UTC
129 points
22 comments4 min readLW link

You can use GPT-4 to cre­ate prompt in­jec­tions against GPT-4

WitchBOT6 Apr 2023 20:39 UTC
87 points
7 comments2 min readLW link

AI scares and chang­ing pub­lic beliefs

Seth Herd6 Apr 2023 18:51 UTC
45 points
21 comments6 min readLW link

AISafety.world is a map of the AIS ecosystem

Hamish Doodles6 Apr 2023 18:37 UTC
79 points
0 comments1 min readLW link

I asked my sen­a­tor to slow AI

Omid6 Apr 2023 18:18 UTC
21 points
5 comments2 min readLW link

Pause AI Devel­op­ment?

PeterMcCluskey6 Apr 2023 17:23 UTC
11 points
0 comments2 min readLW link
(bayesianinvestor.com)

Use these three heuris­tic im­per­a­tives to solve alignment

G6 Apr 2023 16:20 UTC
−17 points
4 comments1 min readLW link

Eliezer on The Lu­nar So­ciety podcast

Max H6 Apr 2023 16:18 UTC
40 points
5 comments1 min readLW link
(www.dwarkeshpatel.com)

Do we get bet­ter or worse at adapt­ing to change?

jasoncrawford6 Apr 2023 14:42 UTC
12 points
2 comments3 min readLW link
(rootsofprogress.org)

Is it true that only a chat­bot en­couraged a man to com­mit suicide?

Jeroen De Ryck6 Apr 2023 14:10 UTC
6 points
0 comments4 min readLW link
(www.vrt.be)

A Fresh FAQ on GiveWiki and Im­pact Mar­kets Generally

Dawn Drescher6 Apr 2023 14:02 UTC
−1 points
0 comments1 min readLW link
(impactmarkets.substack.com)

AI #6: Agents of Change

Zvi6 Apr 2023 14:00 UTC
79 points
13 comments47 min readLW link
(thezvi.wordpress.com)

Stupid Ques­tions—April 2023

ChristianKl6 Apr 2023 13:07 UTC
17 points
44 comments1 min readLW link

(Yet Another) Map for AI Risk Discussion

chronolitus6 Apr 2023 11:55 UTC
1 point
0 comments2 min readLW link

The Com­pu­ta­tional Anatomy of Hu­man Values

beren6 Apr 2023 10:33 UTC
70 points
30 comments30 min readLW link

[Question] Is “Re­cur­sive Self-Im­prove­ment” Rele­vant in the Deep Learn­ing Paradigm?

DragonGod6 Apr 2023 7:13 UTC
32 points
36 comments7 min readLW link

Re­vis­it­ing the Hori­zon Length Hypothesis

Pablo Villalobos6 Apr 2023 6:39 UTC
23 points
4 comments3 min readLW link

Monthly Shorts 3/​23

Celer6 Apr 2023 6:20 UTC
7 points
1 comment4 min readLW link
(keller.substack.com)

Dual-Use­ness is a Ratio

jimrandomh6 Apr 2023 5:46 UTC
35 points
2 comments1 min readLW link

[Question] What’s the deal with Effec­tive Ac­cel­er­a­tionism (e/​acc)?

RomanHauksson6 Apr 2023 4:03 UTC
23 points
9 comments2 min readLW link

No Sum­mer Har­vest: Why AI Devel­op­ment Won’t Pause

Stephen Fowler6 Apr 2023 3:53 UTC
14 points
17 comments12 min readLW link

Yoshua Ben­gio: “Slow­ing down de­vel­op­ment of AI sys­tems pass­ing the Tur­ing test”

Roman Leventov6 Apr 2023 3:31 UTC
49 points
2 comments5 min readLW link
(yoshuabengio.org)

Unal­igned sta­ble loops emerge at scale

Michael Tontchev6 Apr 2023 2:15 UTC
9 points
8 comments4 min readLW link

Some­one already tried “Chaos-GPT”

robert-cronin6 Apr 2023 2:15 UTC
17 points
4 comments1 min readLW link

[Question] Daisy-chain­ing ep­silon-step verifiers

Decaeneus6 Apr 2023 2:07 UTC
2 points
1 comment1 min readLW link

Auto-GPT: Open-sourced dis­aster?

awg5 Apr 2023 22:46 UTC
23 points
18 comments1 min readLW link
(github.com)

The Orthog­o­nal­ity Th­e­sis is Not Ob­vi­ously True

omnizoid5 Apr 2023 21:06 UTC
1 point
79 comments9 min readLW link

Willi­ams-Beuren Syn­drome: Frendly Mutations

Takk5 Apr 2023 20:59 UTC
−1 points
1 comment1 min readLW link

OpenAI: Our ap­proach to AI safety

Jacob G-W5 Apr 2023 20:26 UTC
1 point
1 comment1 min readLW link
(openai.com)

Why Are Max­i­mum En­tropy Distri­bu­tions So Ubiquitous?

johnswentworth5 Apr 2023 20:12 UTC
68 points
6 comments9 min readLW link

“On Liv­ing in an Atomic Age”, by C.S. Lewis (1948)

tjaffee5 Apr 2023 18:34 UTC
17 points
3 comments8 min readLW link
(hebrew-streams.org)

Eliezer Yud­kowsky’s Let­ter in Time Magazine

Zvi5 Apr 2023 18:00 UTC
212 points
86 comments14 min readLW link
(thezvi.wordpress.com)

Dark Ar­tifi­cial Intelligence

FrankAI5 Apr 2023 17:37 UTC
0 points
0 comments4 min readLW link

[Question] Best ar­gu­ments against in­stru­men­tal con­ver­gence?

lfrymire5 Apr 2023 17:06 UTC
5 points
7 comments1 min readLW link

Progress links and tweets, 2023-04-05

jasoncrawford5 Apr 2023 16:18 UTC
20 points
0 comments2 min readLW link
(rootsofprogress.org)

Univer­sal­ity and Hid­den In­for­ma­tion in Con­cept Bot­tle­neck Models

Hoagy5 Apr 2023 14:00 UTC
23 points
0 comments11 min readLW link

AI safety and the se­cu­rity mind­set: user in­ter­face de­sign, red-teams, for­mal verification

Allison Duettmann5 Apr 2023 11:33 UTC
34 points
0 comments8 min readLW link

ICA Simulacra

Ozyrus5 Apr 2023 6:41 UTC
26 points
2 comments7 min readLW link

AGI de­ploy­ment as an act of aggression

dr_s5 Apr 2023 6:39 UTC
27 points
29 comments13 min readLW link

A Brief In­tro­duc­tion to Al­gorith­mic Com­mon In­tel­li­gence, ACI . 1

Akira Pyinya5 Apr 2023 5:43 UTC
−2 points
1 comment2 min readLW link

46% of US adults at least “some­what con­cerned” about AI ex­tinc­tion risk.

Foyle5 Apr 2023 5:25 UTC
1 point
0 comments1 min readLW link

[Question] Has any­one thought about how to pro­ceed now that AI notkil­lev­ery­oneism is be­com­ing more rele­vant/​is ap­proach­ing the Over­ton win­dow?

metachirality5 Apr 2023 3:06 UTC
11 points
8 comments1 min readLW link

Em­pa­thy bandaid for im­me­di­ate AI catastrophe

installgentoo5 Apr 2023 2:12 UTC
1 point
2 comments1 min readLW link

“Cor­rigi­bil­ity at some small length” by dath ilan

Christopher King5 Apr 2023 1:47 UTC
32 points
3 comments9 min readLW link
(www.glowfic.com)