Auto-GPT: Open-sourced dis­aster?

awg5 Apr 2023 22:46 UTC
23 points
18 comments1 min readLW link
(github.com)

The Orthog­o­nal­ity Th­e­sis is Not Ob­vi­ously True

omnizoid5 Apr 2023 21:06 UTC
1 point
79 comments9 min readLW link

Willi­ams-Beuren Syn­drome: Frendly Mutations

Takk5 Apr 2023 20:59 UTC
−1 points
1 comment1 min readLW link

OpenAI: Our ap­proach to AI safety

Jacob G-W5 Apr 2023 20:26 UTC
1 point
1 comment1 min readLW link
(openai.com)

Why Are Max­i­mum En­tropy Distri­bu­tions So Ubiquitous?

johnswentworth5 Apr 2023 20:12 UTC
68 points
6 comments9 min readLW link

“On Liv­ing in an Atomic Age”, by C.S. Lewis (1948)

tjaffee5 Apr 2023 18:34 UTC
17 points
3 comments8 min readLW link
(hebrew-streams.org)

Eliezer Yud­kowsky’s Let­ter in Time Magazine

Zvi5 Apr 2023 18:00 UTC
212 points
86 comments14 min readLW link
(thezvi.wordpress.com)

Dark Ar­tifi­cial Intelligence

FrankAI5 Apr 2023 17:37 UTC
0 points
0 comments4 min readLW link

[Question] Best ar­gu­ments against in­stru­men­tal con­ver­gence?

lfrymire5 Apr 2023 17:06 UTC
5 points
7 comments1 min readLW link

Progress links and tweets, 2023-04-05

jasoncrawford5 Apr 2023 16:18 UTC
20 points
0 comments2 min readLW link
(rootsofprogress.org)

Univer­sal­ity and Hid­den In­for­ma­tion in Con­cept Bot­tle­neck Models

Hoagy5 Apr 2023 14:00 UTC
23 points
0 comments11 min readLW link

AI safety and the se­cu­rity mind­set: user in­ter­face de­sign, red-teams, for­mal verification

Allison Duettmann5 Apr 2023 11:33 UTC
34 points
0 comments8 min readLW link

ICA Simulacra

Ozyrus5 Apr 2023 6:41 UTC
26 points
2 comments7 min readLW link

AGI de­ploy­ment as an act of aggression

dr_s5 Apr 2023 6:39 UTC
27 points
29 comments13 min readLW link

A Brief In­tro­duc­tion to Al­gorith­mic Com­mon In­tel­li­gence, ACI . 1

Akira Pyinya5 Apr 2023 5:43 UTC
−2 points
1 comment2 min readLW link

46% of US adults at least “some­what con­cerned” about AI ex­tinc­tion risk.

Foyle5 Apr 2023 5:25 UTC
1 point
0 comments1 min readLW link

[Question] Has any­one thought about how to pro­ceed now that AI notkil­lev­ery­oneism is be­com­ing more rele­vant/​is ap­proach­ing the Over­ton win­dow?

metachirality5 Apr 2023 3:06 UTC
11 points
8 comments1 min readLW link

Em­pa­thy bandaid for im­me­di­ate AI catastrophe

installgentoo5 Apr 2023 2:12 UTC
1 point
2 comments1 min readLW link

“Cor­rigi­bil­ity at some small length” by dath ilan

Christopher King5 Apr 2023 1:47 UTC
32 points
3 comments9 min readLW link
(www.glowfic.com)

New sur­vey: 46% of Amer­i­cans are con­cerned about ex­tinc­tion from AI; 69% sup­port a six-month pause in AI development

Akash5 Apr 2023 1:26 UTC
46 points
9 comments1 min readLW link
(today.yougov.com)

Is AGI suici­dal­ity the golden ray of hope?

Alex Kirko4 Apr 2023 23:29 UTC
−18 points
4 comments1 min readLW link

Re­con­tex­tu­al­iz­ing the Risks of AI in More Pre­dictable Outcomes

ignorepeter4 Apr 2023 23:28 UTC
−19 points
2 comments5 min readLW link

LW Team is ad­just­ing mod­er­a­tion policy

Raemon4 Apr 2023 20:41 UTC
304 points
185 comments3 min readLW link

Ex­ces­sive AI growth-rate yields lit­tle so­cio-eco­nomic benefit.

Cleo Nardo4 Apr 2023 19:13 UTC
27 points
22 comments4 min readLW link

Pe­nal­ize Model Com­plex­ity Via Self-Distillation

research_prime_space4 Apr 2023 18:52 UTC
15 points
7 comments1 min readLW link

The One Heresy to Rule Them All

rogersbacon4 Apr 2023 18:23 UTC
−22 points
0 comments3 min readLW link
(www.secretorum.life)

Gi­ant (In)scrutable Ma­tri­ces: (Maybe) the Best of All Pos­si­ble Worlds

1a3orn4 Apr 2023 17:39 UTC
196 points
37 comments5 min readLW link

Play My Futarchy/​Pre­dic­tion Mar­ket Mafia Game

Arjun Panickssery4 Apr 2023 16:12 UTC
21 points
2 comments1 min readLW link
(arjunpanickssery.substack.com)

[Question] Steel­man /​ Ide­olog­i­cal Tur­ing Test of Yann LeCun’s AI X-Risk ar­gu­ment?

Aryeh Englander4 Apr 2023 15:53 UTC
26 points
14 comments1 min readLW link

Given the Restrict Act, Don’t Ban TikTok

Zvi4 Apr 2023 14:40 UTC
97 points
9 comments4 min readLW link
(thezvi.wordpress.com)

Run­ning many AI var­i­ants to find cor­rect goal generalization

avturchin4 Apr 2023 14:16 UTC
20 points
3 comments1 min readLW link

In­vo­ca­tions: The Other Ca­pa­bil­ities Over­hang?

Robert_AIZI4 Apr 2023 13:38 UTC
29 points
4 comments4 min readLW link
(aizi.substack.com)

Wanted: Men­tal Health Pro­gram Man­ager at Re­think Wel­lbe­ing

Inga G.4 Apr 2023 11:49 UTC
7 points
0 comments1 min readLW link

Where Free Will and Deter­minism Meet

David Bravo4 Apr 2023 10:59 UTC
0 points
0 comments3 min readLW link

Strate­gies to Prevent AI Annihilation

lastchanceformankind4 Apr 2023 8:59 UTC
−2 points
0 comments4 min readLW link

ACX Meetup Madrid

Pablo Villalobos4 Apr 2023 8:53 UTC
5 points
2 comments1 min readLW link

[Question] Best Ways to Try to Get Fund­ing for Align­ment Re­search?

RGRGRG4 Apr 2023 6:35 UTC
9 points
6 comments1 min readLW link

Con­sider ap­ply­ing to a 2-week al­ign­ment pro­ject with former GitHub CEO

jacobjacob4 Apr 2023 6:20 UTC
42 points
0 comments1 min readLW link
(twitter.com)

On how it feels gen­er­at­ing art with DALL-E

cortrinkau4 Apr 2023 4:13 UTC
5 points
0 comments3 min readLW link
(cortrinkau.bearblog.dev)

AI Sum­mer Harvest

Cleo Nardo4 Apr 2023 3:35 UTC
130 points
10 comments1 min readLW link

How to re­spond to the re­cent con­dem­na­tions of the ra­tio­nal­ist community

Christopher King4 Apr 2023 1:42 UTC
−2 points
7 comments4 min readLW link

Steer­ing systems

Max H4 Apr 2023 0:56 UTC
50 points
1 comment15 min readLW link

ChatGPT Suggests Listen­ing To Rus­sell & Yudkowsky

JenniferRM4 Apr 2023 0:30 UTC
7 points
1 comment17 min readLW link

Com­plex Sys­tems are Hard to Control

jsteinhardt4 Apr 2023 0:00 UTC
42 points
5 comments10 min readLW link
(bounded-regret.ghost.io)

Ap­ply to the Cavendish Labs Fel­low­ship (by 4/​15)

3 Apr 2023 23:09 UTC
11 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M.3 Apr 2023 23:07 UTC
5 points
3 comments1 min readLW link

Com­mu­ni­cat­ing effec­tively un­der Knigh­tian norms

Richard_Ngo3 Apr 2023 22:39 UTC
93 points
54 comments6 min readLW link

If in­ter­pretabil­ity re­search goes well, it may get dangerous

So8res3 Apr 2023 21:48 UTC
200 points
11 comments2 min readLW link

Towards em­pa­thy in RL agents and be­yond: In­sights from cog­ni­tive sci­ence for AI Align­ment

Marc Carauleanu3 Apr 2023 19:59 UTC
15 points
6 comments1 min readLW link
(clipchamp.com)

Monthly Roundup #5: April 2023

Zvi3 Apr 2023 18:50 UTC
26 points
12 comments14 min readLW link
(thezvi.wordpress.com)