Slides: Potential Risks From Advanced AI

Aryeh Englander
Apr 28, 2022, 2:15 AM
7 points
0 comments
1 min read
LW link

Naive comments on AGIlignment

Ericf
Apr 28, 2022, 1:08 AM
−8 points
4 comments
1 min read
LW link

AI Alter­na­tive Fu­tures: Sce­nario Map­ping Ar­tifi­cial In­tel­li­gence Risk—Re­quest for Par­ti­ci­pa­tion (*Closed*)

Kakili
Apr 27, 2022, 10:07 PM
10 points
2 comments
8 min read
LW link

The Speed + Simplicity Prior is probably anti-deceptive

Yonadav Shavit
Apr 27, 2022, 7:30 PM
30 points
28 comments
12 min read
LW link

If you’re very optimistic about ELK then you should be optimistic about outer alignment

Sam Marks
Apr 27, 2022, 7:30 PM
17 points
8 comments
3 min read
LW link

The Game of Masks

Slimepriestess
Apr 27, 2022, 6:03 PM
50 points
18 comments
11 min read
LW link
(hivewired.wordpress.com)

Law-Following AI 3: Lawless AI Agents Undermine Stabilizing Agreements

Cullen
Apr 27, 2022, 5:30 PM
2 points
2 comments
3 min read
LW link

Law-Following AI 2: Intent Alignment + Superintelligence → Lawless AI (By Default)

Cullen
Apr 27, 2022, 5:27 PM
5 points
2 comments
6 min read
LW link

Law-Following AI 1: Sequence Introduction and Structure

Cullen
Apr 27, 2022, 5:26 PM
18 points
10 comments
9 min read
LW link

[Intro to brain-like-AGI safety] 13. Symbol grounding & human social instincts

Steven Byrnes
Apr 27, 2022, 1:30 PM
73 points
15 comments
15 min read
LW link

The case for turning glowfic into Sequences

Thomas Kwa
Apr 27, 2022, 6:58 AM
87 points
29 comments
5 min read
LW link

[Link] Evidence of Fabricated Data in a Vitamin C trial by Paul E Marik et al in CHEST

Kenny
Apr 27, 2022, 6:48 AM
6 points
1 comment
1 min read
LW link

SERI ML Alignment Theory Scholars Program 2022

Apr 27, 2022, 12:43 AM
67 points
6 comments
3 min read
LW link

EU Maximizing in a Gloomy World

David Udell
Apr 27, 2022, 12:28 AM
6 points
2 comments
1 min read
LW link

Why Copilot Accelerates Timelines

Michaël Trazzi
Apr 26, 2022, 10:06 PM
35 points
14 comments
7 min read
LW link

Universals of Morality: Toward Human-Centric Communication Platforms

scafaria
Apr 26, 2022, 9:15 PM
−3 points
3 comments
5 min read
LW link
(scafaria.com)

[$20K in Prizes] AI Safety Arguments Competition

Apr 26, 2022, 4:13 PM
75 points
518 comments
3 min read
LW link

Continental Philosophy as Undergraduate Mathematics

Jan
Apr 26, 2022, 8:05 AM
17 points
3 comments
9 min read
LW link
(universalprior.substack.com)

dalle2 comments

nostalgebraist
Apr 26, 2022, 5:30 AM
183 points
14 comments
13 min read
LW link
(nostalgebraist.tumblr.com)

Make a neural network in ~10 minutes

Arjun Yadav
Apr 26, 2022, 5:24 AM
8 points
0 comments
4 min read
LW link
(arjunyadav.net)

Framings of Deceptive Alignment

peterbarnett
Apr 26, 2022, 4:25 AM
32 points
7 comments
5 min read
LW link

Why pessimism sounds smart

jasoncrawford
Apr 25, 2022, 8:10 PM
76 points
15 comments
1 min read
LW link
(rootsofprogress.org)

[Question] What is being improved in recursive self improvement?

Lone Pine
Apr 25, 2022, 6:30 PM
7 points
6 comments
1 min read
LW link

21 on 21

Amir Bolous
Apr 25, 2022, 6:22 PM
43 points
5 comments
4 min read
LW link

[Question] Rationalist Inspired Coming-of-age Rituals

iceplant
Apr 25, 2022, 5:22 PM
10 points
3 comments
1 min read
LW link

[Request for Distillation] Coherence of Distributed Decisions With Different Inputs Implies Conditioning

johnswentworth
Apr 25, 2022, 5:01 PM
22 points
14 comments
2 min read
LW link

[Question] Quadratic voting with automatic collusion?

SarahNibs
Apr 25, 2022, 4:15 PM
10 points
5 comments
1 min read
LW link

Intuitions about solving hard problems

Richard_Ngo
Apr 25, 2022, 3:29 PM
106 points
23 comments
6 min read
LW link

Ukraine Post #11: Longer Term Predictions

Zvi
Apr 25, 2022, 2:10 PM
32 points
6 comments
11 min read
LW link
(thezvi.wordpress.com)

Key questions about artificial sentience: an opinionated guide

Robbo
Apr 25, 2022, 12:09 PM
51 points
31 comments
18 min read
LW link

On Tables and Happiness

Alexander
Apr 25, 2022, 9:51 AM
25 points
0 comments
2 min read
LW link

Why I’m Not a Utilitarian in Modern America

DanB
Apr 24, 2022, 9:43 PM
5 points
5 comments
8 min read
LW link

Examining Evolution as an Upper Bound for AGI Timelines

meanderingmoose
Apr 24, 2022, 7:08 PM
6 points
1 comment
9 min read
LW link

Athens, Greece – ACX Spring Meetups 2022

Elias
Apr 24, 2022, 6:06 PM
1 point
1 comment
1 min read
LW link

AI safety raising awareness resources bleg

iivonen
Apr 24, 2022, 5:13 PM
6 points
0 comments
1 min read
LW link

[Question] Anyone Familiar with Ground News?

jmh
Apr 24, 2022, 12:46 PM
2 points
2 comments
1 min read
LW link

[Question] Where can I publish an article containing a list of intellectuals who publicly admitted their mistakes once proven wrong?

Hashem ElAssad
Apr 24, 2022, 9:00 AM
0 points
1 comment
1 min read
LW link

What Is a Major Chord?

jefftk
Apr 24, 2022, 7:20 AM
59 points
11 comments
3 min read
LW link
(www.jefftk.com)

Slack gives you space to notice/reflect on subtle things

Raemon
Apr 24, 2022, 2:30 AM
158 points
18 comments
1 min read
LW link

Calling for Student Submissions: AI Safety Distillation Contest

Aris
Apr 24, 2022, 1:53 AM
48 points
15 comments
4 min read
LW link

Rationality Dojo

lsusr
Apr 24, 2022, 12:53 AM
14 points
5 comments
1 min read
LW link

[Question] Deletion

011eNigma235
Apr 23, 2022, 11:01 PM
1 point
1 comment
1 min read
LW link

Cape Town ACX meetup

Jordan Pieters
Apr 23, 2022, 11:00 PM
1 point
0 comments
1 min read
LW link

Re: So You Want to Be a Dharma Teacher

lsusr
23 Apr 2022 22:31 UTC
30 points
4 comments
2 min read
LW link
(hardcorezen.info)

Ineffective Altruism

lsusr
23 Apr 2022 22:07 UTC
86 points
17 comments
1 min read
LW link

[Question] Has anyone written a reductionist theory of creativity?

Grant Demaree
23 Apr 2022 22:05 UTC
4 points
3 comments
1 min read
LW link

Progress Report 5: tying it together

Nathan Helm-Burger
23 Apr 2022 21:07 UTC
10 points
0 comments
2 min read
LW link

The New Right appears to be on the rise for better or worse

Chris_Leong
23 Apr 2022 19:36 UTC
6 points
18 comments
1 min read
LW link

[ASoT] Consequentialist models as a superset of mesaoptimizers

leogao
23 Apr 2022 17:57 UTC
38 points
2 comments
4 min read
LW link

Report likelihood ratios

Ege Erdil
23 Apr 2022 17:10 UTC
80 points
9 comments
7 min read
LW link