Why We MUST Create an AGI that Disempowers Humanity. For Real.

twkaiser · 22 Mar 2023 23:01 UTC
−17 points
1 comment · 4 min read · LW link

Progress links and tweets, 2023-03-22

jasoncrawford · 22 Mar 2023 22:19 UTC
13 points
0 comments · 2 min read · LW link
(rootsofprogress.org)

[Question] How to convince someone AGI is coming soon?

Zohar Jackson · 22 Mar 2023 22:16 UTC
5 points
7 comments · 1 min read · LW link

Harry Potter in The World of Path Semantics

Sven Nilsen · 22 Mar 2023 20:22 UTC
−3 points
17 comments · 1 min read · LW link
(raw.githubusercontent.com)

Books: Lend, Don’t Give

jefftk · 22 Mar 2023 18:40 UTC
28 points
2 comments · 1 min read · LW link
(www.jefftk.com)

[Linkpost] Shorter version of report on existential risk from power-seeking AI

Joe Carlsmith · 22 Mar 2023 18:09 UTC
7 points
0 comments · 1 min read · LW link

Announcing the European Network for AI Safety (ENAIS)

Esben Kran · 22 Mar 2023 17:57 UTC
19 points
0 comments · 1 min read · LW link

[Question] Genuine question: If Eliezer is so rational why is he fat?

DirichletConvolution · 22 Mar 2023 17:41 UTC
−39 points
7 comments · 1 min read · LW link

Making better estimates with scarce information

Stan Pinsent · 22 Mar 2023 17:40 UTC
11 points
5 comments · 10 min read · LW link

Anki with Uncertainty: Turn any flashcard deck into a calibration training tool

Sage Future · 22 Mar 2023 17:26 UTC
14 points
2 comments · 1 min read · LW link

Key Questions for Digital Minds

Jacy Reese Anthis · 22 Mar 2023 17:13 UTC
22 points
0 comments · 7 min read · LW link
(www.sentienceinstitute.org)

Empirical risk minimization is fundamentally confused

Jesse Hoogland · 22 Mar 2023 16:58 UTC
32 points
5 comments · 1 min read · LW link

[Question] Challenge: Does ChatGPT ever claim that a bad outcome for humanity is actually good?

Yair Halberstadt · 22 Mar 2023 16:01 UTC
49 points
29 comments · 1 min read · LW link

The space of systems and the space of maps

22 Mar 2023 14:59 UTC
39 points
0 comments · 5 min read · LW link

Feature Request to OpenAI: Share button in ChatGPT

Taleuntum · 22 Mar 2023 14:19 UTC
14 points
4 comments · 2 min read · LW link

Why AI Safety is Hard

Simon Möller · 22 Mar 2023 10:44 UTC
3 points
0 comments · 6 min read · LW link

[Question] Was Saga of Tatiana the Funny made by Fushimi Gaku?

Eve Grey · 22 Mar 2023 9:59 UTC
−9 points
0 comments · 1 min read · LW link

The Gom Jabbar scene from Dune is essentially a short film about what Rationality is for

mako yass · 22 Mar 2023 8:33 UTC
6 points
1 comment · 1 min read · LW link

Agentic GPT simulations: a risk and an opportunity

Yair Halberstadt · 22 Mar 2023 6:24 UTC
24 points
8 comments · 1 min read · LW link

Emergent Analogical Reasoning in Large Language Models

Roman Leventov · 22 Mar 2023 5:18 UTC
13 points
2 comments · 1 min read · LW link
(arxiv.org)

[Linkpost] GatesNotes: The Age of AI has begun

WilliamKiely · 22 Mar 2023 4:20 UTC
19 points
9 comments · 1 min read · LW link

An Appeal to AI Superintelligence: Reasons Not to Preserve (most of) Humanity

Alex Beyman · 22 Mar 2023 4:09 UTC
−15 points
6 comments · 19 min read · LW link

Truth and Advantage: Response to a draft of “AI safety seems hard to measure”

So8res · 22 Mar 2023 3:36 UTC
98 points
9 comments · 5 min read · LW link

A Proposed Approach for AI Safety Movement Building: Projects, Professions, Skills, and Ideas for the Future [long post][bounty for feedback]

peterslattery · 22 Mar 2023 1:11 UTC
14 points
0 comments · 32 min read · LW link

Principles for Productive Group Meetings

jsteinhardt · 22 Mar 2023 0:50 UTC
60 points
1 comment · 13 min read · LW link
(bounded-regret.ghost.io)

God vs AI scientifically

Donatas Lučiūnas · 21 Mar 2023 23:03 UTC
−22 points
45 comments · 1 min read · LW link

A method for empirical back-testing of AI’s ability to self-improve

Michael Tontchev · 21 Mar 2023 20:24 UTC
3 points
0 comments · 2 min read · LW link

the QACI alignment plan: table of contents

Tamsin Leake · 21 Mar 2023 20:22 UTC
107 points
1 comment · 1 min read · LW link
(carado.moe)

AI Fables

Bard · 21 Mar 2023 19:19 UTC
18 points
12 comments · 4 min read · LW link

[Question] Adversarial (SEO) GPT training data?

Dagon · 21 Mar 2023 18:55 UTC
2 points
0 comments · 1 min read · LW link

[Question] Why not constrain wetlabs instead of AI?

Lone Pine · 21 Mar 2023 18:02 UTC
15 points
10 comments · 1 min read · LW link

[Question] Wouldn’t an intelligent agent keep us alive and help us align itself to our values in order to prevent risk ? by Risk I mean experimentation by trying to align potentially smarter replicas?

Terrence Rotoufle · 21 Mar 2023 17:44 UTC
−3 points
1 comment · 2 min read · LW link

[Question] Employer considering partnering with major AI labs. What to do?

GraduallyMoreAgitated · 21 Mar 2023 17:43 UTC
37 points
7 comments · 2 min read · LW link

Sun-following Garden Mirrors?

jefftk · 21 Mar 2023 16:20 UTC
15 points
5 comments · 1 min read · LW link
(www.jefftk.com)

Some constructions for proof-based cooperation without Löb

James Payor · 21 Mar 2023 16:12 UTC
43 points
3 comments · 4 min read · LW link

Clarifying mesa-optimization

21 Mar 2023 15:53 UTC
38 points
6 comments · 10 min read · LW link

“Perspective: Focused-Ultrasound Guided Neuropeptide Delivery as a Novel Therapeutic Approach in Psychiatry” (Seeds of Science call for reviewers)

rogersbacon · 21 Mar 2023 14:31 UTC
6 points
1 comment · 2 min read · LW link

AI #4: Introducing GPT-4

Zvi · 21 Mar 2023 14:00 UTC
101 points
32 comments · 103 min read · LW link
(thezvi.wordpress.com)

[Question] Are robotics bottlenecked on hardware or software?

tailcalled · 21 Mar 2023 7:26 UTC
14 points
13 comments · 1 min read · LW link

Truthseeking processes tend to be frame-invariant

Adele Lopez · 21 Mar 2023 6:17 UTC
21 points
2 comments · 2 min read · LW link

Smart People are Probably Dangerous

Program Den · 21 Mar 2023 6:00 UTC
−28 points
2 comments · 1 min read · LW link

Exploring the Precautionary Principle in AI Development: Historical Analogies and Lessons Learned

Christopher King · 21 Mar 2023 3:53 UTC
−1 points
2 comments · 9 min read · LW link

Deep Deceptiveness

So8res · 21 Mar 2023 2:51 UTC
237 points
59 comments · 14 min read · LW link

Stanford claims to have replicated ChatGPT for < $600

NoSignalNoNoise · 21 Mar 2023 2:28 UTC
2 points
1 comment · 1 min read · LW link
(crfm.stanford.edu)

Capabilities Denial: The Danger of Underestimating AI

Christopher King · 21 Mar 2023 1:24 UTC
6 points
5 comments · 3 min read · LW link

Rationality Taboo, The Game

Screwtape · 21 Mar 2023 0:56 UTC
6 points
0 comments · 2 min read · LW link

Skill Acquisition

Screwtape · 21 Mar 2023 0:49 UTC
6 points
0 comments · 4 min read · LW link

Lawyers And World-Models

SomeoneYouOnceKnew · 21 Mar 2023 0:29 UTC
−2 points
1 comment · 3 min read · LW link

My Objections to “We’re All Gonna Die with Eliezer Yudkowsky”

Quintin Pope · 21 Mar 2023 0:06 UTC
357 points
230 comments · 39 min read · LW link

LED Brain Stimulation for Productivity

Simon Berens · 20 Mar 2023 22:30 UTC
13 points
6 comments · 1 min read · LW link
(news.ycombinator.com)