Thatcher’s Axiom

Edward P. Könings24 Jan 2023 22:35 UTC
10 points
22 comments4 min readLW link

[Question] Some ques­tions about free will compatibilism

Asking Questions24 Jan 2023 21:54 UTC
3 points
21 comments6 min readLW link

Alexan­der and Yud­kowsky on AGI goals

24 Jan 2023 21:09 UTC
177 points
53 comments26 min readLW link1 review

[Question] Is _The Age of AI: And Our Hu­man Fu­ture_ worth reading

jmh24 Jan 2023 21:05 UTC
4 points
0 comments1 min readLW link

In­verse Scal­ing Prize: Se­cond Round Winners

24 Jan 2023 20:12 UTC
58 points
17 comments15 min readLW link

ChatGPT in­ti­mates a tan­ta­l­iz­ing fu­ture; its core LLM is or­ga­nized on mul­ti­ple lev­els; and it has bro­ken the idea of think­ing.

Bill Benzon24 Jan 2023 19:05 UTC
5 points
0 comments5 min readLW link

How-to Trans­former Mechanis­tic In­ter­pretabil­ity—in 50 lines of code or less!

StefanHex24 Jan 2023 18:45 UTC
47 points
5 comments13 min readLW link

The Cabi­net of Wikipe­dian Curiosities

Sam Enright24 Jan 2023 18:22 UTC
36 points
5 comments6 min readLW link
(samenright.com)

Ex­plana­tory Par­si­mony, Ex­plana­tory Su­perflu­ous­ness and Use­less­ness of New­ton’s First Law

Jimdrix_Hendri24 Jan 2023 17:21 UTC
−2 points
7 comments2 min readLW link

Guessti­mate: Why and how to use it

24 Jan 2023 16:24 UTC
8 points
0 comments3 min readLW link
(forum.effectivealtruism.org)

GWWC Pledge History

jefftk24 Jan 2023 15:50 UTC
15 points
0 comments3 min readLW link
(www.jefftk.com)

Gra­di­ent hack­ing is ex­tremely difficult

beren24 Jan 2023 15:45 UTC
162 points
22 comments5 min readLW link

[Question] What sci-fi books are most rele­vant to a fu­ture with trans­for­ma­tive AI?

sid24 Jan 2023 15:30 UTC
2 points
9 comments1 min readLW link

Grant-mak­ing in EA should con­sider peer-re­view­ing grant ap­pli­ca­tions along the pub­lic-sec­tor model

Ben Smith24 Jan 2023 15:01 UTC
0 points
3 comments1 min readLW link

“Endgame safety” for AGI

Steven Byrnes24 Jan 2023 14:15 UTC
85 points
10 comments6 min readLW link

Thoughts on hard­ware /​ com­pute re­quire­ments for AGI

Steven Byrnes24 Jan 2023 14:03 UTC
59 points
30 comments24 min readLW link

Pa­ram­e­ter Scal­ing Comes for RL, Maybe

1a3orn24 Jan 2023 13:55 UTC
100 points
3 comments14 min readLW link

How to find cool things in a new place

Sam F. Brown24 Jan 2023 11:20 UTC
12 points
0 comments1 min readLW link

[Cross­post] ACX 2022 Pre­dic­tion Con­test Results

24 Jan 2023 6:56 UTC
46 points
6 comments8 min readLW link

The Hu­man-AI Reflec­tive Equilibrium

Allison Duettmann24 Jan 2023 1:32 UTC
22 points
1 comment24 min readLW link

“Sta­tus” can be cor­ro­sive; here’s how I han­dle it

Akash24 Jan 2023 1:25 UTC
71 points
8 comments6 min readLW link

[Question] What area of the digi­tal do­main seems safe from AI in the next 5-10 years?

Adrien Chauvet24 Jan 2023 1:16 UTC
11 points
14 comments1 min readLW link

Some of my dis­agree­ments with List of Lethalities

TurnTrout24 Jan 2023 0:25 UTC
70 points
7 comments10 min readLW link

Round­ing Some­one Off

David Udell24 Jan 2023 0:03 UTC
25 points
0 comments5 min readLW link

Life Has a Cruel Symmetry

philh23 Jan 2023 23:40 UTC
21 points
5 comments11 min readLW link
(reasonableapproximation.net)

High­lights and Prizes from the 2021 Re­view Phase

Raemon23 Jan 2023 21:41 UTC
38 points
14 comments21 min readLW link

[Question] AI safety mile­stones?

Zach Stein-Perlman23 Jan 2023 21:00 UTC
7 points
5 comments1 min readLW link

[Question] A post-quan­tum the­ory of clas­si­cal grav­ity?

Logan Zoellner23 Jan 2023 20:39 UTC
13 points
5 comments1 min readLW link

Meals For Un­clear Die­tary Restrictions

jefftk23 Jan 2023 20:00 UTC
17 points
3 comments2 min readLW link
(www.jefftk.com)

It’s ok

stratospher23 Jan 2023 18:11 UTC
1 point
0 comments2 min readLW link

Ex­per­i­ment­ing with beta.char­ac­ter.ai

svemirski23 Jan 2023 17:31 UTC
−3 points
5 comments1 min readLW link

This week in fashion

Jan23 Jan 2023 17:23 UTC
29 points
7 comments7 min readLW link
(universalprior.substack.com)

Movie Re­view: Megan

Zvi23 Jan 2023 12:50 UTC
60 points
19 comments24 min readLW link
(thezvi.wordpress.com)

[Question] Has pri­vate AGI re­search made in­de­pen­dent safety re­search in­effec­tive already? What should we do about this?

Roman Leventov23 Jan 2023 7:36 UTC
43 points
5 comments5 min readLW link

De­con­fus­ing “Ca­pa­bil­ities vs. Align­ment”

RobertM23 Jan 2023 4:46 UTC
27 points
7 comments2 min readLW link

What a com­pute-cen­tric frame­work says about AI take­off speeds

Tom Davidson23 Jan 2023 4:02 UTC
187 points
30 comments16 min readLW link1 review

Philly Rat Fest

LoganChipkin23 Jan 2023 4:01 UTC
9 points
0 comments1 min readLW link

EA & LW Fo­rum Weekly Sum­mary (16th − 22nd Jan ’23)

Zoe Williams23 Jan 2023 3:46 UTC
13 points
0 comments1 min readLW link

Con­sider Try­ing Dictation

jefftk22 Jan 2023 22:50 UTC
23 points
10 comments2 min readLW link
(www.jefftk.com)

Emo­tional at­tach­ment to AIs opens doors to problems

Igor Ivanov22 Jan 2023 20:28 UTC
20 points
10 comments4 min readLW link

What fills a vac­uum?

Logan Kieller22 Jan 2023 19:25 UTC
11 points
6 comments2 min readLW link

Gem­ini mod­el­ing

TsviBT22 Jan 2023 14:28 UTC
12 points
8 comments11 min readLW link

Large lan­guage mod­els learn to rep­re­sent the world

gjm22 Jan 2023 13:10 UTC
101 points
20 comments3 min readLW link1 review

Quan­tum Suicide, De­ci­sion The­ory, and The Multiverse

Slimepriestess22 Jan 2023 8:44 UTC
7 points
38 comments10 min readLW link
(voidgoddess.org)

NYT: Google will “re­cal­ibrate” the risk of re­leas­ing AI due to com­pe­ti­tion with OpenAI

Michael Huang22 Jan 2023 8:38 UTC
47 points
2 comments1 min readLW link
(www.nytimes.com)

[Question] Just don’t make a util­ity max­i­mizer?

FinalFormal222 Jan 2023 6:33 UTC
−1 points
10 comments1 min readLW link

A “su­per-in­tel­li­gence” un­in­tended con­se­quences “pre­serve life” scenario

Program Den22 Jan 2023 4:38 UTC
−12 points
0 comments1 min readLW link

How Do We Pro­tect AI From Hu­mans?

Alex Beyman22 Jan 2023 3:59 UTC
−4 points
11 comments6 min readLW link

To Ques­tion God

Collapse Kitty22 Jan 2023 3:51 UTC
8 points
2 comments3 min readLW link

Some­what-Brief thoughts on rea­son­able­ness of conspiracy

grabbag22 Jan 2023 3:50 UTC
−14 points
16 comments5 min readLW link