2022 Less Wrong Cen­sus/​Sur­vey: Re­quest for Comments

Screwtape25 Jan 2023 20:57 UTC
5 points
29 comments1 min readLW link

Next steps af­ter AGISF at UMich

JakubK25 Jan 2023 20:57 UTC
10 points
0 comments5 min readLW link
(docs.google.com)

AGI will have learnt util­ity functions

beren25 Jan 2023 19:42 UTC
36 points
3 comments13 min readLW link

[RFC] Pos­si­ble ways to ex­pand on “Dis­cov­er­ing La­tent Knowl­edge in Lan­guage Models Without Su­per­vi­sion”.

25 Jan 2023 19:03 UTC
48 points
6 comments12 min readLW link

Spread­ing mes­sages to help with the most im­por­tant century

HoldenKarnofsky25 Jan 2023 18:20 UTC
75 points
4 comments18 min readLW link
(www.cold-takes.com)

My Model Of EA Burnout

LoganStrohl25 Jan 2023 17:52 UTC
255 points
50 comments5 min readLW link1 review

Thoughts on the im­pact of RLHF research

paulfchristiano25 Jan 2023 17:23 UTC
250 points
102 comments9 min readLW link

[Question] Could AI be used to en­g­ineer a so­ciopoli­ti­cal situ­a­tion where hu­mans can solve the prob­lems sur­round­ing AGI?

hollowing25 Jan 2023 17:17 UTC
1 point
6 comments1 min readLW link

Progress links and tweets, 2023-01-25

jasoncrawford25 Jan 2023 16:12 UTC
8 points
0 comments1 min readLW link
(rootsofprogress.org)

Vi­su­al­i­sa­tion of Prob­a­bil­ity Mass

brook25 Jan 2023 15:09 UTC
7 points
0 comments1 min readLW link

When Did EA Start?

jefftk25 Jan 2023 14:30 UTC
37 points
2 comments2 min readLW link
(www.jefftk.com)

Some Thoughts on AI Art

abramdemski25 Jan 2023 14:18 UTC
74 points
20 comments7 min readLW link

Quick thoughts on “scal­able over­sight” /​ “su­per-hu­man feed­back” research

David Scott Krueger (formerly: capybaralet)25 Jan 2023 12:55 UTC
27 points
9 comments2 min readLW link

Sapir-Whorf for Rationalists

Duncan Sabien (Deactivated)25 Jan 2023 7:58 UTC
154 points
49 comments19 min readLW link

ChatGPT vs the 2-4-6 Task

cwillu25 Jan 2023 6:59 UTC
20 points
4 comments3 min readLW link

Pes­simistic Shard Theory

Garrett Baker25 Jan 2023 0:59 UTC
72 points
13 comments3 min readLW link

Thatcher’s Axiom

Edward P. Könings24 Jan 2023 22:35 UTC
10 points
22 comments4 min readLW link

[Question] Some ques­tions about free will compatibilism

Asking Questions24 Jan 2023 21:54 UTC
3 points
21 comments6 min readLW link

Alexan­der and Yud­kowsky on AGI goals

24 Jan 2023 21:09 UTC
177 points
53 comments26 min readLW link1 review

[Question] Is _The Age of AI: And Our Hu­man Fu­ture_ worth reading

jmh24 Jan 2023 21:05 UTC
4 points
0 comments1 min readLW link

In­verse Scal­ing Prize: Se­cond Round Winners

24 Jan 2023 20:12 UTC
58 points
17 comments15 min readLW link

ChatGPT in­ti­mates a tan­ta­l­iz­ing fu­ture; its core LLM is or­ga­nized on mul­ti­ple lev­els; and it has bro­ken the idea of think­ing.

Bill Benzon24 Jan 2023 19:05 UTC
5 points
0 comments5 min readLW link

How-to Trans­former Mechanis­tic In­ter­pretabil­ity—in 50 lines of code or less!

StefanHex24 Jan 2023 18:45 UTC
47 points
5 comments13 min readLW link

The Cabi­net of Wikipe­dian Curiosities

Sam Enright24 Jan 2023 18:22 UTC
36 points
5 comments6 min readLW link
(samenright.com)

Ex­plana­tory Par­si­mony, Ex­plana­tory Su­perflu­ous­ness and Use­less­ness of New­ton’s First Law

Jimdrix_Hendri24 Jan 2023 17:21 UTC
−2 points
7 comments2 min readLW link

Guessti­mate: Why and how to use it

24 Jan 2023 16:24 UTC
8 points
0 comments3 min readLW link
(forum.effectivealtruism.org)

GWWC Pledge History

jefftk24 Jan 2023 15:50 UTC
15 points
0 comments3 min readLW link
(www.jefftk.com)

Gra­di­ent hack­ing is ex­tremely difficult

beren24 Jan 2023 15:45 UTC
162 points
22 comments5 min readLW link

[Question] What sci-fi books are most rele­vant to a fu­ture with trans­for­ma­tive AI?

sid24 Jan 2023 15:30 UTC
2 points
9 comments1 min readLW link

Grant-mak­ing in EA should con­sider peer-re­view­ing grant ap­pli­ca­tions along the pub­lic-sec­tor model

Ben Smith24 Jan 2023 15:01 UTC
0 points
3 comments1 min readLW link

“Endgame safety” for AGI

Steven Byrnes24 Jan 2023 14:15 UTC
85 points
10 comments6 min readLW link

Thoughts on hard­ware /​ com­pute re­quire­ments for AGI

Steven Byrnes24 Jan 2023 14:03 UTC
59 points
30 comments24 min readLW link

Pa­ram­e­ter Scal­ing Comes for RL, Maybe

1a3orn24 Jan 2023 13:55 UTC
100 points
3 comments14 min readLW link

How to find cool things in a new place

Sam F. Brown24 Jan 2023 11:20 UTC
12 points
0 comments1 min readLW link

[Cross­post] ACX 2022 Pre­dic­tion Con­test Results

24 Jan 2023 6:56 UTC
46 points
6 comments8 min readLW link

The Hu­man-AI Reflec­tive Equilibrium

Allison Duettmann24 Jan 2023 1:32 UTC
22 points
1 comment24 min readLW link

“Sta­tus” can be cor­ro­sive; here’s how I han­dle it

Akash24 Jan 2023 1:25 UTC
71 points
8 comments6 min readLW link

[Question] What area of the digi­tal do­main seems safe from AI in the next 5-10 years?

Adrien Chauvet24 Jan 2023 1:16 UTC
11 points
14 comments1 min readLW link

Some of my dis­agree­ments with List of Lethalities

TurnTrout24 Jan 2023 0:25 UTC
70 points
7 comments10 min readLW link

Round­ing Some­one Off

David Udell24 Jan 2023 0:03 UTC
25 points
0 comments5 min readLW link

Life Has a Cruel Symmetry

philh23 Jan 2023 23:40 UTC
21 points
5 comments11 min readLW link
(reasonableapproximation.net)

High­lights and Prizes from the 2021 Re­view Phase

Raemon23 Jan 2023 21:41 UTC
38 points
14 comments21 min readLW link

[Question] AI safety mile­stones?

Zach Stein-Perlman23 Jan 2023 21:00 UTC
7 points
5 comments1 min readLW link

[Question] A post-quan­tum the­ory of clas­si­cal grav­ity?

Logan Zoellner23 Jan 2023 20:39 UTC
13 points
5 comments1 min readLW link

Meals For Un­clear Die­tary Restrictions

jefftk23 Jan 2023 20:00 UTC
17 points
3 comments2 min readLW link
(www.jefftk.com)

It’s ok

stratospher23 Jan 2023 18:11 UTC
1 point
0 comments2 min readLW link

Ex­per­i­ment­ing with beta.char­ac­ter.ai

svemirski23 Jan 2023 17:31 UTC
−3 points
5 comments1 min readLW link

This week in fashion

Jan23 Jan 2023 17:23 UTC
29 points
7 comments7 min readLW link
(universalprior.substack.com)

Movie Re­view: Megan

Zvi23 Jan 2023 12:50 UTC
60 points
19 comments24 min readLW link
(thezvi.wordpress.com)

[Question] Has pri­vate AGI re­search made in­de­pen­dent safety re­search in­effec­tive already? What should we do about this?

Roman Leventov23 Jan 2023 7:36 UTC
43 points
5 comments5 min readLW link