A different observation of Vavilov Day

Elizabeth 26 Jan 2023 21:50 UTC
30 points
1 comment 1 min read LW link
(acesounderglass.com)

All AGI Safety questions welcome (especially basic ones) [~monthly thread]

26 Jan 2023 21:01 UTC
39 points
81 comments 2 min read LW link

Just another thought experiment

Bohdan Kudlai 26 Jan 2023 19:29 UTC
−11 points
0 comments 1 min read LW link

Exquisite Oracle: A Dadaist-Inspired Literary Game for Many Friends (or 1 AI)

Yitz 26 Jan 2023 18:26 UTC
6 points
1 comment 1 min read LW link

AI Risk Management Framework | NIST

DragonGod 26 Jan 2023 15:27 UTC
36 points
4 comments 2 min read LW link
(www.nist.gov)

“How to Escape from the Simulation” - Seeds of Science call for reviewers

rogersbacon 26 Jan 2023 15:11 UTC
12 points
0 comments 1 min read LW link

Loom: Why and How to use it

brook 26 Jan 2023 14:34 UTC
2 points
5 comments 1 min read LW link

Covid 1/26/23: Case Count Crash

Zvi 26 Jan 2023 12:50 UTC
32 points
5 comments 9 min read LW link
(thezvi.wordpress.com)

[Question] How are you currently modeling COVID contagiousness?

CounterBlunder 26 Jan 2023 4:46 UTC
2 points
2 comments 1 min read LW link

[Question] What’s the simplest concrete unsolved problem in AI alignment?

agg 26 Jan 2023 4:15 UTC
28 points
4 comments 1 min read LW link

2022 Less Wrong Census/Survey: Request for Comments

Screwtape 25 Jan 2023 20:57 UTC
5 points
29 comments 1 min read LW link

Next steps after AGISF at UMich

JakubK 25 Jan 2023 20:57 UTC
10 points
0 comments 5 min read LW link
(docs.google.com)

AGI will have learnt utility functions

beren 25 Jan 2023 19:42 UTC
36 points
3 comments 13 min read LW link

[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.

25 Jan 2023 19:03 UTC
48 points
6 comments 12 min read LW link

Spreading messages to help with the most important century

HoldenKarnofsky 25 Jan 2023 18:20 UTC
75 points
4 comments 18 min read LW link
(www.cold-takes.com)

My Model Of EA Burnout

LoganStrohl 25 Jan 2023 17:52 UTC
239 points
49 comments 5 min read LW link

Thoughts on the impact of RLHF research

paulfchristiano 25 Jan 2023 17:23 UTC
250 points
102 comments 9 min read LW link

[Question] Could AI be used to engineer a sociopolitical situation where humans can solve the problems surrounding AGI?

hollowing 25 Jan 2023 17:17 UTC
1 point
6 comments 1 min read LW link

Progress links and tweets, 2023-01-25

jasoncrawford 25 Jan 2023 16:12 UTC
8 points
0 comments 1 min read LW link
(rootsofprogress.org)

Visualisation of Probability Mass

brook 25 Jan 2023 15:09 UTC
7 points
0 comments 1 min read LW link

When Did EA Start?

jefftk 25 Jan 2023 14:30 UTC
37 points
2 comments 2 min read LW link
(www.jefftk.com)

Some Thoughts on AI Art

abramdemski 25 Jan 2023 14:18 UTC
74 points
20 comments 7 min read LW link

Quick thoughts on “scalable oversight” / “super-human feedback” research

David Scott Krueger (formerly: capybaralet) 25 Jan 2023 12:55 UTC
27 points
9 comments 2 min read LW link

Sapir-Whorf for Rationalists

Duncan Sabien (Deactivated) 25 Jan 2023 7:58 UTC
149 points
49 comments 19 min read LW link

ChatGPT vs the 2-4-6 Task

cwillu 25 Jan 2023 6:59 UTC
20 points
4 comments 3 min read LW link

Pessimistic Shard Theory

Garrett Baker 25 Jan 2023 0:59 UTC
72 points
13 comments 3 min read LW link

Thatcher’s Axiom

Edward P. Könings 24 Jan 2023 22:35 UTC
10 points
22 comments 4 min read LW link

[Question] Some questions about free will compatibilism

Asking Questions 24 Jan 2023 21:54 UTC
3 points
21 comments 6 min read LW link

Alexander and Yudkowsky on AGI goals

24 Jan 2023 21:09 UTC
174 points
52 comments 26 min read LW link

[Question] Is _The Age of AI: And Our Human Future_ worth reading

jmh 24 Jan 2023 21:05 UTC
4 points
0 comments 1 min read LW link

Inverse Scaling Prize: Second Round Winners

24 Jan 2023 20:12 UTC
58 points
17 comments 15 min read LW link

ChatGPT intimates a tantalizing future; its core LLM is organized on multiple levels; and it has broken the idea of thinking.

Bill Benzon 24 Jan 2023 19:05 UTC
5 points
0 comments 5 min read LW link

How-to Transformer Mechanistic Interpretability - in 50 lines of code or less!

StefanHex 24 Jan 2023 18:45 UTC
47 points
5 comments 13 min read LW link

The Cabinet of Wikipedian Curiosities

Sam Enright 24 Jan 2023 18:22 UTC
36 points
5 comments 6 min read LW link
(samenright.com)

Explanatory Parsimony, Explanatory Superfluousness and Uselessness of Newton’s First Law

Jimdrix_Hendri 24 Jan 2023 17:21 UTC
−2 points
7 comments 2 min read LW link

Guesstimate: Why and how to use it

24 Jan 2023 16:24 UTC
8 points
0 comments 3 min read LW link
(forum.effectivealtruism.org)

GWWC Pledge History

jefftk 24 Jan 2023 15:50 UTC
15 points
0 comments 3 min read LW link
(www.jefftk.com)

Gradient hacking is extremely difficult

beren 24 Jan 2023 15:45 UTC
162 points
22 comments 5 min read LW link

[Question] What sci-fi books are most relevant to a future with transformative AI?

sid 24 Jan 2023 15:30 UTC
2 points
9 comments 1 min read LW link

Grant-making in EA should consider peer-reviewing grant applications along the public-sector model

Ben Smith 24 Jan 2023 15:01 UTC
0 points
3 comments 1 min read LW link

“Endgame safety” for AGI

Steven Byrnes 24 Jan 2023 14:15 UTC
84 points
10 comments 6 min read LW link

Thoughts on hardware / compute requirements for AGI

Steven Byrnes 24 Jan 2023 14:03 UTC
52 points
30 comments 24 min read LW link

Parameter Scaling Comes for RL, Maybe

1a3orn 24 Jan 2023 13:55 UTC
99 points
3 comments 14 min read LW link

How to find cool things in a new place

Sam F. Brown 24 Jan 2023 11:20 UTC
12 points
0 comments 1 min read LW link

[Crosspost] ACX 2022 Prediction Contest Results

24 Jan 2023 6:56 UTC
46 points
6 comments 8 min read LW link

The Human-AI Reflective Equilibrium

Allison Duettmann 24 Jan 2023 1:32 UTC
22 points
1 comment 24 min read LW link

“Status” can be corrosive; here’s how I handle it

Akash 24 Jan 2023 1:25 UTC
71 points
8 comments 6 min read LW link

[Question] What area of the digital domain seems safe from AI in the next 5-10 years?

Adrien Chauvet 24 Jan 2023 1:16 UTC
11 points
14 comments 1 min read LW link

Some of my disagreements with List of Lethalities

TurnTrout 24 Jan 2023 0:25 UTC
68 points
7 comments 10 min read LW link

Rounding Someone Off

David Udell 24 Jan 2023 0:03 UTC
25 points
0 comments 5 min read LW link