My techno-op­ti­mism [By Vi­talik Bu­terin]

habrykaNov 27, 2023, 11:53 PM
107 points
17 comments2 min readLW link
(www.lesswrong.com)

[Question] Could Ger­many have won World War I with high prob­a­bil­ity given the benefit of hind­sight?

RokoNov 27, 2023, 10:52 PM
10 points
18 comments1 min readLW link

[Question] Could World War I have been pre­vented given the benefit of hind­sight?

RokoNov 27, 2023, 10:39 PM
16 points
8 comments1 min readLW link

AISC 2024 - Pro­ject Summaries

NickyPNov 27, 2023, 10:32 PM
48 points
3 comments18 min readLW link

“Epistemic range of mo­tion” and LessWrong moderation

Nov 27, 2023, 9:58 PM
65 points
3 comments12 min readLW link

Ap­ply to the Con­cep­tual Boundaries Work­shop for AI Safety

ChipmonkNov 27, 2023, 9:04 PM
50 points
0 comments3 min readLW link

There is no IQ for AI

Gabriel AlfourNov 27, 2023, 6:21 PM
30 points
10 comments9 min readLW link
(cognition.cafe)

Two con­cepts of an “epi­sode” (Sec­tion 2.2.1 of “Schem­ing AIs”)

Joe CarlsmithNov 27, 2023, 6:01 PM
19 points
1 comment13 min readLW link

[Linkpost] Ge­orge Mack’s Razors

trevorNov 27, 2023, 5:53 PM
38 points
8 comments3 min readLW link
(twitter.com)

On pos­si­ble cross-fer­til­iza­tion be­tween AI and neu­ro­science [Creativity]

Bill BenzonNov 27, 2023, 4:50 PM
15 points
22 comments7 min readLW link

Ethico­physics I

MadHatterNov 27, 2023, 3:44 PM
−1 points
16 comments1 min readLW link
(open.substack.com)

Sen­tience In­sti­tute 2023 End of Year Summary

michael_delloNov 27, 2023, 12:11 PM
11 points
0 comments5 min readLW link
(www.sentienceinstitute.org)

[Question] A Ques­tion about Cor­rigi­bil­ity (2015)

A.H.Nov 27, 2023, 12:05 PM
4 points
2 comments1 min readLW link

Ap­pen­dices to the live agendas

Nov 27, 2023, 11:10 AM
16 points
4 comments1 min readLW link

Shal­low re­view of live agen­das in al­ign­ment & safety

Nov 27, 2023, 11:10 AM
335 points
72 comments29 min readLW link

Napoleon stole the Ro­man In­qui­si­tion archives and in­ves­ti­gated the Gal­ileo case

Meow PNov 27, 2023, 9:41 AM
−3 points
0 comments1 min readLW link
(www.cricetuscricetus.co.uk)

Found Paper: “FDT in an evolu­tion­ary en­vi­ron­ment”

the gears to ascensionNov 27, 2023, 5:27 AM
30 points
47 comments1 min readLW link
(arxiv.org)

[Question] why did OpenAI em­ploy­ees sign

bhauthNov 27, 2023, 5:21 AM
49 points
23 comments1 min readLW link

Un­known Probabilities

transhumanist_atom_understanderNov 27, 2023, 2:30 AM
22 points
0 comments4 min readLW link

Jus­tifi­ca­tion for Induction

KrantzNov 27, 2023, 2:05 AM
2 points
25 comments5 min readLW link

Si­tu­a­tional aware­ness (Sec­tion 2.1 of “Schem­ing AIs”)

Joe CarlsmithNov 26, 2023, 11:00 PM
10 points
5 comments8 min readLW link

AXRP Epi­sode 26 - AI Gover­nance with Eliz­a­beth Seger

DanielFilanNov 26, 2023, 11:00 PM
14 points
0 comments66 min readLW link

Solv­ing Two-Sided Ad­verse Selec­tion with Pre­dic­tion Mar­ket Matchmaking

Saul MunnNov 26, 2023, 8:10 PM
16 points
7 comments4 min readLW link
(www.brasstacks.blog)

Wikipe­dia is not so great, and what can be done about it.

euserxNov 26, 2023, 7:13 PM
0 points
27 comments16 min readLW link
(forum.effectivealtruism.org)

[Question] Help me solve this prob­lem: The basilisk isn’t real, but peo­ple are

canary_itmNov 26, 2023, 5:44 PM
−19 points
4 comments1 min readLW link

Twin Cities ACX Meetup—De­cem­ber 2023

Timothy M.Nov 26, 2023, 5:32 PM
1 point
1 comment1 min readLW link

Spaced rep­e­ti­tion for teach­ing two-year olds how to read (In­ter­view)

ChipmonkNov 26, 2023, 4:52 PM
48 points
9 comments5 min readLW link
(chipmonk.substack.com)

Paper out now on cre­a­tine and cog­ni­tive performance

FabienneNov 26, 2023, 10:58 AM
59 points
2 comments1 min readLW link

Why Q*, if real, might be a game changer

ShmiNov 26, 2023, 6:12 AM
5 points
6 comments1 min readLW link

Mo­ral Real­ity Check (a short story)

jessicataNov 26, 2023, 5:03 AM
148 points
45 comments21 min readLW link1 review
(unstableontology.com)

Ac­count­ing for Fore­gone Pay

jefftkNov 26, 2023, 3:30 AM
11 points
0 comments2 min readLW link
(www.jefftk.com)

Cor­rigi­bil­ity or DWIM is an at­trac­tive pri­mary goal for AGI

Seth HerdNov 25, 2023, 7:37 PM
16 points
4 comments1 min readLW link

On “slack” in train­ing (Sec­tion 1.5 of “Schem­ing AIs”)

Joe CarlsmithNov 25, 2023, 5:51 PM
1 point
0 comments5 min readLW link

An­nounc­ing New Begin­ner-friendly Book on AI Safety and Risk

Darren McKeeNov 25, 2023, 3:57 PM
64 points
2 comments1 min readLW link

Fer­til­ity as Metascience

Maxwell TabarrokNov 25, 2023, 3:42 PM
20 points
1 comment3 min readLW link
(maximumprogress.substack.com)

Re­ac­tion to “Em­pow­er­ment is (al­most) All We Need” : an open-ended alternative

Ryo Nov 25, 2023, 3:35 PM
9 points
3 comments5 min readLW link

How Microsoft’s ruth­less em­ployee eval­u­a­tion sys­tem an­nihilated team col­lab­o­ra­tion.

positivesumNov 25, 2023, 1:25 PM
3 points
2 comments1 min readLW link
(tryingtruly.substack.com)

What are the re­sults of more parental su­per­vi­sion and less out­door play?

juliawiseNov 25, 2023, 12:52 PM
226 points
31 comments5 min readLW link

A sim­ple treach­er­ous turn demonstration

Nikola JurkovicNov 25, 2023, 4:51 AM
22 points
5 comments3 min readLW link

The two para­graph ar­gu­ment for AI risk

CronoDASNov 25, 2023, 2:01 AM
19 points
8 comments1 min readLW link

Good­hart’s Law Ex­am­ple: Train­ing Ver­ifiers to Solve Math Word Problems

Chris_LeongNov 25, 2023, 12:53 AM
27 points
2 comments1 min readLW link
(arxiv.org)

Some thoughts on CBDC

PixelatedPenguinNov 25, 2023, 12:32 AM
−1 points
1 comment1 min readLW link

Test­ing for con­se­quence-blind­ness in LLMs us­ing the HI-ADS unit test.

David Scott Krueger (formerly: capybaralet)Nov 24, 2023, 11:35 PM
25 points
2 comments2 min readLW link

Epoch is hiring an ML Distributed Sys­tems Se­nior Researcher

Nov 24, 2023, 10:33 PM
2 points
0 comments4 min readLW link
(careers.rethinkpriorities.org)

Ar­ti­cle Dis­cus­sion And Free Pizza—St Paul

25HourNov 24, 2023, 9:02 PM
1 point
0 comments1 min readLW link

Why fo­cus on schemers in par­tic­u­lar (Sec­tions 1.3 and 1.4 of “Schem­ing AIs”)

Joe CarlsmithNov 24, 2023, 7:18 PM
8 points
0 comments22 min readLW link

Sur­viv­ing and Shap­ing Long-Term Com­pe­ti­tions: Les­sons from Net Assessment

Nov 24, 2023, 6:18 PM
5 points
0 comments13 min readLW link

Abil­ity to solve long-hori­zon tasks cor­re­lates with want­ing things in the be­hav­iorist sense

So8resNov 24, 2023, 5:37 PM
196 points
84 comments5 min readLW link1 review

The Limi­ta­tions of GPT-4

p.b.Nov 24, 2023, 3:30 PM
27 points
12 comments4 min readLW link

Progress links di­gest, 2023-11-24: Bot­tle­necks of ag­ing, Star­ship launches, and much more

jasoncrawfordNov 24, 2023, 3:25 PM
40 points
1 comment14 min readLW link
(rootsofprogress.org)