Book Launch: “The Carv­ing of Real­ity,” Best of LessWrong vol. III

Raemon16 Aug 2023 23:52 UTC
131 points
22 comments5 min readLW link

One ex­am­ple of how LLM pro­pa­ganda at­tacks can hack the brain

trevor16 Aug 2023 21:41 UTC
24 points
8 comments4 min readLW link

If we had known the at­mo­sphere would ignite

Jeffs16 Aug 2023 20:28 UTC
56 points
63 comments2 min readLW link

Stampy’s AI Safety Info—New Distil­la­tions #4 [July 2023]

markov16 Aug 2023 19:03 UTC
22 points
10 comments1 min readLW link
(aisafety.info)

A Proof of Löb’s The­o­rem us­ing Com­putabil­ity Theory

jessicata16 Aug 2023 18:57 UTC
71 points
0 comments17 min readLW link
(unstableontology.com)

Sum­mary of and Thoughts on the Hotz/​Yud­kowsky Debate

Zvi16 Aug 2023 16:50 UTC
105 points
47 comments9 min readLW link
(thezvi.wordpress.com)

Red Pill vs Blue Pill, Bayes style

ErickBall16 Aug 2023 15:23 UTC
28 points
33 comments1 min readLW link

What does it mean to “trust sci­ence”?

jasoncrawford16 Aug 2023 14:56 UTC
34 points
9 comments1 min readLW link
(rootsofprogress.org)

Ja­son Crawford /​ The Roots of Progress in Ban­ga­lore, Au­gust 21 to Septem­ber 8

jasoncrawford16 Aug 2023 13:36 UTC
13 points
1 comment1 min readLW link
(rootsofprogress.org)

Gain­ing knowl­edge at a price

DavidMadsen16 Aug 2023 10:21 UTC
−4 points
5 comments1 min readLW link

Un­der­stand­ing and vi­su­al­iz­ing syco­phancy datasets

Nina Panickssery16 Aug 2023 5:34 UTC
45 points
0 comments6 min readLW link

Ge­orge Hotz vs Eliezer Yud­kowsky AI Safety De­bate—link and brief discussion

Gerald Monroe16 Aug 2023 4:31 UTC
11 points
26 comments2 min readLW link
(www.youtube.com)

[Question] How to take ad­van­age of the mar­ket’s ir­ra­tional­ity re­gard­ing AGI?

GeneSmith16 Aug 2023 3:30 UTC
23 points
6 comments2 min readLW link

In­finite Ethics: In­finite Problems

omnizoid16 Aug 2023 2:44 UTC
−2 points
25 comments23 min readLW link

Pri­vate Biosta­sis & Cry­on­ics Social

Mati_Roy16 Aug 2023 2:34 UTC
11 points
0 comments1 min readLW link

Some thoughts on Ge­orge Hotz vs Eliezer Yudkowsky

TristanTrim15 Aug 2023 23:33 UTC
10 points
3 comments2 min readLW link

Un­der­stand­ing the In­for­ma­tion Flow in­side Large Lan­guage Models

15 Aug 2023 21:13 UTC
19 points
0 comments17 min readLW link

[Question] Any re­search in “probe-tun­ing” of LLMs?

Roman Leventov15 Aug 2023 21:01 UTC
20 points
3 comments1 min readLW link

Can AI Trans­form the Elec­torate into a Ci­ti­zen’s Assem­bly

RoscoHunter15 Aug 2023 17:52 UTC
−3 points
5 comments3 min readLW link

Ten Thou­sand Years of Solitude

agp15 Aug 2023 17:45 UTC
136 points
19 comments4 min readLW link
(www.discovermagazine.com)

AISN #19: US-China Com­pe­ti­tion on AI Chips, Mea­sur­ing Lan­guage Agent Devel­op­ments, Eco­nomic Anal­y­sis of Lan­guage Model Pro­pa­ganda, and White House AI Cy­ber Challenge

15 Aug 2023 16:10 UTC
21 points
0 comments5 min readLW link
(newsletter.safe.ai)

[Question] What is the most effec­tive anti-tyranny char­ity?

lc15 Aug 2023 15:26 UTC
20 points
10 comments1 min readLW link

My check­list for pub­lish­ing a blog post

Steven Byrnes15 Aug 2023 15:04 UTC
84 points
6 comments3 min readLW link

The Dun­bar Play­book: A CRM sys­tem for your friends

Severin T. Seehrich15 Aug 2023 8:44 UTC
33 points
16 comments5 min readLW link
(amoretlicentia.substack.com)

Op­ti­cal Illu­sions are Out of Distri­bu­tion Errors

vitaliya15 Aug 2023 2:23 UTC
30 points
8 comments2 min readLW link

A short calcu­la­tion about a Twit­ter poll

Ege Erdil14 Aug 2023 19:48 UTC
64 points
64 comments11 min readLW link

De­com­pos­ing in­de­pen­dent gen­er­al­iza­tions in neu­ral net­works via Hes­sian analysis

14 Aug 2023 17:04 UTC
83 points
4 comments1 min readLW link

Memetic Judo #2: In­cor­po­ral Switches and Lev­ers Compendium

Max TK14 Aug 2023 16:53 UTC
19 points
6 comments17 min readLW link

Ex­is­ten­tially rele­vant thought ex­per­i­ment: To kill or not to kill, a sniper, a man and a but­ton.

AlexFromSafeTransition14 Aug 2023 10:53 UTC
−18 points
6 comments4 min readLW link

Step­ping down as mod­er­a­tor on LW

Kaj_Sotala14 Aug 2023 10:46 UTC
82 points
1 comment1 min readLW link

An­nounc­ing Man­i­fest 2023 (Sep 22-24 in Berkeley)

14 Aug 2023 5:13 UTC
31 points
0 comments2 min readLW link

Co­her­ence Ther­apy with LLMs—quick demo

Chipmonk14 Aug 2023 3:34 UTC
19 points
11 comments1 min readLW link

Listen For What You Don’t Hear: The Case for Contrarianism

Yashvardhan Sharma14 Aug 2023 2:53 UTC
1 point
1 comment5 min readLW link

Recipe: Hes­sian eigen­vec­tor com­pu­ta­tion for PyTorch models

Nina Panickssery14 Aug 2023 2:48 UTC
32 points
5 comments5 min readLW link

[Question] As­sum­ing LK99 or similar: how to ac­cel­er­ate com­mer­cial­iza­tion?

ryan_b13 Aug 2023 21:34 UTC
7 points
5 comments1 min readLW link

Twin Cities ACX Meetup Septem­ber 2023

Timothy M.13 Aug 2023 20:10 UTC
1 point
4 comments1 min readLW link

Fun­da­men­tal Uncer­tainty: Chap­ter 1 - How can we know what’s true?

Gordon Seidoh Worley13 Aug 2023 18:55 UTC
17 points
4 comments12 min readLW link

We Should Pre­pare for a Larger Rep­re­sen­ta­tion of Academia in AI Safety

Leon Lang13 Aug 2023 18:03 UTC
90 points
13 comments5 min readLW link

AGI is eas­ier than robotaxis

Daniel Kokotajlo13 Aug 2023 17:00 UTC
41 points
30 comments4 min readLW link

[Question] If we’re al­ive in 5 years, do you think the fund­ing situ­a­tion will be much bet­ter by then? (With large amounts of gov­ern­ment fund­ing, for ex­am­ple)

kuira13 Aug 2023 16:32 UTC
−2 points
6 comments1 min readLW link

Ab­stract The­o­ries of Everything

Philosophistry13 Aug 2023 6:06 UTC
−17 points
0 comments1 min readLW link

[Linkpost] Per­sonal and Psy­cholog­i­cal Di­men­sions of AI Re­searchers Con­fronting AI Catas­trophic Risks

Bogdan Ionut Cirstea12 Aug 2023 22:02 UTC
42 points
0 comments1 min readLW link

The Em­pa­thy Eng­ine: A De­con­struc­tion of the So­cietal Me­ta­mor­pho­sis through Tech­nolog­i­cal Em­pa­thy Augmentation

bigdickproblems12 Aug 2023 18:23 UTC
−30 points
3 comments2 min readLW link

The Benev­olent Ruler’s Hand­book (Part 2): Mo­ral­ity Rules

FCCC12 Aug 2023 14:25 UTC
5 points
0 comments4 min readLW link

Learn­ing as you play: an­thropic shadow in deadly games

dr_s12 Aug 2023 7:34 UTC
37 points
28 comments35 min readLW link

Biolog­i­cal An­chors: The Trick that Might or Might Not Work

Scott Alexander12 Aug 2023 0:53 UTC
91 points
3 comments33 min readLW link
(astralcodexten.substack.com)

Si­mu­late the CEO

robotelvis12 Aug 2023 0:09 UTC
23 points
5 comments5 min readLW link
(messyprogress.substack.com)

How to de­cide un­der low-stakes uncertainty

dkl911 Aug 2023 18:07 UTC
11 points
4 comments1 min readLW link
(dkl9.net)

The Pan­demic is Only Begin­ning: The Long COVID Disaster

salvatore mattera11 Aug 2023 17:36 UTC
−6 points
15 comments8 min readLW link

When dis­cussing AI risks, talk about ca­pa­bil­ities, not intelligence

Vika11 Aug 2023 13:38 UTC
116 points
7 comments3 min readLW link
(vkrakovna.wordpress.com)