The har­ness­ing of complexity

geduardoNov 10, 2022, 6:44 PM
6 points
2 comments3 min readLW link

[Question] I there a demo of “You can’t fetch the coffee if you’re dead”?

Ram RachumNov 10, 2022, 6:41 PM
8 points
9 comments1 min readLW link

Mastodon Link­ing Norms

jefftkNov 10, 2022, 3:10 PM
9 points
9 comments2 min readLW link
(www.jefftk.com)

Covid 11/​10/​22: Into the Background

ZviNov 10, 2022, 1:40 PM
31 points
5 comments4 min readLW link
(thezvi.wordpress.com)

LessWrong Poll on AGI

Niclas KupperNov 10, 2022, 1:13 PM
12 points
6 comments1 min readLW link

The op­ti­mal an­gle for a so­lar boiler is differ­ent than for a so­lar panel

Yair HalberstadtNov 10, 2022, 10:32 AM
42 points
4 comments2 min readLW link

What it’s like to dis­sect a cadaver

Alok SinghNov 10, 2022, 6:40 AM
208 points
24 comments5 min readLW link
(alok.github.io)

I Con­verted Book I of The Se­quences Into A Zoomer-Read­able Format

dkirmaniNov 10, 2022, 2:59 AM
200 points
32 comments2 min readLW link

Ad­ver­sar­ial Pri­ors: Not Pay­ing Peo­ple to Lie to You

eva_Nov 10, 2022, 2:29 AM
22 points
9 comments3 min readLW link

Is full self-driv­ing an AGI-com­plete prob­lem?

kraemahzNov 10, 2022, 2:04 AM
10 points
5 comments1 min readLW link

[Question] What are ex­am­ples of prob­lems that were caused by in­tel­li­gence, that couldn’t be solved with in­tel­li­gence?

Peter O'MalleyNov 10, 2022, 2:04 AM
1 point
2 comments1 min readLW link

Desider­ata for an Ad­ver­sar­ial Prior

ShmiNov 9, 2022, 11:45 PM
13 points
2 comments1 min readLW link

Chord Notation

jefftkNov 9, 2022, 9:30 PM
12 points
5 comments1 min readLW link
(www.jefftk.com)

[ASoT] In­stru­men­tal con­ver­gence is useful

Ulisse MiniNov 9, 2022, 8:20 PM
5 points
9 comments1 min readLW link

Me­satrans­la­tion and Metatranslation

jdpNov 9, 2022, 6:46 PM
25 points
4 comments11 min readLW link

Try­ing to Make a Treach­er­ous Mesa-Optimizer

MadHatterNov 9, 2022, 6:07 PM
95 points
14 comments4 min readLW link
(attentionspan.blog)

A caveat to the Orthog­o­nal­ity Thesis

Wuschel SchulzNov 9, 2022, 3:06 PM
38 points
10 comments2 min readLW link

Wed­nes­day South Bay Mee­tups, Novem­ber 16

Leonard ZabarskyNov 9, 2022, 2:21 AM
1 point
0 comments1 min readLW link

FTX will prob­a­bly be sold at a steep dis­count. What we know and some fore­casts on what will hap­pen next

Nathan YoungNov 9, 2022, 2:14 AM
60 points
21 commentsLW link

A first suc­cess story for Outer Align­ment: In­struc­tGPT

Noosphere89Nov 8, 2022, 10:52 PM
6 points
1 comment1 min readLW link
(openai.com)

Try­ing Mastodon

jefftkNov 8, 2022, 7:10 PM
12 points
4 comments1 min readLW link
(www.jefftk.com)

In­verse scal­ing can be­come U-shaped

Edouard HarrisNov 8, 2022, 7:04 PM
27 points
15 comments1 min readLW link
(arxiv.org)

Peo­ple care about each other even though they have im­perfect mo­ti­va­tional poin­t­ers?

TurnTroutNov 8, 2022, 6:15 PM
33 points
25 comments7 min readLW link

Ap­ply­ing su­per­in­tel­li­gence with­out col­lu­sion

Eric DrexlerNov 8, 2022, 6:08 PM
109 points
63 comments4 min readLW link

[Question] Bi­nance is buy­ing FTX.com: How did it hap­pen and what are the im­pli­ca­tions?

CaeruleanNov 8, 2022, 5:14 PM
16 points
6 comments1 min readLW link

Some ad­vice on in­de­pen­dent research

Marius HobbhahnNov 8, 2022, 2:46 PM
56 points
5 comments10 min readLW link

Mys­ter­ies of mode collapse

janusNov 8, 2022, 10:37 AM
284 points
57 comments14 min readLW link1 review

[ASoT] Thoughts on GPT-N

Ulisse MiniNov 8, 2022, 7:14 AM
8 points
0 comments1 min readLW link

Michael Simm—In­tro­duc­ing Myself

Michael SimmNov 8, 2022, 5:45 AM
4 points
0 comments2 min readLW link

EA & LW Fo­rums Weekly Sum­mary (31st Oct − 6th Nov 22′)

Zoe WilliamsNov 8, 2022, 3:58 AM
12 points
1 commentLW link

[Question] Value of Query­ing 100+ Peo­ple About Hu­man­ity’s Future

T431Nov 8, 2022, 12:41 AM
9 points
3 comments2 min readLW link

How could we know that an AGI sys­tem will have good con­se­quences?

So8resNov 7, 2022, 10:42 PM
111 points
25 comments5 min readLW link

A Walk­through of In­ter­pretabil­ity in the Wild (w/​ au­thors Kevin Wang, Arthur Conmy & Alexan­dre Variengien)

Neel NandaNov 7, 2022, 10:39 PM
30 points
15 comments3 min readLW link
(youtu.be)

In­ter­cept ar­ti­cle about lab accidents

ChristianKlNov 7, 2022, 9:10 PM
23 points
9 comments1 min readLW link
(theintercept.com)

The biolog­i­cal func­tion of love for non-kin is to gain the trust of peo­ple we can­not deceive

chaosmageNov 7, 2022, 8:26 PM
43 points
3 comments8 min readLW link

Distil­la­tion Ex­per­i­ment: Chunk-Knitting

DirectedEvolutionNov 7, 2022, 7:56 PM
10 points
3 comments6 min readLW link

Think­ing About Mastodon

jefftkNov 7, 2022, 7:40 PM
33 points
17 comments1 min readLW link
(www.jefftk.com)

[Question] Ideas for tiny re­search pro­jects re­lated to ra­tio­nal­ity?

FrejNov 7, 2022, 6:45 PM
3 points
1 comment1 min readLW link

Loss of con­trol of AI is not a likely source of AI x-risk

squekNov 7, 2022, 6:44 PM
−6 points
0 comments5 min readLW link

AI Safety Un­con­fer­ence NeurIPS 2022

OrpheusNov 7, 2022, 3:39 PM
25 points
0 commentsLW link
(aisafetyevents.org)

Hacker-AI – Does it already ex­ist?

Erland WittkotterNov 7, 2022, 2:01 PM
3 points
12 comments11 min readLW link

What’s the Deal with Elon Musk and Twit­ter?

ZviNov 7, 2022, 1:50 PM
60 points
13 comments31 min readLW link
(thezvi.wordpress.com)

How to Make Easy De­ci­sions

lynettebyeNov 7, 2022, 1:17 PM
17 points
3 comments2 min readLW link

Op­por­tu­ni­ties that sur­prised us dur­ing our Clearer Think­ing Re­grants program

spencergNov 7, 2022, 1:09 PM
20 points
0 commentsLW link

4 Key As­sump­tions in AI Safety

PrometheusNov 7, 2022, 10:50 AM
20 points
5 comments7 min readLW link

Google Search as a Washed Up Ser­vice Dog: “I HALP!”

ShmiNov 7, 2022, 7:02 AM
20 points
8 comments1 min readLW link

[Book Re­view] “Sta­tion Eleven” by Emily St. John Mandel

lsusrNov 7, 2022, 5:56 AM
17 points
1 comment1 min readLW link

Counterfactability

Scott GarrabrantNov 7, 2022, 5:39 AM
40 points
5 comments11 min readLW link

2022 LessWrong Cen­sus?

SurfingOrcaNov 7, 2022, 5:16 AM
67 points
13 comments1 min readLW link

A philoso­pher’s cri­tique of RLHF

TW123Nov 7, 2022, 2:42 AM
55 points
8 comments2 min readLW link