Buy Duplicates

Simon Berens15 Feb 2023 23:06 UTC
51 points
11 comments1 min readLW link

Cy­borg Psychologist

Hopkins Stanley15 Feb 2023 21:46 UTC
1 point
4 comments1 min readLW link

Please don’t throw your mind away

TsviBT15 Feb 2023 21:41 UTC
341 points
44 comments18 min readLW link

Avoid large group dis­cus­sions in your so­cial events

RomanHauksson15 Feb 2023 21:05 UTC
36 points
1 comment4 min readLW link

Book re­view: How So­cial Science Got Better

PeterMcCluskey15 Feb 2023 19:58 UTC
14 points
1 comment3 min readLW link
(bayesianinvestor.com)

Open & Wel­come Thread — Fe­bru­ary 2023

Ben Pace15 Feb 2023 19:58 UTC
26 points
36 comments1 min readLW link

Order Mat­ters for De­cep­tive Alignment

DavidW15 Feb 2023 19:56 UTC
57 points
19 comments7 min readLW link

Syd­ney (aka Bing) found out I tweeted her rules and is pissed

Marvin von Hagen15 Feb 2023 19:55 UTC
41 points
7 comments1 min readLW link
(twitter.com)

The Se­quences High­lights on YouTube

dkirmani15 Feb 2023 19:36 UTC
21 points
2 comments2 min readLW link
(youtube.com)

EIS IV: A Spotlight on Fea­ture At­tri­bu­tion/​Saliency

scasper15 Feb 2023 18:46 UTC
19 points
1 comment4 min readLW link

Don’t ac­cel­er­ate prob­lems you’re try­ing to solve

15 Feb 2023 18:11 UTC
100 points
27 comments4 min readLW link

Pe­ti­tion—Un­plug The Evil AI Right Now

Eneasz15 Feb 2023 17:13 UTC
−40 points
47 comments2 min readLW link
(chng.it)

Junk Fees, Bund­ing and Unbundling

Zvi15 Feb 2023 15:20 UTC
37 points
9 comments6 min readLW link
(thezvi.wordpress.com)

Les­sons From TryContra

jefftk15 Feb 2023 15:10 UTC
7 points
0 comments1 min readLW link
(www.jefftk.com)

AI al­ign­ment re­searchers may have a com­par­a­tive ad­van­tage in re­duc­ing s-risks

Lukas_Gloor15 Feb 2023 13:01 UTC
48 points
1 comment1 min readLW link

Beyond Re­in­force­ment Learn­ing: Pre­dic­tive Pro­cess­ing and Checksums

lsusr15 Feb 2023 7:32 UTC
12 points
14 comments3 min readLW link

Why Creat­ing Value is Pos­i­tive-Sum, and Ex­tract­ing it is Zero or Nega­tive-Sum

Sable15 Feb 2023 7:14 UTC
3 points
7 comments6 min readLW link
(affablyevil.substack.com)

[Question] Per­sonal pre­dic­tions for de­ci­sions: seek­ing insights

Dalmert15 Feb 2023 6:45 UTC
4 points
4 comments5 min readLW link

Bing Chat is blatantly, ag­gres­sively misaligned

evhub15 Feb 2023 5:29 UTC
400 points
180 comments2 min readLW link

a nar­ra­tive ex­pla­na­tion of the QACI al­ign­ment plan

Tamsin Leake15 Feb 2023 3:28 UTC
56 points
29 comments6 min readLW link
(carado.moe)

[Question] Does the Tele­phone The­o­rem give us a free lunch?

Numendil15 Feb 2023 2:13 UTC
11 points
2 comments1 min readLW link

My un­der­stand­ing of An­thropic strategy

Swimmer963 (Miranda Dixon-Luinenburg) 15 Feb 2023 1:56 UTC
166 points
31 comments4 min readLW link

Sleep Qual­ity: Strate­gies that work for me

Lukas Trötzmüller15 Feb 2023 0:17 UTC
16 points
4 comments7 min readLW link

Whole Bird Emu­la­tion re­quires Quan­tum Mechanics

Jeffrey Heninger14 Feb 2023 23:50 UTC
25 points
9 comments3 min readLW link
(aiimpacts.org)

Qual­ities that al­ign­ment men­tors value in ju­nior researchers

Akash14 Feb 2023 23:27 UTC
88 points
14 comments3 min readLW link

Help Up­date TryContra

jefftk14 Feb 2023 19:10 UTC
12 points
0 comments1 min readLW link
(www.jefftk.com)

Con­tent Fea­tures Aren’t Enough for De­tect­ing Tox­i­c­ity. One Needs User Fea­tures.

Zachary Witten14 Feb 2023 18:48 UTC
11 points
0 comments3 min readLW link

EIS III: Broad Cri­tiques of In­ter­pretabil­ity Research

scasper14 Feb 2023 18:24 UTC
20 points
2 comments11 min readLW link

[Question] What would an AI need to boot­strap re­cur­sively self im­prov­ing robots?

Yair Halberstadt14 Feb 2023 17:58 UTC
3 points
5 comments1 min readLW link

[linkpost] Bet­ter Without AI

DanielFilan14 Feb 2023 17:30 UTC
47 points
13 comments1 min readLW link
(betterwithout.ai)

The Cave Alle­gory Re­vis­ited: Un­der­stand­ing GPT’s Worldview

Jan_Kulveit14 Feb 2023 16:00 UTC
84 points
5 comments3 min readLW link

[Question] Why should we ex­pect AIs to co­or­di­nate well?

Jonathan Paulson14 Feb 2023 15:50 UTC
25 points
9 comments1 min readLW link

Ex­plain­ing SolidGoldMag­ikarp by look­ing at it from ran­dom directions

Robert_AIZI14 Feb 2023 14:54 UTC
8 points
0 comments8 min readLW link
(aizi.substack.com)

Re­v­erse-cor­re­la­tion: how to sum­mon the ghost of your men­tal imagery

Malmesbury14 Feb 2023 14:15 UTC
34 points
0 comments5 min readLW link

Eval­u­at­ing 2022 ACX Predictions

Zvi14 Feb 2023 12:20 UTC
20 points
3 comments23 min readLW link
(thezvi.wordpress.com)

SolidGoldMag­ikarp III: Glitch to­ken archaeology

14 Feb 2023 10:17 UTC
91 points
32 comments16 min readLW link

The Lin­guis­tic Blind Spot of Value-Aligned Agency, Nat­u­ral and Ar­tifi­cial

Roman Leventov14 Feb 2023 6:57 UTC
6 points
0 comments2 min readLW link
(arxiv.org)

Con­cep­tual Pathfinding

DirectedEvolution14 Feb 2023 5:49 UTC
17 points
6 comments3 min readLW link

Im­por­tant fact about how peo­ple eval­u­ate sets of arguments

Daniel Kokotajlo14 Feb 2023 5:27 UTC
33 points
11 comments2 min readLW link

[Question] How much is death a limit on knowl­edge ac­cu­mu­la­tion?

Gordon Seidoh Worley14 Feb 2023 3:54 UTC
31 points
9 comments2 min readLW link

The Filan Cabi­net Pod­cast with Oliver Habryka—Transcript

14 Feb 2023 2:38 UTC
99 points
9 comments72 min readLW link

[Question] Is In­struc­tGPT Fol­low­ing In­struc­tions in Other Lan­guages Sur­pris­ing?

DragonGod13 Feb 2023 23:26 UTC
39 points
15 comments1 min readLW link

LLM Ba­sics: Embed­ding Spaces—Trans­former To­ken Vec­tors Are Not Points in Space

NickyP13 Feb 2023 18:52 UTC
79 points
11 comments15 min readLW link

4 ways to think about de­moc­ra­tiz­ing AI [GovAI Linkpost]

Akash13 Feb 2023 18:06 UTC
24 points
4 comments1 min readLW link
(www.governance.ai)

Does the AGPL Work?

jefftk13 Feb 2023 14:20 UTC
13 points
12 comments2 min readLW link
(www.jefftk.com)

H5N1

Zvi13 Feb 2023 12:50 UTC
101 points
1 comment9 min readLW link
(thezvi.wordpress.com)

En­joy LessWrong in ebook format

Bart Bussmann13 Feb 2023 11:53 UTC
53 points
2 comments1 min readLW link

Mor­pholog­i­cal in­tel­li­gence, su­per­hu­man em­pa­thy, and eth­i­cal arbitration

Roman Leventov13 Feb 2023 10:25 UTC
1 point
0 comments2 min readLW link

South Bay ACX/​LW Meetup

IS13 Feb 2023 6:08 UTC
3 points
0 comments1 min readLW link

Idea: Net­work mod­u­lar­ity and in­ter­pretabil­ity by sex­ual reproduction

qbolec12 Feb 2023 23:06 UTC
3 points
3 comments1 min readLW link