(OLD) An Ex­tremely Opinionated An­no­tated List of My Favourite Mechanis­tic In­ter­pretabil­ity Papers

Neel Nanda18 Oct 2022 21:08 UTC
72 points
5 comments12 min readLW link
(www.neelnanda.io)

Distil­led Rep­re­sen­ta­tions Re­search Agenda

18 Oct 2022 20:59 UTC
15 points
2 comments8 min readLW link

Draft­ing a Covid Survey

jefftk18 Oct 2022 19:30 UTC
15 points
2 comments2 min readLW link
(www.jefftk.com)

How To Make Pre­dic­tion Mar­kets Use­ful For Align­ment Work

johnswentworth18 Oct 2022 19:01 UTC
97 points
18 comments2 min readLW link

A con­ver­sa­tion about Katja’s coun­ter­ar­gu­ments to AI risk

18 Oct 2022 18:40 UTC
43 points
9 comments33 min readLW link

ACX Zurich Oc­to­ber Meetup

MB18 Oct 2022 18:24 UTC
1 point
1 comment1 min readLW link

Un­tapped Po­ten­tial at 13-18

belkarx18 Oct 2022 18:09 UTC
82 points
53 comments1 min readLW link

[Question] How easy is it to su­per­vise pro­cesses vs out­comes?

Noosphere8918 Oct 2022 17:48 UTC
3 points
0 comments1 min readLW link

Is GitHub Copi­lot in le­gal trou­ble?

tcelferact18 Oct 2022 16:19 UTC
35 points
2 comments1 min readLW link

Me­tac­u­lus is build­ing a team ded­i­cated to AI forecasting

ChristianWilliams18 Oct 2022 16:08 UTC
3 points
0 comments1 min readLW link

How to Take Over the Uni­verse (in Three Easy Steps)

Writer18 Oct 2022 15:04 UTC
47 points
17 comments12 min readLW link
(youtu.be)

Science of Deep Learn­ing—a tech­ni­cal agenda

Marius Hobbhahn18 Oct 2022 14:54 UTC
36 points
7 comments4 min readLW link

My search for a re­li­able breakfast

tomdekan18 Oct 2022 9:42 UTC
6 points
17 comments3 min readLW link
(www.tomdekan.com)

In­finite Pos­si­bil­ity Space and the Shut­down Problem

magfrump18 Oct 2022 5:37 UTC
6 points
0 comments2 min readLW link
(www.magfrump.net)

Au­di­tion to perform in Bay Sec­u­lar Solstice

mingyuan18 Oct 2022 3:10 UTC
25 points
3 comments1 min readLW link

De­ci­sion the­ory does not im­ply that we get to have nice things

So8res18 Oct 2022 3:04 UTC
170 points
72 comments26 min readLW link2 reviews

South Bay ACX/​LW Meetup

IS18 Oct 2022 2:50 UTC
2 points
1 comment1 min readLW link

Ver­ti­cal Flutes

jefftk18 Oct 2022 1:40 UTC
10 points
4 comments1 min readLW link
(www.jefftk.com)

Why Weren’t Hot Air Bal­loons In­vented Sooner?

Lost Futures18 Oct 2022 0:41 UTC
115 points
52 comments6 min readLW link
(lostfutures.substack.com)

Is GPT-N bounded by hu­man ca­pa­bil­ities? No.

Cleo Nardo17 Oct 2022 23:26 UTC
48 points
8 comments2 min readLW link

EA & LW Fo­rums Weekly Sum­mary (10 − 16 Oct 22′)

Zoe Williams17 Oct 2022 22:51 UTC
12 points
4 comments1 min readLW link

A prag­matic met­ric for Ar­tifi­cial Gen­eral Intelligence

lorepieri17 Oct 2022 22:07 UTC
6 points
0 comments1 min readLW link
(lorenzopieri.com)

They gave LLMs ac­cess to physics simulators

ryan_b17 Oct 2022 21:21 UTC
50 points
18 comments1 min readLW link
(arxiv.org)

Com­bat­ting perfectionism

tomdekan17 Oct 2022 20:58 UTC
6 points
0 comments2 min readLW link
(tomdekan.com)

Open Prob­lem in Vot­ing Theory

Scott Garrabrant17 Oct 2022 20:42 UTC
75 points
16 comments6 min readLW link

Max­i­mal Lot­tery-Lotteries

Scott Garrabrant17 Oct 2022 20:39 UTC
72 points
15 comments4 min readLW link

[Question] Creat­ing su­per­in­tel­li­gence with­out AGI

Antb17 Oct 2022 19:01 UTC
7 points
3 comments1 min readLW link

AI Safety Ideas: An Open AI Safety Re­search Platform

Esben Kran17 Oct 2022 17:01 UTC
24 points
0 comments1 min readLW link

ACX/​SSC/​Ra­tion­al­ist Meetup Madi­son 10/​22

svfritz17 Oct 2022 16:06 UTC
1 point
0 comments1 min readLW link

Balsa FAQ

Zvi17 Oct 2022 12:40 UTC
19 points
5 comments1 min readLW link
(thezvi.wordpress.com)

Max­i­mal Lotteries

Scott Garrabrant17 Oct 2022 8:54 UTC
75 points
11 comments7 min readLW link

Vot­ing The­ory Introduction

Scott Garrabrant17 Oct 2022 8:48 UTC
80 points
8 comments6 min readLW link

Space

Jarred Filmer17 Oct 2022 6:34 UTC
47 points
0 comments3 min readLW link

GD’s Im­plicit Bias on Separable Data

Xander Davies17 Oct 2022 4:13 UTC
25 points
0 comments7 min readLW link

Cal­ling my First Fam­ily Dance

jefftk17 Oct 2022 2:40 UTC
6 points
0 comments3 min readLW link
(www.jefftk.com)

The harms you don’t see

ViktoriaMalyasova16 Oct 2022 23:45 UTC
63 points
54 comments10 min readLW link

Max­i­mal lot­ter­ies for value learning

ViktoriaMalyasova16 Oct 2022 23:44 UTC
17 points
1 comment5 min readLW link

Pop­u­lar Per­sonal Fi­nan­cial Ad­vice ver­sus the Pro­fes­sors (James Choi, NBER)

BrownHairedEevee16 Oct 2022 22:21 UTC
17 points
5 comments2 min readLW link
(spinup-000d1a-wp-offload-media.s3.amazonaws.com)

Life, Death, and Fi­nance in the Cos­mic Mul­ti­verse

peterb16 Oct 2022 18:57 UTC
2 points
1 comment1 min readLW link

[Question] Sig­nifi­cance of the Lan­guage of Thought Hy­poth­e­sis?

DrFlaggstaff16 Oct 2022 18:09 UTC
1 point
3 comments1 min readLW link

Luck based medicine: my re­sent­ful story of be­com­ing a med­i­cal miracle

Elizabeth16 Oct 2022 17:40 UTC
483 points
121 comments12 min readLW link3 reviews
(acesounderglass.com)

Age changes what you care about

Dentin16 Oct 2022 15:36 UTC
141 points
37 comments2 min readLW link

Hal­i­fax, NS – Monthly Ra­tion­al­ist, EA, and ACX Meetup Kick-Off

Ideopunk16 Oct 2022 13:17 UTC
10 points
0 comments1 min readLW link

Cruxes in Katja Grace’s Counterarguments

azsantosk16 Oct 2022 8:44 UTC
16 points
0 comments7 min readLW link

Build­ing the Loft Beds

jefftk16 Oct 2022 1:10 UTC
10 points
4 comments1 min readLW link
(www.jefftk.com)

[Question] Best re­source to go from “typ­i­cal smart tech-savvy per­son” to “per­son who gets AGI risk ur­gency”?

Liron15 Oct 2022 22:26 UTC
16 points
8 comments1 min readLW link

Bounded dis­trust or Bounded trust?

M. Y. Zuo15 Oct 2022 16:41 UTC
2 points
12 comments3 min readLW link

I learn bet­ter when I frame learn­ing as Vengeance for losses in­curred through ig­no­rance, and you might too

chaosmage15 Oct 2022 12:41 UTC
81 points
9 comments3 min readLW link1 review

James Nor­ris from Upgrad­able on “What is Beyond Liv­ing a Prin­ci­pled Life”—OpenPrin­ci­ples Speaker Session

ti_guo15 Oct 2022 3:27 UTC
2 points
0 comments1 min readLW link

Quick Mock Brownie

jefftk15 Oct 2022 3:00 UTC
8 points
0 comments1 min readLW link
(www.jefftk.com)