AI al­ign­ment con­cepts: philo­soph­i­cal break­ers, stop­pers, and distorters

JustinShovelain24 Jan 2020 19:23 UTC
20 points
3 comments3 min readLW link

The two-layer model of hu­man val­ues, and prob­lems with syn­the­siz­ing preferences

Kaj_Sotala24 Jan 2020 15:17 UTC
70 points
16 comments9 min readLW link

[Question] How much do we know about how brains learn?

Kenny24 Jan 2020 14:46 UTC
8 points
0 comments1 min readLW link

Epistea Sum­mer Ex­per­i­ment (ESE)

24 Jan 2020 10:49 UTC
49 points
3 comments5 min readLW link

2018 Re­view: Vot­ing Re­sults!

Ben Pace24 Jan 2020 2:00 UTC
135 points
59 comments6 min readLW link

Im­prov­ing Group De­ci­sion Making

June Ku24 Jan 2020 1:29 UTC
6 points
2 comments1 min readLW link
(youtu.be)

Emer­gency Pre­scrip­tion Medication

jefftk24 Jan 2020 1:20 UTC
12 points
2 comments2 min readLW link
(www.jefftk.com)

LW/​SSC War­saw Fe­bru­ary Meetup

kaftanowicz23 Jan 2020 19:49 UTC
2 points
0 comments1 min readLW link

[Question] The­ory of Causal Models with Dy­namic Struc­ture?

johnswentworth23 Jan 2020 19:47 UTC
24 points
6 comments1 min readLW link

To­kenis­ing hu­man ver­ifi­ca­tion in or­der to de­rive in­for­ma­tion from the re­sult­ing markets

Will Clark23 Jan 2020 19:20 UTC
3 points
4 comments3 min readLW link

New pa­per: The In­cen­tives that Shape Behaviour

RyanCarey23 Jan 2020 19:07 UTC
23 points
5 comments1 min readLW link
(arxiv.org)

For­mu­lat­ing Re­duc­tive Agency in Causal Models

johnswentworth23 Jan 2020 17:03 UTC
33 points
0 comments2 min readLW link

Cas­sette Tape Thoughts

Elizabeth22 Jan 2020 22:50 UTC
48 points
0 comments2 min readLW link
(acesounderglass.com)

Con­cerns Sur­round­ing CEV: A case for hu­man friendli­ness first

ai-crotes22 Jan 2020 21:03 UTC
1 point
19 comments1 min readLW link

(A → B) → A in Causal DAGs

johnswentworth22 Jan 2020 18:22 UTC
48 points
11 comments2 min readLW link

[AN #83]: Sam­ple-effi­cient deep learn­ing with ReMixMatch

Rohin Shah22 Jan 2020 18:10 UTC
15 points
4 comments11 min readLW link
(mailchi.mp)

[Question] Terms & liter­a­ture for pur­posely lossy communication

ozziegooen22 Jan 2020 10:35 UTC
14 points
6 comments1 min readLW link

Three signs you may be suffer­ing from im­poster syndrome

lc21 Jan 2020 22:17 UTC
9 points
7 comments2 min readLW link

Log­i­cal Rep­re­sen­ta­tion of Causal Models

johnswentworth21 Jan 2020 20:04 UTC
37 points
0 comments3 min readLW link

Disasters

jefftk21 Jan 2020 19:20 UTC
16 points
13 comments3 min readLW link
(www.jefftk.com)

Safety reg­u­la­tors: A tool for miti­gat­ing tech­nolog­i­cal risk

JustinShovelain21 Jan 2020 13:07 UTC
13 points
4 comments4 min readLW link

How Doomed are Large Or­ga­ni­za­tions?

Zvi21 Jan 2020 12:20 UTC
81 points
42 comments9 min readLW link
(thezvi.wordpress.com)

Book Re­view—The Ori­gins of Un­fair­ness: So­cial Cat­e­gories and Cul­tural Evolution

Zack_M_Davis21 Jan 2020 6:28 UTC
27 points
5 comments1 min readLW link
(unremediatedgender.space)

Whipped Cream vs Fancy Butter

jefftk21 Jan 2020 0:30 UTC
−2 points
36 comments1 min readLW link
(www.jefftk.com)

In­ner al­ign­ment re­quires mak­ing as­sump­tions about hu­man values

Matthew Barnett20 Jan 2020 18:38 UTC
26 points
9 comments4 min readLW link

Work­shop on As­sured Au­tonomous Sys­tems (WAAS)

Aryeh Englander20 Jan 2020 16:21 UTC
2 points
0 comments1 min readLW link

Why Do You Keep Hav­ing This Prob­lem?

Davis_Kingsley20 Jan 2020 8:33 UTC
47 points
16 comments1 min readLW link

[Question] Use-cases for com­pu­ta­tions, other than run­ning them?

johnswentworth19 Jan 2020 20:52 UTC
30 points
6 comments2 min readLW link

UML VII: Meta-Learning

Rafael Harth19 Jan 2020 18:23 UTC
14 points
0 comments15 min readLW link

Ad­just­ing Out­door Reset

jefftk19 Jan 2020 18:20 UTC
1 point
0 comments1 min readLW link
(www.jefftk.com)

Madi­son SSC Meetup: Ad­ver­sar­ial Collaborations

marywang19 Jan 2020 16:47 UTC
1 point
0 comments1 min readLW link

Book re­view: Hu­man Compatible

PeterMcCluskey19 Jan 2020 3:32 UTC
37 points
2 comments5 min readLW link
(www.bayesianinvestor.com)

Is NYC Build­ing Much Hous­ing?

jefftk18 Jan 2020 20:50 UTC
0 points
0 comments1 min readLW link
(www.jefftk.com)

The Road to Mazedom

Zvi18 Jan 2020 14:10 UTC
97 points
26 comments7 min readLW link2 reviews
(thezvi.wordpress.com)

[Question] What types of com­pute/​pro­cess­ing could we dis­t­in­guish?

MoritzG18 Jan 2020 10:04 UTC
2 points
9 comments1 min readLW link

[Question] Poli­ti­cal Roko’s basilisk

Abhimanyu Pallavi Sudhir18 Jan 2020 9:34 UTC
10 points
10 comments1 min readLW link

Risk and un­cer­tainty: A false di­chotomy?

MichaelA18 Jan 2020 3:09 UTC
6 points
9 comments20 min readLW link

Re­mote AI al­ign­ment writ­ing group seek­ing new members

rmoehn18 Jan 2020 2:10 UTC
11 points
0 comments1 min readLW link

“How quickly can you get this done?” (es­ti­mat­ing work­load)

kerspoon18 Jan 2020 0:10 UTC
15 points
9 comments4 min readLW link

Study­ing Early Stage Science: Re­search Pro­gram Introduction

habryka17 Jan 2020 22:12 UTC
32 points
1 comment15 min readLW link
(medium.com)

Fid­dle Effects Tech

jefftk17 Jan 2020 17:00 UTC
2 points
0 comments1 min readLW link
(www.jefftk.com)

[Question] How does a Liv­ing Be­ing solve the prob­lem of Sub­sys­tem Align­ment?

Alan Givré17 Jan 2020 9:32 UTC
3 points
7 comments1 min readLW link

Can we always as­sign, and make sense of, sub­jec­tive prob­a­bil­ities?

MichaelA17 Jan 2020 3:05 UTC
11 points
15 comments13 min readLW link

Against Ra­tion­al­iza­tion II: Se­quence Recap

dspeyer16 Jan 2020 22:51 UTC
6 points
2 comments1 min readLW link

Us­ing Ex­pert Disagreement

dspeyer16 Jan 2020 22:42 UTC
13 points
1 comment5 min readLW link

Bay Sols­tice 2019 Retrospective

mingyuan16 Jan 2020 17:15 UTC
75 points
36 comments15 min readLW link

Real­ity-Re­veal­ing and Real­ity-Mask­ing Puzzles

AnnaSalamon16 Jan 2020 16:15 UTC
264 points
57 comments13 min readLW link1 review

How to Es­cape From Im­moral Mazes

Zvi16 Jan 2020 13:10 UTC
79 points
21 comments19 min readLW link1 review
(thezvi.wordpress.com)

Test­ing for Rationalization

dspeyer16 Jan 2020 8:12 UTC
19 points
0 comments2 min readLW link

[Question] How use­ful do you think par­ti­ci­pat­ing to the Hu­man Micro­biome Pro­ject would be?

Mati_Roy15 Jan 2020 23:51 UTC
4 points
0 comments1 min readLW link