Inner alignment requires making assumptions about human values

Matthew Barnett · 20 Jan 2020 18:38 UTC
26 points
9 comments · 4 min read · LW link

Workshop on Assured Autonomous Systems (WAAS)

Aryeh Englander · 20 Jan 2020 16:21 UTC
2 points
0 comments · 1 min read · LW link

Why Do You Keep Having This Problem?

Davis_Kingsley · 20 Jan 2020 8:33 UTC
47 points
16 comments · 1 min read · LW link

[Question] Use-cases for computations, other than running them?

johnswentworth · 19 Jan 2020 20:52 UTC
30 points
6 comments · 2 min read · LW link

UML VII: Meta-Learning

Rafael Harth · 19 Jan 2020 18:23 UTC
14 points
0 comments · 15 min read · LW link

Adjusting Outdoor Reset

jefftk · 19 Jan 2020 18:20 UTC
1 point
0 comments · 1 min read · LW link
(www.jefftk.com)

Madison SSC Meetup: Adversarial Collaborations

marywang · 19 Jan 2020 16:47 UTC
1 point
0 comments · 1 min read · LW link

Book review: Human Compatible

PeterMcCluskey · 19 Jan 2020 3:32 UTC
37 points
2 comments · 5 min read · LW link
(www.bayesianinvestor.com)

Is NYC Building Much Housing?

jefftk · 18 Jan 2020 20:50 UTC
0 points
0 comments · 1 min read · LW link
(www.jefftk.com)

The Road to Mazedom

Zvi · 18 Jan 2020 14:10 UTC
97 points
26 comments · 7 min read · LW link · 2 reviews
(thezvi.wordpress.com)

[Question] What types of compute/processing could we distinguish?

MoritzG · 18 Jan 2020 10:04 UTC
2 points
9 comments · 1 min read · LW link

[Question] Political Roko’s basilisk

Abhimanyu Pallavi Sudhir · 18 Jan 2020 9:34 UTC
10 points
10 comments · 1 min read · LW link

Risk and uncertainty: A false dichotomy?

MichaelA · 18 Jan 2020 3:09 UTC
6 points
9 comments · 20 min read · LW link

Remote AI alignment writing group seeking new members

rmoehn · 18 Jan 2020 2:10 UTC
11 points
0 comments · 1 min read · LW link

“How quickly can you get this done?” (estimating workload)

kerspoon · 18 Jan 2020 0:10 UTC
15 points
9 comments · 4 min read · LW link

Studying Early Stage Science: Research Program Introduction

habryka · 17 Jan 2020 22:12 UTC
32 points
1 comment · 15 min read · LW link
(medium.com)

Fiddle Effects Tech

jefftk · 17 Jan 2020 17:00 UTC
2 points
0 comments · 1 min read · LW link
(www.jefftk.com)

[Question] How does a Living Being solve the problem of Subsystem Alignment?

Alan Givré · 17 Jan 2020 9:32 UTC
3 points
7 comments · 1 min read · LW link

Can we always assign, and make sense of, subjective probabilities?

MichaelA · 17 Jan 2020 3:05 UTC
11 points
15 comments · 13 min read · LW link

Against Rationalization II: Sequence Recap

dspeyer · 16 Jan 2020 22:51 UTC
6 points
2 comments · 1 min read · LW link

Using Expert Disagreement

dspeyer · 16 Jan 2020 22:42 UTC
13 points
1 comment · 5 min read · LW link

Bay Solstice 2019 Retrospective

mingyuan · 16 Jan 2020 17:15 UTC
75 points
36 comments · 15 min read · LW link

Reality-Revealing and Reality-Masking Puzzles

AnnaSalamon · 16 Jan 2020 16:15 UTC
264 points
57 comments · 13 min read · LW link · 1 review

How to Escape From Immoral Mazes

Zvi · 16 Jan 2020 13:10 UTC
79 points
21 comments · 19 min read · LW link · 1 review
(thezvi.wordpress.com)

Testing for Rationalization

dspeyer · 16 Jan 2020 8:12 UTC
19 points
0 comments · 2 min read · LW link

[Question] How useful do you think participating to the Human Microbiome Project would be?

Mati_Roy · 15 Jan 2020 23:51 UTC
4 points
0 comments · 1 min read · LW link

The Alignment-Competence Trade-Off, Part 1: Coalition Size and Signaling Costs

Gentzel · 15 Jan 2020 23:10 UTC
30 points
4 comments · 3 min read · LW link
(theconsequentialist.wordpress.com)

In Defense of the Arms Races… that End Arms Races

Gentzel · 15 Jan 2020 21:30 UTC
38 points
9 comments · 3 min read · LW link
(theconsequentialist.wordpress.com)

Fire Alarm for AGI

user134723 · 15 Jan 2020 20:41 UTC
1 point
0 comments · 1 min read · LW link
(blog.acolyer.org)

Go F*** Someone

Jacob Falkovich · 15 Jan 2020 18:39 UTC
19 points
23 comments · 8 min read · LW link

[AN #82]: How OpenAI Five distributed their training computation

Rohin Shah · 15 Jan 2020 18:20 UTC
19 points
0 comments · 8 min read · LW link
(mailchi.mp)

ACDT: a hack-y acausal decision theory

Stuart_Armstrong · 15 Jan 2020 17:22 UTC
50 points
16 comments · 7 min read · LW link

Nashville January 2020 SSC Meetup

friedelcraftiness · 15 Jan 2020 17:02 UTC
1 point
0 comments · 1 min read · LW link

In defense of deviousness

Juan Andrés Hurtado Baeza · 15 Jan 2020 11:56 UTC
12 points
8 comments · 4 min read · LW link
(medium.com)

[Question] What plausible beliefs do you think could likely get someone diagnosed with a mental illness by a psychiatrist?

Mati_Roy · 15 Jan 2020 11:13 UTC
4 points
6 comments · 1 min read · LW link

Avoiding Rationalization

dspeyer · 15 Jan 2020 10:55 UTC
15 points
0 comments · 2 min read · LW link

SSC Dublin Meetup

Dan Valentine · 15 Jan 2020 8:26 UTC
1 point
0 comments · 1 min read · LW link

[Question] What are beliefs you wouldn’t want (or would feel apprehensive about being) public if you had (or have) them?

Mati_Roy · 15 Jan 2020 5:30 UTC
6 points
17 comments · 1 min read · LW link

Reno SSC: Visitors from Out of Town

RenoSSC · 15 Jan 2020 4:35 UTC
1 point
0 comments · 1 min read · LW link

Artificial Intelligence and Life Sciences (Why Big Data is not enough to capture biological systems?)

HansNauj · 15 Jan 2020 1:59 UTC
6 points
3 comments · 6 min read · LW link

SSC HIKE—BLACK MOUNTAIN

crewman 51 · 15 Jan 2020 0:18 UTC
1 point
0 comments · 1 min read · LW link

Clarifying The Malignity of the Universal Prior: The Lexical Update

interstice · 15 Jan 2020 0:00 UTC
20 points
2 comments · 3 min read · LW link

[Question] Tips on how to promote effective altruism effectively? Less talk, more action.

culturechange · 14 Jan 2020 23:17 UTC
3 points
1 comment · 1 min read · LW link

A rant against robots

Lê Nguyên Hoang · 14 Jan 2020 22:03 UTC
65 points
7 comments · 5 min read · LW link

Is backwards causation necessarily absurd?

Chris_Leong · 14 Jan 2020 19:25 UTC
22 points
9 comments · 1 min read · LW link

Predictors exist: CDT going bonkers… forever

Stuart_Armstrong · 14 Jan 2020 16:19 UTC
46 points
31 comments · 1 min read · LW link

Austin LW/SSC Far-comers Meetup: Feb. 8, 1:30pm

jchan · 14 Jan 2020 14:46 UTC
2 points
1 comment · 1 min read · LW link

Red Flags for Rationalization

dspeyer · 14 Jan 2020 7:34 UTC
25 points
6 comments · 4 min read · LW link

Advanced Anki (Memorization Software)

Arthur Milchior · 14 Jan 2020 2:25 UTC
4 points
0 comments · 1 min read · LW link

Anki (Memorization Software) for Beginners

Arthur Milchior · 14 Jan 2020 1:55 UTC
7 points
5 comments · 1 min read · LW link