Self-regulation of safety in AI research

Gordon Seidoh Worley 25 Feb 2018 23:17 UTC
12 points
6 comments 2 min read LW link

The abruptness of nuclear weapons

paulfchristiano 25 Feb 2018 17:40 UTC
47 points
35 comments 2 min read LW link

Likelihood of discontinuous progress around the development of AGI

vedevazz 25 Feb 2018 15:13 UTC
4 points
2 comments 1 min read LW link
(aiimpacts.org)

Open-Source Monasticism

Nathan Rosquist 25 Feb 2018 13:52 UTC
26 points
7 comments 4 min read LW link

Passing Troll Bridge

Diffractor 25 Feb 2018 8:21 UTC
11 points
2 comments 10 min read LW link

Three Miniatures

alkjash 25 Feb 2018 5:40 UTC
22 points
12 comments 3 min read LW link
(radimentary.wordpress.com)

Arguments about fast takeoff

paulfchristiano 25 Feb 2018 4:53 UTC
92 points
66 comments 2 min read LW link 1 review
(sideways-view.com)

Meta-tations on Moderation: Towards Public Archipelago

Raemon 25 Feb 2018 3:59 UTC
78 points
176 comments 23 min read LW link

Lessons from the Cold War on Information Hazards: Why Internal Communication is Critical

Gentzel 24 Feb 2018 23:34 UTC
47 points
10 comments 4 min read LW link

What we talk about when we talk about maximising utility

Richard_Ngo 24 Feb 2018 22:33 UTC
14 points
18 comments 4 min read LW link

Links with underscores

ShardPhoenix 24 Feb 2018 11:32 UTC
2 points
3 comments 1 min read LW link

A useful level distinction

Charlie Steiner 24 Feb 2018 6:39 UTC
8 points
4 comments 2 min read LW link

CoZE 2

alkjash 24 Feb 2018 5:40 UTC
16 points
7 comments 2 min read LW link
(radimentary.wordpress.com)

On Building Theories of History

Samo Burja 23 Feb 2018 23:40 UTC
30 points
20 comments 5 min read LW link

Mythic Mode

Valentine 23 Feb 2018 22:45 UTC
68 points
82 comments 9 min read LW link

The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

Gordon Seidoh Worley 23 Feb 2018 21:42 UTC
5 points
8 comments 1 min read LW link
(arxiv.org)

Two types of mathematician

drossbucket 23 Feb 2018 19:26 UTC
64 points
41 comments 4 min read LW link

June 2012: 0/33 Turing Award winners predict computers beating humans at go within next 10 years.

betterthanwell 23 Feb 2018 11:25 UTC
18 points
13 comments 2 min read LW link

Design 2

alkjash 23 Feb 2018 6:20 UTC
18 points
17 comments 3 min read LW link
(radimentary.wordpress.com)

AI Alignment and Phenomenal Consciousness

Gordon Seidoh Worley 23 Feb 2018 1:21 UTC
9 points
0 comments 6 min read LW link
(mapandterritory.org)

Explanation vs Rationalization

abramdemski 22 Feb 2018 23:46 UTC
16 points
11 comments 4 min read LW link

The map has gears. They don’t always turn.

abramdemski 22 Feb 2018 20:16 UTC
24 points
0 comments 1 min read LW link

The Intelligent Social Web

Valentine 22 Feb 2018 18:55 UTC
229 points
112 comments 12 min read LW link 2 reviews

The Three Stages Of Model Development

katerinjo 22 Feb 2018 14:33 UTC
17 points
7 comments 2 min read LW link

Pain, fear, sex, and higher order preferences

Stuart_Armstrong 22 Feb 2018 11:30 UTC
5 points
3 comments 1 min read LW link

TAPs 2

alkjash 22 Feb 2018 5:10 UTC
25 points
5 comments 3 min read LW link
(radimentary.wordpress.com)

Robustness to Scale

Scott Garrabrant 21 Feb 2018 22:55 UTC
129 points
23 comments 2 min read LW link 1 review

Don’t Condition on no Catastrophes

Scott Garrabrant 21 Feb 2018 21:50 UTC
37 points
7 comments 2 min read LW link

The Logic of Science: 2.2

mpr 21 Feb 2018 17:28 UTC
9 points
3 comments 1 min read LW link
(pulsarcoffee.com)

Yoda Timers 2

alkjash 21 Feb 2018 7:40 UTC
28 points
26 comments 3 min read LW link
(radimentary.wordpress.com)

Are you the rider or the elephant?

Qiaochu_Yuan 21 Feb 2018 7:25 UTC
36 points
68 comments 1 min read LW link

How to not talk about probability estimates

boggler 21 Feb 2018 3:19 UTC
7 points
4 comments 2 min read LW link

Shit rationalists say − 2018

ChristianKl 20 Feb 2018 21:26 UTC
16 points
22 comments 1 min read LW link

Sex, Lies, and Dexamethasone

Jacob Falkovich 20 Feb 2018 19:56 UTC
15 points
1 comment 9 min read LW link

Why we want unbiased learning processes

Stuart_Armstrong 20 Feb 2018 15:10 UTC
0 points
0 comments 2 min read LW link

Why we want unbiased learning processes

Stuart_Armstrong 20 Feb 2018 14:48 UTC
13 points
3 comments 3 min read LW link

Bug Hunt 2

alkjash 20 Feb 2018 5:00 UTC
35 points
16 comments 3 min read LW link
(radimentary.wordpress.com)

Formally Stating the AI Alignment Problem

Gordon Seidoh Worley 19 Feb 2018 19:06 UTC
14 points
0 comments 13 min read LW link
(mapandterritory.org)

Usernames in RSS feeds

benwr 19 Feb 2018 2:22 UTC
1 point
0 comments 1 min read LW link

An alternative way to browse LessWrong 2.0

saturn 19 Feb 2018 2:10 UTC
17 points
6 comments 1 min read LW link

An alternative way to browse LessWrong 2.0

clone of saturn 19 Feb 2018 1:52 UTC
48 points
58 comments 1 min read LW link

Whose reasoning can you rely on when your own is faulty?

weft 18 Feb 2018 22:41 UTC
29 points
1 comment 2 min read LW link

A Simple Motto

katerinjo 18 Feb 2018 18:12 UTC
11 points
18 comments 1 min read LW link

[Meta] New moderation tools and moderation guidelines

habryka 18 Feb 2018 3:22 UTC
42 points
74 comments 2 min read LW link

Is skilled hunting unethical?

JamesFaville 17 Feb 2018 18:48 UTC
6 points
18 comments 12 min read LW link

Missives from China

alkjash 17 Feb 2018 12:30 UTC
12 points
4 comments 1 min read LW link
(radimentary.wordpress.com)

In Defence of Conflict Theory

Richard_Ngo 17 Feb 2018 3:33 UTC
34 points
10 comments 7 min read LW link
(thinkingcomplete.blogspot.co.uk)

Replacing expensive costly signals

KatjaGrace 17 Feb 2018 0:50 UTC
30 points
13 comments 1 min read LW link
(meteuphoric.wordpress.com)

Circling

Unreal 16 Feb 2018 23:26 UTC
73 points
275 comments 9 min read LW link 3 reviews

Bayes Rule Applied

Gordon Seidoh Worley 16 Feb 2018 18:30 UTC
4 points
0 comments 1 min read LW link
(towardsdatascience.com)