All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 201720182019 2020 2021 2022 2023 2024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 262728 29 30

Alignment Newsletter #34

Rohin ShahNov 26, 2018, 11:10 PM

24 points

0 comments10 min readLW link

(mailchi.mp)

Boltzmann Brains, Simulations and self refuting hypothesis

Donald HobsonNov 26, 2018, 7:09 PM

1 point

9 comments1 min readLW link

Quantum Mechanics, Nothing to do with Consciousness

Donald HobsonNov 26, 2018, 6:59 PM

5 points

27 comments3 min readLW link

Status model

BuckyNov 26, 2018, 3:05 PM

26 points

7 comments3 min readLW link

Humans Consulting HCH

paulfchristianoNov 25, 2018, 11:18 PM

39 points

9 comments1 min readLW link

Approval-directed bootstrapping

paulfchristianoNov 25, 2018, 11:18 PM

24 points

0 comments1 min readLW link

How rapidly are GPUs improving in price performance?

gallabytesNov 25, 2018, 7:54 PM

31 points

9 comments LW link

(mediangroup.org)

Values Weren’t Complex, Once.

DavidmanheimNov 25, 2018, 9:17 AM

36 points

13 comments2 min readLW link

A culture of exploitation?

Bae's TheoremNov 24, 2018, 10:00 PM

1 point

3 comments1 min readLW link

Fixed Point Discussion

Scott GarrabrantNov 24, 2018, 8:53 PM

45 points

2 comments4 min readLW link

Four factors that moderate the intensity of emotions

RubyNov 24, 2018, 8:40 PM

63 points

11 comments8 min readLW link

deluks917 on Online Weirdos

Jacob FalkovichNov 24, 2018, 5:03 PM

24 points

3 comments10 min readLW link

[Montreal] Towards High-Assurance Advanced AI Systems by Richard Mallah

Mati_RoyNov 24, 2018, 6:24 AM

3 points

0 comments1 min readLW link

Upcoming: Open Questions

RaemonNov 24, 2018, 1:39 AM

41 points

7 comments2 min readLW link

A Dragon Confronts the Terasem Movement

AlephywrNov 24, 2018, 1:31 AM

−4 points

10 comments25 min readLW link

(dancefighterredux.wordpress.com)

What if people simply forecasted your future choices?

ozziegooenNov 23, 2018, 10:52 AM

16 points

6 comments6 min readLW link

Oversight of Unsafe Systems via Dynamic Safety Envelopes

DavidmanheimNov 23, 2018, 8:37 AM

10 points

2 comments2 min readLW link

On MIRI’s new research directions

Rob BensingerNov 22, 2018, 11:42 PM

53 points

12 comments1 min readLW link

(intelligence.org)

LW Update 2018-11-22 – Abridged Comments

RaemonNov 22, 2018, 10:11 PM

11 points

16 comments1 min readLW link

Approval-directed agents

paulfchristianoNov 22, 2018, 9:15 PM

31 points

10 comments15 min readLW link

Believing others’ priors

rkNov 22, 2018, 8:44 PM

8 points

19 comments7 min readLW link

Speculative Evopsych, Ep. 1

Optimization ProcessNov 22, 2018, 7:00 PM

41 points

9 comments1 min readLW link

If You Want to Win, Stop Conceding

Davis_KingsleyNov 22, 2018, 6:10 PM

47 points

15 comments3 min readLW link

Review: Artifact

ZviNov 22, 2018, 3:00 PM

21 points

3 comments13 min readLW link

(thezvi.wordpress.com)

Perspective Reasoning and the Sleeping Beauty Problem

dadadarrenNov 22, 2018, 11:55 AM

6 points

10 comments2 min readLW link

The Semantic Man

namespaceNov 22, 2018, 8:38 AM

19 points

4 comments1 min readLW link

(www.generalsemantics.org)

Jesus Made Me Rational (An Introduction)

MotasaurusNov 22, 2018, 5:09 AM

−14 points

56 comments3 min readLW link

Iteration Fixed Point Exercises

Scott Garrabrant and SamEisenstat

Nov 22, 2018, 12:35 AM

33 points

12 comments3 min readLW link

Suggestion: New material shouldn’t be released too fast

Chris_LeongNov 21, 2018, 4:39 PM

23 points

7 comments1 min readLW link

EA Bristol Strategy Meeting

thegreatnickNov 21, 2018, 10:57 AM

1 point

0 comments1 min readLW link

Rationality Café No. 6 - The Sequences, Part 1; Section B Repeat

thegreatnickNov 21, 2018, 10:54 AM

8 points

2 comments1 min readLW link

EA Funds: Long-Term Future fund is open to applications until November 24th (this Saturday)

habrykaNov 21, 2018, 3:39 AM

37 points

0 comments1 min readLW link

Incorrect hypotheses point to correct observations

Kaj_SotalaNov 20, 2018, 9:10 PM

169 points

40 comments4 min readLW link

(kajsotala.fi)

Preschool: Much Less Than You Wanted To Know

ZviNov 20, 2018, 7:30 PM

65 points

15 comments2 min readLW link

(thezvi.wordpress.com)

New safety research agenda: scalable agent alignment via reward modeling

VikaNov 20, 2018, 5:29 PM

34 points

12 comments1 min readLW link

(medium.com)

Prosaic AI alignment

paulfchristianoNov 20, 2018, 1:56 PM

48 points

10 comments8 min readLW link

Moscow LW meetup in “Nauchka” library

Alexander230Nov 20, 2018, 12:19 PM

2 points

0 comments1 min readLW link

[Insert clever intro here]

Bae's TheoremNov 20, 2018, 3:26 AM

18 points

13 comments1 min readLW link

Alignment Newsletter #33

Rohin ShahNov 19, 2018, 5:20 PM

23 points

0 comments9 min readLW link

(mailchi.mp)

Games in Kocherga club: Fallacymania, Tower of Chaos, Scientific Discovery

Alexander230Nov 19, 2018, 2:23 PM

2 points

0 comments1 min readLW link

Letting Others Be Vulnerable

lifelonglearnerNov 19, 2018, 2:59 AM

34 points

6 comments7 min readLW link

Clickbait might not be destroying our general Intelligence

Donald HobsonNov 19, 2018, 12:13 AM

25 points

13 comments2 min readLW link

South Bay Meetup 12/8

DavidFriedmanNov 19, 2018, 12:04 AM

3 points

0 comments1 min readLW link

[Link] “They go together: Freedom, Prosperity, and Big Government”

CronoDASNov 18, 2018, 4:51 PM

11 points

3 comments1 min readLW link

Collaboration-by-Design versus Emergent Collaboration

DavidmanheimNov 18, 2018, 7:22 AM

11 points

2 comments2 min readLW link

Diagonalization Fixed Point Exercises

Scott Garrabrant and SamEisenstat

18 Nov 2018 0:31 UTC

40 points

25 comments3 min readLW link

Ia! Ia! Extradimensional Cephalopod Nafl’fhtagn!

ExCeph17 Nov 2018 23:00 UTC

14 points

5 comments1 min readLW link

Effective Altruism, YouTube, and AI (talk by Lê Nguyên Hoang)

Paperclip Minimizer17 Nov 2018 19:21 UTC

3 points

0 comments LW link

(www.youtube.com)

An unaligned benchmark

paulfchristiano17 Nov 2018 15:51 UTC

31 points

0 comments9 min readLW link

On Rigorous Error Handling

Martin Sustrik17 Nov 2018 9:20 UTC

13 points

4 comments6 min readLW link

(250bpm.com)