All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 201820192020 2021 2022 2023 2024

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 262728 29 30 31

Embedded Agency via Abstraction

johnswentworth26 Aug 2019 23:03 UTC

42 points

20 comments11 min readLW link

Reversible changes: consider a bucket of water

Stuart_Armstrong26 Aug 2019 22:55 UTC

25 points

18 comments2 min readLW link

Toy model piece #3: close and distant situations

Stuart_Armstrong26 Aug 2019 22:41 UTC

10 points

0 comments1 min readLW link

[Question] How do you learn foreign language vocabulary, beyond Anki?

Elizabeth26 Aug 2019 21:00 UTC

9 points

21 comments1 min readLW link

[Question] How Can People Evaluate Complex Questions Consistently?

Elizabeth26 Aug 2019 20:33 UTC

46 points

12 comments1 min readLW link

Problems with AI debate

Stuart_Armstrong26 Aug 2019 19:21 UTC

21 points

3 comments5 min readLW link

Schelling Categories, and Simple Membership Tests

Zack_M_Davis26 Aug 2019 2:43 UTC

59 points

10 comments8 min readLW link

Limits of and to (artificial) Intelligence

MoritzG25 Aug 2019 22:16 UTC

1 point

3 comments7 min readLW link

Gratification: a useful concept, maybe new

Stuart_Armstrong25 Aug 2019 18:58 UTC

17 points

7 comments3 min readLW link

Under a week left to win $1,000! By questioning Oracle AIs.

Stuart_Armstrong25 Aug 2019 17:02 UTC

12 points

2 comments1 min readLW link

[Question] I’m interested in a sub-field of AI but don’t know what to call it.

fowlertm25 Aug 2019 14:55 UTC

9 points

4 comments1 min readLW link

[Question] Am I going for a job interview with a woo pusher?

CronoDAS25 Aug 2019 14:39 UTC

6 points

7 comments1 min readLW link

OpenPhil on “GiveWell’s Top Charities Are (Increasingly) Hard to Beat”

Raemon24 Aug 2019 23:28 UTC

17 points

0 comments6 min readLW link

(www.openphilanthropy.org)

Epistemic Spot Check: The Fate of Rome (Kyle Harper)

Elizabeth24 Aug 2019 21:40 UTC

39 points

3 comments5 min readLW link

(acesounderglass.com)

[Question] Performance IQ and higher mathematics

c5pi24 Aug 2019 17:31 UTC

4 points

5 comments1 min readLW link

[Question] how should a second version of “rationality: A to Z” look like?

Yoav Ravid24 Aug 2019 7:01 UTC

6 points

4 comments1 min readLW link

Petrov Day Celebration 2019 - Oxford Campsite

jbeshir24 Aug 2019 3:42 UTC

8 points

1 comment1 min readLW link

[Question] How has rationalism helped you?

Sunny from QAD24 Aug 2019 1:31 UTC

9 points

11 comments1 min readLW link

[Question] Is LW making progress?

zulupineapple24 Aug 2019 0:32 UTC

21 points

11 comments1 min readLW link

LessLong Launch Party

Raemon23 Aug 2019 22:18 UTC

12 points

1 comment1 min readLW link

[Question] Is there a simple parameter that controls human working memory capacity, which has been set tragically low?

Liron23 Aug 2019 22:10 UTC

17 points

8 comments1 min readLW link

Optimization Provenance

Adele Lopez23 Aug 2019 20:08 UTC

38 points

5 comments5 min readLW link

Troll Bridge

abramdemski23 Aug 2019 18:36 UTC

86 points

59 comments12 min readLW link

Understanding understanding

mthq23 Aug 2019 18:10 UTC

24 points

1 comment2 min readLW link

Actually updating

SaraHax23 Aug 2019 17:46 UTC

56 points

10 comments4 min readLW link

When do utility functions constrain?

Hoagy23 Aug 2019 17:19 UTC

30 points

8 comments7 min readLW link

Parables of Constraint and Actualization

Spencer Wyman23 Aug 2019 16:56 UTC

13 points

0 comments6 min readLW link

Thoughts on Retrieving Knowledge from Neural Networks

Jaime Ruiz23 Aug 2019 16:41 UTC

11 points

2 comments5 min readLW link

Algorithmic Similarity

LukasM23 Aug 2019 16:39 UTC

28 points

10 comments11 min readLW link

Soft takeoff can still lead to decisive strategic advantage

Daniel Kokotajlo23 Aug 2019 16:39 UTC

122 points

47 comments8 min readLW link 4 reviews

Moscow LW meetup in “Nauchka” library

Alexander23023 Aug 2019 12:40 UTC

3 points

0 comments1 min readLW link

OpenGPT-2: We Replicated GPT-2 Because You Can Too

avturchin23 Aug 2019 11:32 UTC

18 points

0 comments1 min readLW link

(medium.com)

Torture and Dust Specks and Joy—Oh my! or: Non-Archimedean Utility Functions as Pseudograded Vector Spaces

Louis_Brown23 Aug 2019 11:11 UTC

19 points

29 comments8 min readLW link

Metalignment: Deconfusing metaethics for AI alignment.

Guillaume Corlouer23 Aug 2019 10:25 UTC

13 points

7 comments3 min readLW link

[Question] A basic probability question

Shmi23 Aug 2019 7:13 UTC

11 points

3 comments1 min readLW link

Towards an Intentional Research Agenda

romeostevensit23 Aug 2019 5:27 UTC

20 points

8 comments3 min readLW link

[Question] Why are people so optimistic about superintelligence?

bipolo23 Aug 2019 4:25 UTC

6 points

3 comments1 min readLW link

Vague Thoughts and Questions about Agent Structures

loriphos23 Aug 2019 4:01 UTC

9 points

3 comments2 min readLW link

Formalising decision theory is hard

Lukas Finnveden23 Aug 2019 3:27 UTC

17 points

19 comments2 min readLW link

Creating Environments to Design and Test Embedded Agents

lemonhope23 Aug 2019 3:17 UTC

13 points

5 comments8 min readLW link

Tabooing ‘Agent’ for Prosaic Alignment

Hjalmar_Wijk23 Aug 2019 2:55 UTC

57 points

10 comments6 min readLW link

Vaniver’s View on Factored Cognition

Vaniver23 Aug 2019 2:54 UTC

48 points

4 comments8 min readLW link

Redefining Fast Takeoff

VojtaKovarik23 Aug 2019 2:15 UTC

10 points

1 comment1 min readLW link

[Question] Does Agent-like Behavior Imply Agent-like Architecture?

Scott Garrabrant23 Aug 2019 2:01 UTC

66 points

8 comments1 min readLW link

The Commitment Races problem

Daniel Kokotajlo23 Aug 2019 1:58 UTC

152 points

56 comments5 min readLW link

Analysis of a Secret Hitler Scenario

jaek23 Aug 2019 1:24 UTC

16 points

6 comments4 min readLW link

Thoughts from a Two Boxer

jaek23 Aug 2019 0:24 UTC

18 points

11 comments5 min readLW link

Deconfuse Yourself about Agency

VojtaKovarik23 Aug 2019 0:21 UTC

15 points

9 comments4 min readLW link

Logical Optimizers

Donald Hobson22 Aug 2019 23:54 UTC

11 points

4 comments3 min readLW link

Towards a mechanistic understanding of corrigibility

evhub22 Aug 2019 23:20 UTC

47 points

26 comments4 min readLW link