A basic lexicon of telic concepts · mrcbarbier · Oct 22, 2022, 9:28 PM · 2 points · 0 comments · 3 min read · LW link
Do we have the right kind of math for roles, goals and meaning? · mrcbarbier · Oct 22, 2022, 9:28 PM · 13 points · 5 comments · 7 min read · LW link
[Question] The Last Year - is there an existing novel about the last year before AI doom? · Luca Petrolati · Oct 22, 2022, 8:44 PM · 4 points · 4 comments · 1 min read · LW link
The highest-probability outcome can be out of distribution · tailcalled · Oct 22, 2022, 8:00 PM · 14 points · 5 comments · 1 min read · LW link
Newsletter for Alignment Research: The ML Safety Updates · Esben Kran · Oct 22, 2022, 4:17 PM · 25 points · 0 comments · LW link
Crypto loves impact markets: Notes from Schelling Point Bogotá · Rachel Shu · Oct 22, 2022, 3:58 PM · 17 points · 2 comments · LW link
[Question] When trying to define general intelligence is ability to achieve goals the best metric? · jmh · Oct 22, 2022, 3:09 AM · 5 points · 0 comments · 1 min read · LW link
[Question] Simple question about corrigibility and values in AI. · jmh · Oct 22, 2022, 2:59 AM · 6 points · 1 comment · 1 min read · LW link
Moorean Statements · David Udell · Oct 22, 2022, 12:50 AM · 11 points · 11 comments · 1 min read · LW link
Wisdom Cannot Be Unzipped · Sable · Oct 22, 2022, 12:28 AM · 74 points · 17 comments · 7 min read · LW link · 1 review · (affablyevil.substack.com)
A framework and open questions for game theoretic shard modeling · Garrett Baker · Oct 21, 2022, 9:40 PM · 11 points · 4 comments · 4 min read · LW link
Cooperators are more powerful than agents · Ivan Vendrov · Oct 21, 2022, 8:02 PM · 29 points · 7 comments · 3 min read · LW link
Intelligent behaviour across systems, scales and substrates · Nora_Ammann · Oct 21, 2022, 5:09 PM · 11 points · 0 comments · 10 min read · LW link
Deepfake(?) Phishing · jefftk · Oct 21, 2022, 2:30 PM · 37 points · 9 comments · 1 min read · LW link · (www.jefftk.com)
acronyms ftw · Emrik · Oct 21, 2022, 1:36 PM · −2 points · 5 comments · 2 min read · LW link
Crossword puzzle: LessWrong Halloween 2022 · jchan · Oct 21, 2022, 12:41 PM · 11 points · 11 comments · 1 min read · LW link
Weekly Roundup #2 · Zvi · Oct 21, 2022, 12:10 PM · 37 points · 2 comments · 11 min read · LW link · (thezvi.wordpress.com)
Improved Security to Prevent Hacker-AI and Digital Ghosts · Erland Wittkotter · Oct 21, 2022, 10:11 AM · 4 points · 3 comments · 12 min read · LW link
Two Guts · chanamessinger · Oct 21, 2022, 10:01 AM · 21 points · 0 comments · LW link
The importance of studying subjective experience · Q Home · Oct 21, 2022, 8:43 AM · 10 points · 3 comments · 7 min read · LW link
Legal Brief: Plurality Voting is Unconstitutional · c.trout · Oct 21, 2022, 4:55 AM · 6 points · 20 comments · 11 min read · LW link · (medium.com)
Learning societal values from law as part of an AGI alignment strategy · John Nay · Oct 21, 2022, 2:03 AM · 5 points · 18 comments · 54 min read · LW link
Covid 10/20/22: Wait, We Did WHAT? · Zvi · Oct 20, 2022, 9:50 PM · 55 points · 16 comments · 16 min read · LW link · (thezvi.wordpress.com)
When apparently positive evidence can be negative evidence · cata · Oct 20, 2022, 9:47 PM · 19 points · 5 comments · 1 min read · LW link · (www.ncbi.nlm.nih.gov)
Plans Are Predictions, Not Optimization Targets · johnswentworth · Oct 20, 2022, 9:17 PM · 108 points · 20 comments · 4 min read · LW link · 1 review
Introduction to abstract entropy · Alex_Altair · Oct 20, 2022, 9:03 PM · 238 points · 78 comments · 18 min read · LW link · 1 review
Trajectories to 2036 · ukc10014 · Oct 20, 2022, 8:23 PM · 3 points · 1 comment · 14 min read · LW link
[Question] Rough Sketch for Product to Enhance Citizen Participation in Politics · T431 · Oct 20, 2022, 8:04 PM · 13 points · 5 comments · 1 min read · LW link
The heritability of human values: A behavior genetic critique of Shard Theory · geoffreymiller · Oct 20, 2022, 3:51 PM · 82 points · 63 comments · 21 min read · LW link
A Longtermist case against Veganism · Connor Tabarrok · Oct 20, 2022, 2:30 PM · −3 points · 3 comments · LW link
AI Research Program Prediction Markets · tailcalled · Oct 20, 2022, 1:42 PM · 38 points · 10 comments · 1 min read · LW link
[Question] Is the meaning of words chosen/interpreted to maximize correlations with other relevant queries? · tailcalled · Oct 20, 2022, 10:03 AM · 9 points · 9 comments · 1 min read · LW link
How to Write Readable Posts · David Hartsough · Oct 20, 2022, 7:48 AM · 7 points · 0 comments · LW link
Notes on “Can you control the past” · So8res · Oct 20, 2022, 3:41 AM · 64 points · 41 comments · 21 min read · LW link
Rhythmic Baby Toys · jefftk · Oct 20, 2022, 1:50 AM · 15 points · 1 comment · 1 min read · LW link · (www.jefftk.com)
[Question] What Does AI Alignment Success Look Like? · Shmi · Oct 20, 2022, 12:32 AM · 23 points · 7 comments · 1 min read · LW link
Scaling Laws for Reward Model Overoptimization · leogao, John Schulman and Jacob_Hilton · Oct 20, 2022, 12:20 AM · 103 points · 13 comments · 1 min read · LW link · (arxiv.org)
What is Consciousness? · belkarx · Oct 19, 2022, 9:14 PM · 3 points · 2 comments · 2 min read · LW link
What to do if a nuclear weapon is used in Ukraine? · Valentin2026 · Oct 19, 2022, 6:43 PM · 13 points · 9 comments · 3 min read · LW link
[Question] If I asked for an explanation of a perfect Utopia, could you give one? · Akkira · Oct 19, 2022, 5:56 PM · −4 points · 2 comments · 1 min read · LW link
[Question] Should we push for requiring AI training data to be licensed? · ChristianKl · Oct 19, 2022, 5:49 PM · 37 points · 32 comments · 1 min read · LW link
Hacker-AI and Digital Ghosts – Pre-AGI · Erland Wittkotter · Oct 19, 2022, 3:33 PM · 9 points · 7 comments · 8 min read · LW link
The reward function is already how well you manipulate humans · Kerry · Oct 19, 2022, 1:52 AM · 20 points · 9 comments · 2 min read · LW link
Response to Katja Grace’s AI x-risk counterarguments · Erik Jenner and Johannes Treutlein · Oct 19, 2022, 1:17 AM · 77 points · 18 comments · 15 min read · LW link
(OLD) An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers · Neel Nanda · Oct 18, 2022, 9:08 PM · 72 points · 5 comments · 12 min read · LW link · (www.neelnanda.io)
Distilled Representations Research Agenda · Hoagy and mishajw · Oct 18, 2022, 8:59 PM · 15 points · 2 comments · 8 min read · LW link
Drafting a Covid Survey · jefftk · Oct 18, 2022, 7:30 PM · 15 points · 2 comments · 2 min read · LW link · (www.jefftk.com)
How To Make Prediction Markets Useful For Alignment Work · johnswentworth · Oct 18, 2022, 7:01 PM · 97 points · 18 comments · 2 min read · LW link
A conversation about Katja’s counterarguments to AI risk · Matthew Barnett and Ege Erdil · Oct 18, 2022, 6:40 PM · 43 points · 9 comments · 33 min read · LW link
ACX Zurich October Meetup · MB · Oct 18, 2022, 6:24 PM · 1 point · 1 comment · 1 min read · LW link