All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 345 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Murphyjitsu: an Inner Simulator algorithm

CFAR!DuncanJun 30, 2022, 9:50 PM

67 points

24 comments11 min readLW link 2 reviews

GPT-3 Catching Fish in Morse Code

Megan KinnimentJun 30, 2022, 9:22 PM

117 points

27 comments8 min readLW link

Metacognition in the Rat

Jacob FalkovichJun 30, 2022, 8:53 PM

19 points

0 comments6 min readLW link

On viewquakes

Dalton MaberyJun 30, 2022, 8:08 PM

8 points

0 comments2 min readLW link

The Track Record of Futurists Seems … Fine

HoldenKarnofskyJun 30, 2022, 7:40 PM

91 points

25 comments12 min readLW link

(www.cold-takes.com)

Quick survey on AI alignment resources

frances_lorenzJun 30, 2022, 7:09 PM

14 points

0 comments1 min readLW link

[Linkpost] Solving Quantitative Reasoning Problems with Language Models

YitzJun 30, 2022, 6:58 PM

76 points

15 comments2 min readLW link

(storage.googleapis.com)

Failing to fix a dangerous intersection

alyssavanceJun 30, 2022, 6:09 PM

110 points

17 comments2 min readLW link

Most Functions Have Undesirable Global Extrema

En KepeigJun 30, 2022, 5:10 PM

8 points

5 comments3 min readLW link

Hedonistic Isotopes:

TrozxzrJun 30, 2022, 4:49 PM

1 point

0 comments1 min readLW link

Abadarian Trades

David UdellJun 30, 2022, 4:41 PM

17 points

22 comments2 min readLW link

Covid 6/30/22: Vaccine Update Update

ZviJun 30, 2022, 2:00 PM

32 points

6 comments12 min readLW link

(thezvi.wordpress.com)

[Question] How should I talk about optimal but not subgame-optimal play?

JamesFavilleJun 30, 2022, 1:58 PM

5 points

1 comment3 min readLW link

Formal Philosophy and Alignment Possible Projects

Daniel HerrmannJun 30, 2022, 10:42 AM

34 points

5 comments8 min readLW link

Bangalore LW/ACX Meetup in person

AdityaJun 30, 2022, 7:21 AM

5 points

2 comments1 min readLW link

Cultivating And Destroying Agency

hathJun 30, 2022, 3:59 AM

104 points

11 comments9 min readLW link

$500 bounty for alignment contest ideas

Orpheus16Jun 30, 2022, 1:56 AM

29 points

5 comments2 min readLW link

any good rationalist guides to nutrition / healthy eating?

Ben AJun 30, 2022, 12:50 AM

7 points

15 comments1 min readLW link

A summary of every Replacing Guilt post

Orpheus16Jun 30, 2022, 12:46 AM

35 points

3 comments10 min readLW link

(forum.effectivealtruism.org)

Gradient hacking: definitions and examples

Richard_NgoJun 29, 2022, 9:35 PM

38 points

2 comments5 min readLW link

Progress links and tweets, 2022-06-29

jasoncrawfordJun 29, 2022, 9:33 PM

9 points

0 comments1 min readLW link

(rootsofprogress.org)

[Question] Correcting human error vs doing exactly what you’re told—is there literature on this in context of general system design?

Jan CzechowskiJun 29, 2022, 9:30 PM

6 points

0 comments1 min readLW link

Latent Adversarial Training

Adam JermynJun 29, 2022, 8:04 PM

52 points

13 comments5 min readLW link

Game Review: This Merchant Life

ZviJun 29, 2022, 6:30 PM

20 points

0 comments13 min readLW link

(thezvi.wordpress.com)

Limits to Legibility

Jan_KulveitJun 29, 2022, 5:42 PM

157 points

11 comments5 min readLW link 1 review

Will Capabilities Generalise More?

Ramana KumarJun 29, 2022, 5:12 PM

133 points

39 comments4 min readLW link

Kevin Kelly’s “103 Bits of Advice,” Expanded

Dalton MaberyJun 29, 2022, 1:36 PM

19 points

0 comments5 min readLW link

The table of different sampling assumptions in anthropics

avturchinJun 29, 2022, 10:41 AM

39 points

5 comments12 min readLW link

Can We Align AI by Having It Learn Human Preferences? I’m Scared (summary of last third of Human Compatible)

apollonianbluesJun 29, 2022, 4:09 AM

19 points

3 comments6 min readLW link

Kurzgesagt – The Last Human (Youtube)

habrykaJun 29, 2022, 3:28 AM

54 points

7 comments1 min readLW link

(www.youtube.com)

[Question] Literature on How to Maximize Preferences

joshJun 28, 2022, 10:41 PM

1 point

0 comments1 min readLW link

Challenge: A Much More Alien Message

kmanJun 28, 2022, 9:50 PM

24 points

7 comments1 min readLW link

It’s Probably Not Lithium

NatáliaJun 28, 2022, 9:24 PM

442 points

187 comments28 min readLW link 1 review

Reflections on Living in “Guess Culture”

Dalton MaberyJun 28, 2022, 9:00 PM

13 points

1 comment3 min readLW link

[Question] What is the LessWrong Logo(?) Supposed to Represent?

DragonGodJun 28, 2022, 8:20 PM

8 points

6 comments1 min readLW link

What Are You Tracking In Your Head?

johnswentworthJun 28, 2022, 7:30 PM

289 points

83 comments4 min readLW link 1 review

Why is so much political commentary misleading?

contrarianbritJun 28, 2022, 5:10 PM

−2 points

5 comments6 min readLW link

(thomasprosser.substack.com)

CFAR Handbook: Introduction

CFAR!DuncanJun 28, 2022, 4:53 PM

116 points

12 comments1 min readLW link

Units of Exchange

CFAR!DuncanJun 28, 2022, 4:53 PM

99 points

28 comments11 min readLW link

Scott Aaronson and Steven Pinker Debate AI Scaling

LironJun 28, 2022, 4:04 PM

37 points

7 comments1 min readLW link

(scottaaronson.blog)

A physicist’s approach to Origins of Life

pchvykovJun 28, 2022, 3:23 PM

12 points

6 comments16 min readLW link

What success looks like

Marius Hobbhahn, MaxRa, JasperGeh and Yannick_Muehlhaeuser

Jun 28, 2022, 2:38 PM

19 points

4 comments1 min readLW link

(forum.effectivealtruism.org)

Four reasons I find AI safety emotionally compelling

KatWoods and AmberDawn

Jun 28, 2022, 2:10 PM

39 points

3 comments4 min readLW link

Some alternative AI safety research projects

Michele CampoloJun 28, 2022, 2:09 PM

9 points

0 comments3 min readLW link

Doom doubts—is inner alignment a likely problem?

CrissmanJun 28, 2022, 12:42 PM

6 points

7 comments1 min readLW link

Low-Friction MBTA Predictions

jefftkJun 28, 2022, 12:30 PM

15 points

0 comments1 min readLW link

(www.jefftk.com)

What Diet Books Don’t Teach: A book review and a request for more reading

Lone PineJun 28, 2022, 12:27 PM

22 points

34 comments4 min readLW link

Assessing AlephAlphas Multimodal Model

p.b.28 Jun 2022 9:28 UTC

30 points

5 comments3 min readLW link

[Question] Is there any way someone could post about public policy relating to abortion access (or another sensitive subject) on LessWrong without getting super downvoted?

Evan_Gaensbauer28 Jun 2022 5:45 UTC

18 points

20 comments1 min readLW link

[Test Post Please Ignore] Testing polling features

Lone Pine28 Jun 2022 4:35 UTC

7 points

5 comments1 min readLW link