AXRP Episode 18 - Concept Extrapolation with Stuart Armstrong

DanielFilan · Sep 3, 2022, 11:12 PM
12 points
1 comment · 39 min read · LW link

An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet) · Sep 3, 2022, 8:43 PM
122 points
18 comments · 4 min read · LW link

[Question] Request for Alignment Research Project Recommendations

Rauno Arike · Sep 3, 2022, 3:29 PM
10 points
2 comments · 1 min read · LW link

Three scenarios of pseudo-alignment

Eleni Angelou · Sep 3, 2022, 12:47 PM
9 points
0 comments · 3 min read · LW link

Bugs or Features?

qbolec · Sep 3, 2022, 7:04 AM
73 points
9 comments · 2 min read · LW link

[Exploratory] Separate exploratory writing from public writing

Johannes C. Mayer · Sep 3, 2022, 2:57 AM
6 points
2 comments · 1 min read · LW link

We may be able to see sharp left turns coming

Sep 3, 2022, 2:55 AM
54 points
29 comments · 1 min read · LW link

[Exploratory] Exploratory Writing Info

Johannes C. Mayer · Sep 3, 2022, 2:50 AM
3 points
3 comments · 1 min read · LW link

[Question] Can someone explain to me why most researchers think alignment is probably something that is humanly tractable?

iamthouthouarti · Sep 3, 2022, 1:12 AM
32 points
11 comments · 1 min read · LW link

Behaviour Manifolds and the Hessian of the Total Loss—Notes and Criticism

carboniferous_umbraculum · Sep 3, 2022, 12:15 AM
35 points
5 comments · 6 min read · LW link

Sticky goals: a concrete experiment for understanding deceptive alignment

evhub · Sep 2, 2022, 9:57 PM
39 points
13 comments · 3 min read · LW link

Agency engineering: is AI-alignment “to human intent” enough?

catubc · Sep 2, 2022, 6:14 PM
9 points
10 comments · 6 min read · LW link

Hanover, Germany—ACX Meetups Everywhere 2022

eikowagenknecht · Sep 2, 2022, 5:31 PM
2 points
0 comments · 1 min read · LW link

Laziness in AI

Richard Henage · Sep 2, 2022, 5:04 PM
13 points
5 comments · 1 min read · LW link

Exporting Hangouts History

jefftk · Sep 2, 2022, 3:00 PM
20 points
0 comments · 2 min read · LW link
(www.jefftk.com)

Simulators

janus · Sep 2, 2022, 12:45 PM
631 points
168 comments · 41 min read · LW link · 8 reviews
(generative.ink)

Levelling Up in AI Safety Research Engineering

Gabe M · Sep 2, 2022, 4:59 AM
58 points
9 comments · 17 min read · LW link

Stop Discouraging Microwave Formula Preparation

jefftk · Sep 2, 2022, 2:10 AM
68 points
12 comments · 2 min read · LW link
(www.jefftk.com)

A Richly Interactive AGI Alignment Chart

lisperati · Sep 2, 2022, 12:44 AM
14 points
6 comments · 1 min read · LW link

Appendix: How to run a successful Hamming circle

CFAR!Duncan · Sep 2, 2022, 12:22 AM
41 points
6 comments · 7 min read · LW link

Replacement for PONR concept

Daniel Kokotajlo · Sep 2, 2022, 12:09 AM
58 points
6 comments · 2 min read · LW link

AI coordination needs clear wins

evhub · Sep 1, 2022, 11:41 PM
147 points
16 comments · 2 min read · LW link · 1 review

Short story speculating on possible ramifications of AI on the art world

Yitz · Sep 1, 2022, 9:15 PM
30 points
8 comments · 3 min read · LW link
(archiveofourown.org)

Why was progress so slow in the past?

jasoncrawford · Sep 1, 2022, 8:26 PM
54 points
31 comments · 6 min read · LW link
(rootsofprogress.org)

AI Safety and Neighboring Communities: A Quick-Start Guide, as of Summer 2022

Sam Bowman · Sep 1, 2022, 7:15 PM
76 points
2 comments · 7 min read · LW link

Gradient Hacker Design Principles From Biology

johnswentworth · Sep 1, 2022, 7:03 PM
60 points
13 comments · 3 min read · LW link

Book review: Put Your Ass Where Your Heart Wants to Be

Ruhul · Sep 1, 2022, 6:21 PM
1 point
2 comments · 10 min read · LW link

A Survey of Foundational Methods in Inverse Reinforcement Learning

adamk · Sep 1, 2022, 6:21 PM
27 points
0 comments · 12 min read · LW link

I Tripped and Became GPT! (And How This Updated My Timelines)

Frankophone · Sep 1, 2022, 5:56 PM
31 points
0 comments · 4 min read · LW link

[Question] Fixed point theory (locally (α,β,ψ) dominated contractive condition)

muzammil · Sep 1, 2022, 5:56 PM
0 points
3 comments · 1 min read · LW link

Alignment is hard. Communicating that, might be harder

Eleni Angelou · Sep 1, 2022, 4:57 PM
7 points
8 comments · 3 min read · LW link

Covid 9/1/22: Meet the New Booster

Zvi · Sep 1, 2022, 2:00 PM
41 points
6 comments · 14 min read · LW link
(thezvi.wordpress.com)

A Starter-kit for Rationality Space

Jesse Hoogland · Sep 1, 2022, 1:04 PM
43 points
0 comments · 1 min read · LW link
(github.com)

Pondering the paucity of volcanic profanity post Pompeii perusal

CraigMichael · Sep 1, 2022, 9:29 AM
21 points
2 comments · 15 min read · LW link

Infra-Exercises, Part 1

Sep 1, 2022, 5:06 AM
62 points
10 comments · 1 min read · LW link

Strategy For Conditioning Generative Models

Sep 1, 2022, 4:34 AM
31 points
4 comments · 18 min read · LW link

Safety Committee Resources

jefftk · Sep 1, 2022, 2:30 AM
22 points
2 comments · 1 min read · LW link
(www.jefftk.com)

Progress links and tweets, 2022-08-31

jasoncrawford · Aug 31, 2022, 9:54 PM
13 points
4 comments · 1 min read · LW link
(rootsofprogress.org)

Enantiodromia

ChristianKl · Aug 31, 2022, 9:13 PM
38 points
7 comments · 3 min read · LW link

[Question] Supposing Europe is headed for a serious energy crisis this winter, what can/should one do as an individual to prepare?

Erich_Grunewald · Aug 31, 2022, 7:28 PM
18 points
13 comments · 1 min read · LW link

New 80,000 Hours problem profile on existential risks from AI

Benjamin Hilton · Aug 31, 2022, 5:36 PM
28 points
6 comments · 7 min read · LW link
(80000hours.org)

Grand Theft Education

Zvi · Aug 31, 2022, 11:50 AM
66 points
18 comments · 20 min read · LW link
(thezvi.wordpress.com)

How much impact can any one man have?

GregorDeVillain · Aug 31, 2022, 10:26 AM
9 points
3 comments · 4 min read · LW link

[Question] How might we make better use of AI capabilities research for alignment purposes?

Jemal Young · Aug 31, 2022, 4:19 AM
11 points
4 comments · 1 min read · LW link

[Question] AI Box Experiment: Are people still interested?

Double · Aug 31, 2022, 3:04 AM
30 points
13 comments · 1 min read · LW link

OC ACX/LW in Newport Beach

Michael Michalchik · Aug 31, 2022, 2:56 AM
1 point
1 comment · 1 min read · LW link

Survey of NLP Researchers: NLP is contributing to AGI progress; major catastrophe plausible

Sam Bowman · Aug 31, 2022, 1:39 AM
91 points
6 comments · 2 min read · LW link

And the word was “God”

pchvykov · Aug 30, 2022, 9:13 PM
−22 points
4 comments · 3 min read · LW link

Worlds Where Iterative Design Fails

johnswentworth · Aug 30, 2022, 8:48 PM
208 points
30 comments · 10 min read · LW link · 1 review

Inner Alignment via Superpowers

Aug 30, 2022, 8:01 PM
37 points
13 comments · 4 min read · LW link