Take 9: No, RLHF/IDA/debate doesn’t solve outer alignment.

Charlie Steiner · Dec 12, 2022, 11:51 AM

33 points

13 comments · 2 min read · LW link

Creating a database for base rates

nikos · Dec 12, 2022, 10:09 AM

2 points

1 comment · 3 min read · LW link

(forum.effectivealtruism.org)

Trivial GPT-3.5 limitation workaround

Dave Lindbergh · Dec 12, 2022, 8:42 AM

5 points

4 comments · 1 min read · LW link

Ponzi schemes can be highly profitable if your timing is good

GeneSmith · Dec 12, 2022, 6:42 AM

10 points

18 comments · 5 min read · LW link

Prodding ChatGPT to solve a basic algebra problem

Shmi · Dec 12, 2022, 4:09 AM

14 points

6 comments · 1 min read · LW link

(twitter.com)

Wider Default Audio Player in Chrome?

jefftk · Dec 12, 2022, 3:30 AM

11 points

2 comments · 1 min read · LW link

(www.jefftk.com)

A brainteaser for language models

Adam Scherlis · Dec 12, 2022, 2:43 AM

47 points

3 comments · 2 min read · LW link

Benchmarks for Comparing Human and AI Intelligence

MrThink · Dec 11, 2022, 10:06 PM

9 points

4 comments · 2 min read · LW link

Reflections on the PIBBSS Fellowship 2022

Nora_Ammann and particlemania · Dec 11, 2022, 9:53 PM

32 points

0 comments · 18 min read · LW link

A crisis for online communication: bots and bot users will overrun the Internet?

Mitchell_Porter · Dec 11, 2022, 9:11 PM

15 points

11 comments · 1 min read · LW link

Finite Factored Sets in Pictures

Magdalena Wache · Dec 11, 2022, 6:49 PM

174 points

35 comments · 12 min read · LW link

Formalization as suspension of intuition

adamShimi · Dec 11, 2022, 3:16 PM

54 points

18 comments · 1 min read · LW link

(epistemologicalvigilance.substack.com)

An argument on animal consciousness (soliciting criticism)

SciHamster · Dec 11, 2022, 3:12 PM

1 point

2 comments · 1 min read · LW link

ChatGPT’s new novel rationality technique of fact checking

ChristianKl · Dec 11, 2022, 1:54 PM

−14 points

7 comments · 1 min read · LW link

Reframing inner alignment

davidad · Dec 11, 2022, 1:53 PM

53 points

13 comments · 4 min read · LW link

A poem about applied rationality by ChatGPT

ChristianKl · Dec 11, 2022, 1:43 PM

4 points

0 comments · 1 min read · LW link

ChatGPT goes through a wormhole hole in our Shandyesque universe [virtual wacky weed]

Bill Benzon · Dec 11, 2022, 11:59 AM

−1 points

2 comments · 3 min read · LW link

Using Obsidian if you’re used to using Roam

Solenoid_Entity · Dec 11, 2022, 8:59 AM

19 points

4 comments · 2 min read · LW link

[fiction] Our Final Hour

Mati_Roy · Dec 11, 2022, 5:49 AM

23 points

5 comments · 3 min read · LW link

Consider using reversible automata for alignment research

Alex_Altair · Dec 11, 2022, 1:00 AM

88 points

30 comments · 2 min read · LW link

High level discourse structure in ChatGPT: Part 2 [Quasi-symbolic?]

Bill Benzon · Dec 10, 2022, 10:26 PM

7 points

0 comments · 6 min read · LW link

Poll Results on AGI

Niclas Kupper · Dec 10, 2022, 9:25 PM

18 points

0 comments · 2 min read · LW link

Reflecting on the 2022 Guild of the Rose Workshops

moridinamael · Dec 10, 2022, 9:21 PM

26 points

7 comments · 8 min read · LW link

[Question] Reversing a quantum simulation on the planetary scale

Mythopoeist · Dec 10, 2022, 8:26 PM

2 points

3 comments · 1 min read · LW link

ACX Zurich December Meetup

MB · Dec 10, 2022, 7:23 PM

1 point

0 comments · 1 min read · LW link

[ASoT] Natural abstractions and AlphaZero

Ulisse Mini · Dec 10, 2022, 5:53 PM

33 points

1 comment · 1 min read · LW link

(arxiv.org)

[Question] How promising are legal avenues to restrict AI training data?

thehalliard · Dec 10, 2022, 4:31 PM

9 points

2 comments · 1 min read · LW link

Inspiration as a Scarce Resource

zenbu zenbu zenbu zenbu · Dec 10, 2022, 3:23 PM

7 points

0 comments · 4 min read · LW link

(inflorescence.substack.com)

Will Manifold Markets/Metaculus have built-in support for reflective latent variables by 2025?

tailcalled · Dec 10, 2022, 1:55 PM

35 points

0 comments · 1 min read · LW link

My thoughts on OpenAI’s Alignment plan

Donald Hobson · Dec 10, 2022, 10:35 AM

25 points

1 comment · 6 min read · LW link

[Question] How would you improve ChatGPT’s filtering?

Noah Scales · Dec 10, 2022, 8:05 AM

9 points

6 comments · 1 min read · LW link

[Question] A thought experiment

sisyphus · Dec 10, 2022, 5:23 AM

3 points

12 comments · 1 min read · LW link

patio11’s “Observations from an EA-adjacent (?) charitable effort”

RobertM · Dec 10, 2022, 12:27 AM

43 points

0 comments · 1 min read · LW link

(forum.effectivealtruism.org)

A dynamical systems primer for entropy and optimization

Alex_Altair · Dec 10, 2022, 12:13 AM

45 points

3 comments · 7 min read · LW link

[Linkpost] The Story Of VaccinateCA

hath · Dec 9, 2022, 11:54 PM

103 points

4 comments · 10 min read · LW link

(www.worksinprogress.co)

Prosaic misalignment from the Solomonoff Predictor

Cleo Nardo · Dec 9, 2022, 5:53 PM

42 points

3 comments · 5 min read · LW link

Take 8: Queer the inner/outer alignment dichotomy.

Charlie Steiner · Dec 9, 2022, 5:46 PM

31 points

2 comments · 2 min read · LW link

[Question] Does a LLM have a utility function?

Dagon · Dec 9, 2022, 5:19 PM

17 points

11 comments · 1 min read · LW link

Monthly Roundup #1

Zvi · Dec 9, 2022, 5:10 PM

31 points

6 comments · 21 min read · LW link

(thezvi.wordpress.com)

Working towards AI alignment is better

Johannes C. Mayer · Dec 9, 2022, 3:39 PM

8 points

2 comments · 2 min read · LW link

You can still fetch the coffee today if you’re dead tomorrow

davidad · Dec 9, 2022, 2:06 PM

96 points

19 comments · 5 min read · LW link

ChatGPT’s Misalignment Isn’t What You Think

stavros · Dec 9, 2022, 11:11 AM

3 points

12 comments · 1 min read · LW link

ML Safety at NeurIPS & Paradigmatic AI Safety? MLAISU W49

Esben Kran and Steinthal · Dec 9, 2022, 10:38 AM

19 points

0 comments · 4 min read · LW link

(newsletter.apartresearch.com)

[Question] What are your thoughts on the future of AI-assisted software development?

RomanHauksson · Dec 9, 2022, 10:04 AM

4 points

4 comments · 1 min read · LW link

Fear mitigated the nuclear threat, can it do the same to AGI risks?

Igor Ivanov · Dec 9, 2022, 10:04 AM

6 points

8 comments · 5 min read · LW link

Setting the Zero Point

Duncan Sabien (Inactive) · Dec 9, 2022, 6:06 AM

90 points

43 comments · 20 min read · LW link · 1 review

Systems of Survival

Vaniver · Dec 9, 2022, 5:13 AM

63 points

5 comments · 5 min read · LW link

[Question] Do You Have an Internal Monologue?

belkarx · Dec 9, 2022, 3:04 AM

23 points

7 comments · 1 min read · LW link

[Question] How is the “sharp left turn defined”?

Chris_Leong · Dec 9, 2022, 12:04 AM

14 points

4 comments · 1 min read · LW link

Linkpost for a generalist algorithmic learner: capable of carrying out sorting, shortest paths, string matching, convex hull finding in one network

lovetheusers · Dec 9, 2022, 12:02 AM

7 points

1 comment · 1 min read · LW link

(twitter.com)
