OpenAI: GPT-based LLMs show ability to discriminate between its own wrong answers, but inability to explain how/why it makes that discrimination, even as model scales

Aditya Jain Jun 13, 2022, 11:33 PM

14 points

5 comments 1 min read LW link

(openai.com)

[Question] Who said something like “The fact that putting 2 apples next to 2 other apples leads to there being 4 apples there has nothing to do with the fact that 2 + 2 = 4”?

hunterglenn Jun 13, 2022, 10:23 PM

1 point

2 comments 1 min read LW link

Continuity Assumptions

Jan_Kulveit Jun 13, 2022, 9:31 PM

44 points

13 comments 4 min read LW link

Crypto-fed Computation

aaguirre Jun 13, 2022, 9:20 PM

24 points

7 comments 7 min read LW link

A Modest Pivotal Act

anonymousaisafety Jun 13, 2022, 7:24 PM

−16 points

1 comment 5 min read LW link

Contra EY: Can AGI destroy us without trial & error?

nsokolsky Jun 13, 2022, 6:26 PM

137 points

72 comments 15 min read LW link

What are some smaller-but-concrete challenges related to AI safety that are impacting people today?

nonzerosum Jun 13, 2022, 5:36 PM

4 points

3 comments 1 min read LW link

[Link] New SEP article on Bayesian Epistemology

Aryeh Englander Jun 13, 2022, 3:03 PM

6 points

0 comments 1 min read LW link

Training Trace Priors

Adam Jermyn Jun 13, 2022, 2:22 PM

12 points

17 comments 4 min read LW link

[Question] Can you MRI a deep learning model?

Yair Halberstadt Jun 13, 2022, 1:43 PM

3 points

3 comments 1 min read LW link

On A List of Lethalities

Zvi Jun 13, 2022, 12:30 PM

165 points

50 comments 54 min read LW link 1 review

(thezvi.wordpress.com)

D&D.Sci June 2022 Evaluation and Ruleset

abstractapplic Jun 13, 2022, 10:31 AM

34 points

11 comments 4 min read LW link

[Question] What’s the “This AI is of moral concern.” fire alarm?

Quintin Pope Jun 13, 2022, 8:05 AM

37 points

56 comments 2 min read LW link

The beautiful magical enchanted golden Dall-e Mini is underrated

p.b. Jun 13, 2022, 7:58 AM

14 points

0 comments 1 min read LW link

Why so little AI risk on rationalist-adjacent blogs?

Grant Demaree Jun 13, 2022, 6:31 AM

46 points

23 comments 8 min read LW link

Code Quality and Rule Consequentialism

Adam Zerner Jun 13, 2022, 3:12 AM

17 points

13 comments 6 min read LW link

Grokking “Semi-informative priors over AI timelines”

anson.ho Jun 12, 2022, 10:17 PM

15 points

7 comments 14 min read LW link

[Question] How much does cybersecurity reduce AI risk?

Darmani Jun 12, 2022, 10:13 PM

34 points

23 comments 1 min read LW link

[Question] How are compute assets distributed in the world?

Chris van Merwijk Jun 12, 2022, 10:13 PM

30 points

7 comments 1 min read LW link

Intuitive Explanation of AIXI

Thomas Larsen Jun 12, 2022, 9:41 PM

22 points

1 comment 5 min read LW link

Why all the fuss about recursive self-improvement?

So8res Jun 12, 2022, 8:53 PM

158 points

62 comments 7 min read LW link 1 review

Why the Kaldor-Hicks criterion can be non-transitive

Rupert Jun 12, 2022, 5:26 PM

4 points

10 comments 2 min read LW link

[Question] How do you post links here?

skybrian Jun 12, 2022, 4:23 PM

1 point

1 comment 1 min read LW link

[Question] Filter out tags from the front page?

jaspax Jun 12, 2022, 10:59 AM

9 points

2 comments 1 min read LW link

How To: A Workshop (or anything)

Duncan Sabien (Inactive) Jun 12, 2022, 8:00 AM

53 points

13 comments 37 min read LW link 1 review

A claim that Google’s LaMDA is sentient

Ben Livengood Jun 12, 2022, 4:18 AM

31 points

133 comments 1 min read LW link

[Question] How much stupider than humans can AI be and still kill us all through sheer numbers and resource access?

Shmi Jun 12, 2022, 1:01 AM

11 points

11 comments 1 min read LW link

ELK Proposal—Make the Reporter care about the Predictor’s beliefs

Adam Jermyn and Nicholas Schiefer Jun 11, 2022, 10:53 PM

8 points

0 comments 6 min read LW link

[Question] Why has no person / group ever taken over the world?

Aryeh Englander Jun 11, 2022, 8:51 PM

25 points

19 comments 1 min read LW link

[Question] Are there English-speaking meetups in Frankfurt/Munich/Zurich?

Grant Demaree Jun 11, 2022, 8:02 PM

6 points

2 comments 1 min read LW link

Beauty and the Beast

Tomás B. Jun 11, 2022, 6:59 PM

44 points

8 comments 6 min read LW link

Poorly-Aimed Death Rays

Thane Ruthenis Jun 11, 2022, 6:29 PM

48 points

5 comments 4 min read LW link

AGI Safety Communications Initiative

ines Jun 11, 2022, 5:34 PM

7 points

0 comments 1 min read LW link

A gaming group for rationality-aware people

dhatas Jun 11, 2022, 4:04 PM

7 points

0 comments 1 min read LW link

[Question] Why don’t you introduce really impressive people you personally know to AI alignment (more often)?

Verden Jun 11, 2022, 3:59 PM

33 points

14 comments 1 min read LW link

Godzilla Strategies

johnswentworth Jun 11, 2022, 3:44 PM

159 points

72 comments 3 min read LW link

Steganography and the CycleGAN—alignment failure case study

Jan Czechowski Jun 11, 2022, 9:41 AM

34 points

0 comments 4 min read LW link

The Mountain Troll

lsusr Jun 11, 2022, 9:14 AM

103 points

26 comments 2 min read LW link

Show LW: YodaTimer.com

Adam Zerner Jun 11, 2022, 8:52 AM

27 points

4 comments 1 min read LW link

How fast can we perform a forward pass?

jsteinhardt Jun 10, 2022, 11:30 PM

53 points

9 comments 15 min read LW link

(bounded-regret.ghost.io)

Summary of “AGI Ruin: A List of Lethalities”

Stephen McAleese Jun 10, 2022, 10:35 PM

45 points

2 comments 8 min read LW link

How dangerous is human-level AI?

Alex_Altair Jun 10, 2022, 5:38 PM

21 points

4 comments 8 min read LW link

Another plausible scenario of AI risk: AI builds military infrastructure while collaborating with humans, defects later.

avturchin Jun 10, 2022, 5:24 PM

10 points

2 comments 1 min read LW link

Leaving Google, Joining the Nucleic Acid Observatory

jefftk Jun 10, 2022, 5:00 PM

114 points

4 comments 3 min read LW link

(www.jefftk.com)

On The Spectrum, On The Guest List: (v) The Fleur Room

party girl Jun 10, 2022, 2:50 PM

8 points

1 comment 14 min read LW link

(onthespectrumontheguestlist.substack.com)

Progress Report 6: get the tool working

Nathan Helm-Burger Jun 10, 2022, 11:18 AM

4 points

0 comments 2 min read LW link

[Question] Is AI Alignment Impossible?

Heighn Jun 10, 2022, 10:08 AM

3 points

3 comments 1 min read LW link

I No Longer Believe Intelligence to be “Magical”

DragonGod Jun 10, 2022, 8:58 AM

28 points

34 comments 6 min read LW link

[linkpost] The final AI benchmark: BIG-bench

RomanS Jun 10, 2022, 8:53 AM

25 points

21 comments 1 min read LW link

[Question] Could Patent-Trolling delay AI timelines?

Pablo Repetto Jun 10, 2022, 2:53 AM

1 point

3 comments 1 min read LW link
