Tuning your Cognitive Strategies

Raemon · 27 Apr 2023 20:32 UTC
155 points
57 comments · 9 min read · LW link
(bewelltuned.com)

The LW crossroads of purpose

Caerulea-Lawrence · 27 Apr 2023 19:53 UTC
11 points
2 comments · 2 min read · LW link

Metaculus Event: Forecast Friday, April 28th at 12pm ET — Speed Forecasting Session!

ChristianWilliams · 27 Apr 2023 19:50 UTC
0 points
0 comments · 1 min read · LW link

Infrafunctions Proofs

Diffractor · 27 Apr 2023 19:25 UTC
12 points
1 comment · 10 min read · LW link

Infrafunctions and Robust Optimization

Diffractor · 27 Apr 2023 19:25 UTC
61 points
11 comments · 15 min read · LW link

What are the limits of superintelligence?

rainy · 27 Apr 2023 18:29 UTC
4 points
3 comments · 5 min read · LW link

A Proposal for AI Alignment: Using Directly Opposing Models

Arne B · 27 Apr 2023 18:05 UTC
0 points
5 comments · 3 min read · LW link

My views on “doom”

paulfchristiano · 27 Apr 2023 17:50 UTC
248 points
37 comments · 2 min read · LW link · 1 review
(ai-alignment.com)

[untitled post]

NeuralSystem_e5e1 · 27 Apr 2023 17:37 UTC
3 points
0 comments · 1 min read · LW link

An International Manhattan Project for Artificial Intelligence

Glenn Clayton · 27 Apr 2023 17:34 UTC
−11 points
2 comments · 5 min read · LW link

Quote quiz: “drifting into dependence”

jasoncrawford · 27 Apr 2023 15:13 UTC
7 points
6 comments · 1 min read · LW link
(rootsofprogress.org)

Second-Level Empiricism: Reframing the Two-Child Puzzle

Richard Henage · 27 Apr 2023 15:04 UTC
16 points
5 comments · 3 min read · LW link

Interview with Paul Christiano: How We Prevent the AI’s from Killing us

Dalmert · 27 Apr 2023 14:39 UTC
12 points
0 comments · 1 min read · LW link
(www.youtube.com)

AI #9: The Merge and the Million Tokens

Zvi · 27 Apr 2023 14:20 UTC
36 points
8 comments · 53 min read · LW link
(thezvi.wordpress.com)

AI doom from an LLM-plateau-ist perspective

Steven Byrnes · 27 Apr 2023 13:58 UTC
157 points
24 comments · 6 min read · LW link

Romance, misunderstanding, social stances, and the human LLM

Kaj_Sotala · 27 Apr 2023 12:59 UTC
69 points
32 comments · 16 min read · LW link

“A Note on the Compatibility of Different Robust Program Equilibria of the Prisoner’s Dilemma”

the gears to ascension · 27 Apr 2023 7:34 UTC
18 points
5 comments · 1 min read · LW link
(arxiv.org)

AI chatbots don’t know why they did it

skybrian · 27 Apr 2023 6:57 UTC
18 points
11 comments · 2 min read · LW link
(skybrian.substack.com)

The Great Ideological Conflict: Intuitionists vs. Establishmentarians

Thoth Hermes · 27 Apr 2023 1:49 UTC
3 points
0 comments · 11 min read · LW link
(thothhermes.substack.com)

Automating the Breath Pulse

jefftk · 27 Apr 2023 0:10 UTC
11 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Freedom Is All We Need

Leo Glisic · 27 Apr 2023 0:09 UTC
−1 points
8 comments · 10 min read · LW link

Contra Yudkowsky on Doom from Foom #2

jacob_cannell · 27 Apr 2023 0:07 UTC
93 points
76 comments · 6 min read · LW link

A very non-technical explanation of the basics of infra-Bayesianism

David Matolcsi · 26 Apr 2023 22:57 UTC
62 points
9 comments · 9 min read · LW link

LM Situational Awareness, Evaluation Proposal: Violating Imitation

Jacob Pfau · 26 Apr 2023 22:53 UTC
16 points
2 comments · 2 min read · LW link

Recent Database Migration—Report Bugs

RobertM · 26 Apr 2023 22:19 UTC
38 points
2 comments · 1 min read · LW link

Infra-Bayesianism naturally leads to the monotonicity principle, and I think this is a problem

David Matolcsi · 26 Apr 2023 21:39 UTC
19 points
6 comments · 4 min read · LW link

Understanding new terms via etymology

corruptedCatapillar · 26 Apr 2023 20:48 UTC
4 points
1 comment · 2 min read · LW link
(forum.effectivealtruism.org)

Chad Jones paper modeling AI and x-risk vs. growth

jasoncrawford · 26 Apr 2023 20:07 UTC
39 points
7 comments · 2 min read · LW link
(web.stanford.edu)

I was Wrong, Simulator Theory is Real

Robert_AIZI · 26 Apr 2023 17:45 UTC
75 points
7 comments · 3 min read · LW link
(aizi.substack.com)

$250 prize for checking Jake Cannell's Brain Efficiency

Alexander Gietelink Oldenziel · 26 Apr 2023 16:21 UTC
123 points
170 comments · 2 min read · LW link

My version of Simulacra Levels

Daniel Kokotajlo · 26 Apr 2023 15:50 UTC
42 points
15 comments · 3 min read · LW link

[Question] Is the fact that we don't observe any obvious glitch evidence that we're not in a simulation?

Jim Buhler · 26 Apr 2023 14:57 UTC
8 points
16 comments · 1 min read · LW link

Transcript and Brief Response to Twitter Conversation between Yann LeCunn and Eliezer Yudkowsky

Zvi · 26 Apr 2023 13:10 UTC
190 points
51 comments · 10 min read · LW link
(thezvi.wordpress.com)

What comes after?

rogersbacon · 26 Apr 2023 12:44 UTC
3 points
0 comments · 2 min read · LW link
(www.secretorum.life)

Accidental Terraforming

Sable · 26 Apr 2023 6:49 UTC
9 points
16 comments · 5 min read · LW link
(affablyevil.substack.com)

Philosophy by Paul Graham Link

EniScien · 26 Apr 2023 5:36 UTC
21 points
4 comments · 1 min read · LW link

Boxing at the gym

yakimoff · 26 Apr 2023 5:10 UTC
1 point
0 comments · 1 min read · LW link

Sibelius + drinks

yakimoff · 26 Apr 2023 5:08 UTC
1 point
0 comments · 1 min read · LW link

A simple presentation of AI risk arguments

Seth Herd · 26 Apr 2023 2:19 UTC
16 points
0 comments · 2 min read · LW link

Archetypal Transfer Learning: a Proposed Alignment Solution that solves the Inner & Outer Alignment Problem while adding Corrigible Traits to GPT-2-medium

MiguelDev · 26 Apr 2023 1:37 UTC
14 points
5 comments · 10 min read · LW link

[Question] How Many Bits Of Optimization Can One Bit Of Observation Unlock?

johnswentworth · 26 Apr 2023 0:26 UTC
62 points
32 comments · 3 min read · LW link

Believe in Yourself and don't stop Improving

Johannes C. Mayer · 25 Apr 2023 22:34 UTC
0 points
0 comments · 1 min read · LW link

Should LW have an official list of norms?

Ruby · 25 Apr 2023 21:20 UTC
58 points
31 comments · 5 min read · LW link

Implementing a Transformer from scratch in PyTorch—a write-up on my experience

Mislav Jurić · 25 Apr 2023 20:51 UTC
20 points
0 comments · 10 min read · LW link

Exploring the Lottery Ticket Hypothesis

Rauno Arike · 25 Apr 2023 20:06 UTC
54 points
3 comments · 11 min read · LW link

Genetic Sequencing of Wastewater: Prevalence to Relative Abundance

jefftk · 25 Apr 2023 19:30 UTC
17 points
2 comments · 2 min read · LW link
(www.jefftk.com)

[Feedback please] New User's Guide to LessWrong

Ruby · 25 Apr 2023 18:54 UTC
38 points
18 comments · 6 min read · LW link

Reframing the burden of proof: Companies should prove that models are safe (rather than expecting auditors to prove that models are dangerous)

Akash · 25 Apr 2023 18:49 UTC
27 points
11 comments · 3 min read · LW link
(childrenoficarus.substack.com)

LLMs for online discussion moderation

Dave Lindbergh · 25 Apr 2023 16:53 UTC
12 points
3 comments · 3 min read · LW link

AI Safety Newsletter #3: AI policy proposals and a new challenger approaches

ozhang · 25 Apr 2023 16:15 UTC
33 points
0 comments · 1 min read · LW link