All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 262728 29 30

A very non-technical explanation of the basics of infra-Bayesianism

David Matolcsi26 Apr 2023 22:57 UTC

62 points

9 comments9 min readLW link

LM Situational Awareness, Evaluation Proposal: Violating Imitation

Jacob Pfau26 Apr 2023 22:53 UTC

16 points

2 comments2 min readLW link

Recent Database Migration—Report Bugs

RobertM26 Apr 2023 22:19 UTC

38 points

2 comments1 min readLW link

Infra-Bayesianism naturally leads to the monotonicity principle, and I think this is a problem

David Matolcsi26 Apr 2023 21:39 UTC

19 points

6 comments4 min readLW link

Understanding new terms via etymology

corruptedCatapillar26 Apr 2023 20:48 UTC

4 points

1 comment2 min readLW link

(forum.effectivealtruism.org)

Chad Jones paper modeling AI and x-risk vs. growth

jasoncrawford26 Apr 2023 20:07 UTC

39 points

7 comments2 min readLW link

(web.stanford.edu)

I was Wrong, Simulator Theory is Real

Robert_AIZI26 Apr 2023 17:45 UTC

75 points

7 comments3 min readLW link

(aizi.substack.com)

$250 prize for checking Jake Cannell’s Brain Efficiency

Alexander Gietelink Oldenziel26 Apr 2023 16:21 UTC

123 points

170 comments2 min readLW link

My version of Simulacra Levels

Daniel Kokotajlo26 Apr 2023 15:50 UTC

42 points

15 comments3 min readLW link

[Question] Is the fact that we don’t observe any obvious glitch evidence that we’re not in a simulation?

Jim Buhler26 Apr 2023 14:57 UTC

8 points

16 comments1 min readLW link

Transcript and Brief Response to Twitter Conversation between Yann LeCunn and Eliezer Yudkowsky

Zvi26 Apr 2023 13:10 UTC

190 points

51 comments10 min readLW link

(thezvi.wordpress.com)

What comes after?

rogersbacon26 Apr 2023 12:44 UTC

3 points

0 comments2 min readLW link

(www.secretorum.life)

Accidental Terraforming

Sable26 Apr 2023 6:49 UTC

9 points

16 comments5 min readLW link

(affablyevil.substack.com)

Philosophy by Paul Graham Link

EniScien26 Apr 2023 5:36 UTC

21 points

4 comments1 min readLW link

Boxing at the gym

yakimoff26 Apr 2023 5:10 UTC

1 point

0 comments1 min readLW link

Sibelius + drinks

yakimoff26 Apr 2023 5:08 UTC

1 point

0 comments1 min readLW link

A simple presentation of AI risk arguments

Seth Herd26 Apr 2023 2:19 UTC

16 points

0 comments2 min readLW link

Archetypal Transfer Learning: a Proposed Alignment Solution that solves the Inner & Outer Alignment Problem while adding Corrigible Traits to GPT-2-medium

MiguelDev26 Apr 2023 1:37 UTC

14 points

5 comments10 min readLW link

[Question] How Many Bits Of Optimization Can One Bit Of Observation Unlock?

johnswentworth26 Apr 2023 0:26 UTC

62 points

32 comments3 min readLW link

Believe in Yourself and don’t stop Improving

Johannes C. Mayer25 Apr 2023 22:34 UTC

0 points

0 comments1 min readLW link

Should LW have an official list of norms?

Ruby25 Apr 2023 21:20 UTC

58 points

31 comments5 min readLW link

Implementing a Transformer from scratch in PyTorch—a write-up on my experience

Mislav Jurić25 Apr 2023 20:51 UTC

20 points

0 comments10 min readLW link

Exploring the Lottery Ticket Hypothesis

Rauno Arike25 Apr 2023 20:06 UTC

54 points

3 comments11 min readLW link

Genetic Sequencing of Wastewater: Prevalence to Relative Abundance

jefftk25 Apr 2023 19:30 UTC

17 points

2 comments2 min readLW link

(www.jefftk.com)

[Feedback please] New User’s Guide to LessWrong

Ruby25 Apr 2023 18:54 UTC

38 points

18 comments6 min readLW link

Reframing the burden of proof: Companies should prove that models are safe (rather than expecting auditors to prove that models are dangerous)

Akash25 Apr 2023 18:49 UTC

27 points

11 comments3 min readLW link

(childrenoficarus.substack.com)

LLMs for online discussion moderation

Dave Lindbergh25 Apr 2023 16:53 UTC

12 points

3 comments3 min readLW link

AI Safety Newsletter #3: AI policy proposals and a new challenger approaches

ozhang25 Apr 2023 16:15 UTC

33 points

0 comments1 min readLW link

EA might systematically generate a scarcity mindset that produces low-integrity actors

Severin T. Seehrich25 Apr 2023 15:50 UTC

26 points

2 comments1 min readLW link

Max Tegmark’s new Time article on how we’re in a Don’t Look Up scenario [Linkpost]

Jonas Hallgren25 Apr 2023 15:41 UTC

39 points

9 comments1 min readLW link

(time.com)

WHO Biological Risk warning

Jonas Kgomo25 Apr 2023 15:10 UTC

−6 points

2 comments1 min readLW link

A Rant on Calculus III

Wofsen25 Apr 2023 14:51 UTC

−5 points

2 comments1 min readLW link

Briefly how I’ve updated since ChatGPT

rime25 Apr 2023 14:47 UTC

48 points

2 comments2 min readLW link

Discuss AI Policy Recommendations

Giles25 Apr 2023 14:21 UTC

8 points

0 comments1 min readLW link

Explaining the Transformer Circuits Framework by Example

Felix Hofstätter25 Apr 2023 13:45 UTC

8 points

0 comments15 min readLW link

Notes on Potential Future AI Tax Policy

Zvi25 Apr 2023 13:30 UTC

33 points

6 comments9 min readLW link

(thezvi.wordpress.com)

Sentience in Silicon: The Challenges of AI Consciousness

Hannes Thurnherr25 Apr 2023 13:15 UTC

5 points

2 comments5 min readLW link

Paths to failure

Karl von Wendt and mespa

25 Apr 2023 8:03 UTC

29 points

1 comment8 min readLW link

My Assessment of the Chinese AI Safety Community

Lao Mein25 Apr 2023 4:21 UTC

250 points

94 comments3 min readLW link

Making Nanobots isn’t a one-shot process, even for an artificial superintelligance

dankrad25 Apr 2023 0:39 UTC

20 points

13 comments6 min readLW link

Mental Models Of People Can Be People

Nox ML25 Apr 2023 0:03 UTC

13 points

55 comments8 min readLW link

Progress links and tweets, 2023-04-24

jasoncrawford24 Apr 2023 21:17 UTC

16 points

1 comment2 min readLW link

(rootsofprogress.org)

Ideas for AI labs: Reading list

Zach Stein-Perlman24 Apr 2023 19:00 UTC

11 points

0 comments4 min readLW link

Deep learning models might be secretly (almost) linear

beren24 Apr 2023 18:43 UTC

117 points

29 comments4 min readLW link

Subjective AI/ML Digest: April II

Boris T24 Apr 2023 18:33 UTC

1 point

0 comments1 min readLW link

(borisagain.substack.com)

The Toxoplasma of AGI Doom and Capabilities?

Robert_AIZI24 Apr 2023 18:11 UTC

72 points

12 comments1 min readLW link

[Question] Measures of Internet Virality and News Popularity

Fer32dwt34r3dfsz24 Apr 2023 17:43 UTC

4 points

4 comments1 min readLW link

A concise sum-up of the basic argument for AI doom

Mergimio H. Doefevmil24 Apr 2023 17:37 UTC

11 points

6 comments2 min readLW link

A response to Conjecture’s CoEm proposal

Kristian Freed24 Apr 2023 17:23 UTC

7 points

0 comments4 min readLW link

Camaraderie at scale: in search of shared identity

eq24 Apr 2023 16:46 UTC

8 points

2 comments8 min readLW link