All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 20232024

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Increasing IQ is trivial

George3d61 Mar 2024 22:43 UTC

37 points

60 comments6 min readLW link

(epistemink.substack.com)

self-fulfilling prophecies when applying for funding

Chipmonk1 Mar 2024 19:01 UTC

31 points

0 comments1 min readLW link

(chipmonk.substack.com)

Antagonistic AI

Xybermancer1 Mar 2024 18:50 UTC

−8 points

1 comment1 min readLW link

Against Augmentation of Intelligence, Human or Otherwise (An Anti-Natalist Argument)

Benjamin Bourlier1 Mar 2024 18:45 UTC

−29 points

18 comments3 min readLW link

Elon files grave charges against OpenAI

mako yass1 Mar 2024 17:42 UTC

38 points

10 comments1 min readLW link

(www.courthousenews.com)

Notes on Dwarkesh Patel’s Podcast with Demis Hassabis

Zvi1 Mar 2024 16:30 UTC

93 points

0 comments8 min readLW link

(thezvi.wordpress.com)

What does your philosophy maximize?

Antb1 Mar 2024 16:10 UTC

0 points

1 comment1 min readLW link

The Defence production act and AI policy

NathanBarnard1 Mar 2024 14:26 UTC

37 points

0 comments2 min readLW link

Don’t Endorse the Idea of Market Failure

Maxwell Tabarrok1 Mar 2024 14:04 UTC

14 points

22 comments4 min readLW link

(www.maximum-progress.com)

[Question] Is it possible to make more specific bookmarks?

numpyNaN1 Mar 2024 12:47 UTC

1 point

0 comments1 min readLW link

Wholesome Culture

owencb1 Mar 2024 12:08 UTC

29 points

3 comments1 min readLW link

Adding Sensors to Mandolin?

jefftk1 Mar 2024 2:10 UTC

6 points

1 comment1 min readLW link

(www.jefftk.com)

The Parable Of The Fallen Pendulum—Part 1

johnswentworth1 Mar 2024 0:25 UTC

111 points

32 comments2 min readLW link

Gradations of moral weight

MichaelStJules29 Feb 2024 23:08 UTC

1 point

0 comments1 min readLW link

Approaching Human-Level Forecasting with Language Models

Fred Zhang, dannyhalawi and jsteinhardt

29 Feb 2024 22:36 UTC

60 points

6 comments3 min readLW link

Paper review: “The Unreasonable Effectiveness of Easy Training Data for Hard Tasks”

Vassil Tashev29 Feb 2024 18:44 UTC

11 points

0 comments4 min readLW link

What’s in the box?! – Towards interpretability by distinguishing niches of value within neural networks.

Joshua Clancy29 Feb 2024 18:33 UTC

3 points

4 comments128 min readLW link

Short Post: Discerning Truth from Trash

FinalFormal229 Feb 2024 18:09 UTC

−2 points

0 comments1 min readLW link

AI #53: One More Leap

Zvi29 Feb 2024 16:10 UTC

45 points

0 comments38 min readLW link

(thezvi.wordpress.com)

Cryonics p(success) estimates are only weakly associated with interest in pursuing cryonics in the LW 2023 Survey

Andy_McKenzie29 Feb 2024 14:47 UTC

28 points

6 comments1 min readLW link

Bengio’s Alignment Proposal: “Towards a Cautious Scientist AI with Convergent Safety Bounds”

mattmacdermott29 Feb 2024 13:59 UTC

76 points

19 comments14 min readLW link

(yoshuabengio.org)

Tips for Empirical Alignment Research

Ethan Perez29 Feb 2024 6:04 UTC

152 points

4 comments22 min readLW link

[Question] Supposing the 1bit LLM paper pans out

O O29 Feb 2024 5:31 UTC

27 points

11 comments1 min readLW link

Can RLLMv3′s ability to defend against jailbreaks be attributed to datasets containing stories about Jung’s shadow integration theory?

MiguelDev29 Feb 2024 5:13 UTC

7 points

2 comments11 min readLW link

Post series on “Liability Law for reducing Existential Risk from AI”

Nora_Ammann29 Feb 2024 4:39 UTC

42 points

1 comment1 min readLW link

(forum.effectivealtruism.org)

Tour Retrospective February 2024

jefftk29 Feb 2024 3:50 UTC

10 points

0 comments4 min readLW link

(www.jefftk.com)

Locating My Eyes (Part 3 of “The Sense of Physical Necessity”)

LoganStrohl29 Feb 2024 3:09 UTC

43 points

4 comments22 min readLW link

Conspiracy Theorists Aren’t Ignorant. They’re Bad At Epistemology.

omnizoid28 Feb 2024 23:39 UTC

18 points

10 comments5 min readLW link

Discovering alignment windfalls reduces AI risk

goodgravy and stuhlmueller

28 Feb 2024 21:23 UTC

15 points

1 comment8 min readLW link

(blog.elicit.com)

my theory of the industrial revolution

bhauth28 Feb 2024 21:07 UTC

23 points

7 comments3 min readLW link

(www.bhauth.com)

Wholesomeness and Effective Altruism

owencb28 Feb 2024 20:28 UTC

42 points

3 comments1 min readLW link

timestamping through the Singularity

throwaway91811912728 Feb 2024 19:09 UTC

−2 points

4 comments8 min readLW link

Evidential Cooperation in Large Worlds: Potential Objections & FAQ

Chi Nguyen and _will_

28 Feb 2024 18:58 UTC

42 points

5 comments1 min readLW link

Timaeus’s First Four Months

Jesse Hoogland, Daniel Murfet, Stan van Wingerden and Alexander Gietelink Oldenziel

28 Feb 2024 17:01 UTC

172 points

6 comments6 min readLW link

Notes on control evaluations for safety cases

ryan_greenblatt, Buck and Fabien Roger

28 Feb 2024 16:15 UTC

48 points

0 comments32 min readLW link

Corporate Governance for Frontier AI Labs: A Research Agenda

Matthew Wearden28 Feb 2024 11:29 UTC

4 points

0 comments16 min readLW link

(matthewwearden.co.uk)

How AI Will Change Education

robotelvis28 Feb 2024 5:30 UTC

6 points

3 comments5 min readLW link

(messyprogress.substack.com)

Band Lessons?

jefftk28 Feb 2024 3:00 UTC

13 points

3 comments1 min readLW link

(www.jefftk.com)

New LessWrong review winner UI (“The LeastWrong” section and full-art post pages)

kave28 Feb 2024 2:42 UTC

105 points

64 comments1 min readLW link

Counting arguments provide no evidence for AI doom

Nora Belrose and Quintin Pope

27 Feb 2024 23:03 UTC

95 points

188 comments14 min readLW link

Which animals realize which types of subjective welfare?

MichaelStJules27 Feb 2024 19:31 UTC

4 points

0 comments1 min readLW link

Biosecurity and AI: Risks and Opportunities

Steve Newman27 Feb 2024 18:45 UTC

11 points

1 comment7 min readLW link

(www.safe.ai)

The Gemini Incident Continues

Zvi27 Feb 2024 16:00 UTC

45 points

6 comments48 min readLW link

(thezvi.wordpress.com)

How I internalized my achievements to better deal with negative feelings

Raymond Koopmanschap27 Feb 2024 15:10 UTC

42 points

7 comments6 min readLW link

On Frustration and Regret

silentbob27 Feb 2024 12:19 UTC

8 points

0 comments4 min readLW link

Facts vs Interpretations—An Exercise in Cognitive Reframing

Declan Molony27 Feb 2024 7:57 UTC

15 points

0 comments3 min readLW link

San Francisco ACX Meetup “Third Saturday”

Nate Sternberg and guenael

27 Feb 2024 7:07 UTC

7 points

0 comments1 min readLW link

Examining Language Model Performance with Reconstructed Activations using Sparse Autoencoders

Evan Anders and Joseph Bloom

27 Feb 2024 2:43 UTC

42 points

16 comments15 min readLW link

Project idea: an iterated prisoner’s dilemma competition/game

Adam Zerner26 Feb 2024 23:06 UTC

8 points

0 comments5 min readLW link

Acting Wholesomely

owencb26 Feb 2024 21:49 UTC

58 points

64 comments1 min readLW link