All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 2930

Inositol Non-Results

Elizabeth29 Nov 2023 21:40 UTC

20 points

2 comments1 min readLW link

(acesounderglass.com)

Losing Metaphors: Zip and Paste

jefftk29 Nov 2023 20:31 UTC

26 points

6 comments1 min readLW link

(www.jefftk.com)

Preserving our heritage: Building a movement and a knowledge ark for current and future generations

rnk829 Nov 2023 19:20 UTC

0 points

5 comments12 min readLW link

AGI Alignment is Absurd

Youssef Mohamed29 Nov 2023 19:11 UTC

−9 points

4 comments3 min readLW link

The origins of the steam engine: An essay with interactive animated diagrams

jasoncrawford29 Nov 2023 18:30 UTC

30 points

1 comment1 min readLW link

(rootsofprogress.org)

ChatGPT 4 solved all the gotcha problems I posed that tripped ChatGPT 3.5

VipulNaik29 Nov 2023 18:11 UTC

33 points

16 comments14 min readLW link

“Clean” vs. “messy” goal-directedness (Section 2.2.3 of “Scheming AIs”)

Joe Carlsmith29 Nov 2023 16:32 UTC

29 points

1 comment11 min readLW link

Lying Alignment Chart

Zack_M_Davis29 Nov 2023 16:15 UTC

77 points

17 comments1 min readLW link

Rethink Priorities: Seeking Expressions of Interest for Special Projects Next Year

kierangreig29 Nov 2023 13:59 UTC

4 points

0 comments5 min readLW link

[Question] Thoughts on teletransportation with copies?

titotal29 Nov 2023 12:56 UTC

15 points

13 comments1 min readLW link

Interpretability with Sparse Autoencoders (Colab exercises)

CallumMcDougall29 Nov 2023 12:56 UTC

74 points

9 comments4 min readLW link

The 101 Space You Will Always Have With You

Screwtape29 Nov 2023 4:56 UTC

253 points

21 comments6 min readLW link 1 review

Trust your intuition—Kahneman’s book misses the forest for the trees

mnvr29 Nov 2023 4:37 UTC

−2 points

2 comments2 min readLW link

Process Substitution Without Shell?

jefftk29 Nov 2023 3:20 UTC

19 points

18 comments2 min readLW link

(www.jefftk.com)

Deception Chess: Game #2

Zane29 Nov 2023 2:43 UTC

29 points

17 comments2 min readLW link

Black Box Biology

GeneSmith29 Nov 2023 2:27 UTC

62 points

30 comments2 min readLW link

[Question] What would be the shelf life of nuclear weapon-secrecy if nuclear weapons had not immediately been used in combat?

Gram Stone29 Nov 2023 0:53 UTC

7 points

2 comments1 min readLW link

Scaling laws for dominant assurance contracts

jessicata28 Nov 2023 23:11 UTC

36 points

5 comments7 min readLW link

(unstableontology.com)

I’m confused about innate smell neuroanatomy

Steven Byrnes28 Nov 2023 20:49 UTC

39 points

2 comments9 min readLW link

How to Control an LLM’s Behavior (why my P(DOOM) went down)

RogerDearnaley28 Nov 2023 19:56 UTC

64 points

30 comments11 min readLW link

[Question] Is there a word for discrimination against A.I.?

Aaron Bohannon28 Nov 2023 19:03 UTC

1 point

4 comments1 min readLW link

Update #2 to “Dominant Assurance Contract Platform”: EnsureDone

moyamo28 Nov 2023 18:02 UTC

33 points

2 comments1 min readLW link

Ethicophysics II: Politics is the Mind-Savior

MadHatter28 Nov 2023 16:27 UTC

−9 points

9 comments4 min readLW link

(bittertruths.substack.com)

Neither EA nor e/acc is what we need to build the future

jasoncrawford28 Nov 2023 16:04 UTC

0 points

22 comments3 min readLW link

(rootsofprogress.org)

Agentic Growth

Logan Kieller28 Nov 2023 15:45 UTC

1 point

0 comments3 min readLW link

(logankieller.substack.com)

AISC project: How promising is automating alignment research? (literature review)

Bogdan Ionut Cirstea28 Nov 2023 14:47 UTC

4 points

1 comment1 min readLW link

(docs.google.com)

A day in the life of a mechanistic interpretability researcher

Bill Benzon28 Nov 2023 14:45 UTC

3 points

3 comments1 min readLW link

Two sources of beyond-episode goals (Section 2.2.2 of “Scheming AIs”)

Joe Carlsmith28 Nov 2023 13:49 UTC

11 points

1 comment15 min readLW link

Self-Referential Probabilistic Logic Admits the Payor’s Lemma

Yudhister Kumar28 Nov 2023 10:27 UTC

80 points

14 comments6 min readLW link

[Question] How can I use AI without increasing AI-risk?

Yoav Ravid28 Nov 2023 10:05 UTC

18 points

6 comments1 min readLW link

A Reading From The Book Of Sequences

Screwtape28 Nov 2023 6:45 UTC

8 points

0 comments4 min readLW link

Anthropic Fall 2023 Debate Progress Update

Ansh Radhakrishnan28 Nov 2023 5:37 UTC

74 points

9 comments12 min readLW link

Apocalypse insurance, and the hardline libertarian take on AI risk

So8res28 Nov 2023 2:09 UTC

133 points

40 comments7 min readLW link 1 review

My techno-optimism [By Vitalik Buterin]

habryka27 Nov 2023 23:53 UTC

107 points

17 comments2 min readLW link

(www.lesswrong.com)

[Question] Could Germany have won World War I with high probability given the benefit of hindsight?

Roko27 Nov 2023 22:52 UTC

10 points

18 comments1 min readLW link

[Question] Could World War I have been prevented given the benefit of hindsight?

Roko27 Nov 2023 22:39 UTC

16 points

8 comments1 min readLW link

AISC 2024 - Project Summaries

NickyP27 Nov 2023 22:32 UTC

48 points

3 comments18 min readLW link

“Epistemic range of motion” and LessWrong moderation

habryka and Gabriel Alfour

27 Nov 2023 21:58 UTC

65 points

3 comments12 min readLW link

Apply to the Conceptual Boundaries Workshop for AI Safety

Chipmonk27 Nov 2023 21:04 UTC

50 points

0 comments3 min readLW link

There is no IQ for AI

Gabriel Alfour27 Nov 2023 18:21 UTC

30 points

10 comments9 min readLW link

(cognition.cafe)

Two concepts of an “episode” (Section 2.2.1 of “Scheming AIs”)

Joe Carlsmith27 Nov 2023 18:01 UTC

19 points

1 comment13 min readLW link

[Linkpost] George Mack’s Razors

trevor27 Nov 2023 17:53 UTC

38 points

8 comments3 min readLW link

(twitter.com)

On possible cross-fertilization between AI and neuroscience [Creativity]

Bill Benzon27 Nov 2023 16:50 UTC

15 points

22 comments7 min readLW link

Ethicophysics I

MadHatter27 Nov 2023 15:44 UTC

−1 points

16 comments1 min readLW link

(open.substack.com)

Sentience Institute 2023 End of Year Summary

michael_dello27 Nov 2023 12:11 UTC

11 points

0 comments5 min readLW link

(www.sentienceinstitute.org)

[Question] A Question about Corrigibility (2015)

A.H.27 Nov 2023 12:05 UTC

4 points

2 comments1 min readLW link

Appendices to the live agendas

technicalities and Stag

27 Nov 2023 11:10 UTC

16 points

4 comments1 min readLW link

Shallow review of live agendas in alignment & safety

technicalities and Stag

27 Nov 2023 11:10 UTC

332 points

69 comments29 min readLW link

Napoleon stole the Roman Inquisition archives and investigated the Galileo case

Meow P27 Nov 2023 9:41 UTC

−3 points

0 comments1 min readLW link

(www.cricetuscricetus.co.uk)

Found Paper: “FDT in an evolutionary environment”

the gears to ascension27 Nov 2023 5:27 UTC

30 points

47 comments1 min readLW link

(arxiv.org)