All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 282930

Scaling laws for dominant assurance contracts

jessicata28 Nov 2023 23:11 UTC

36 points

5 comments7 min readLW link

(unstableontology.com)

I’m confused about innate smell neuroanatomy

Steven Byrnes28 Nov 2023 20:49 UTC

39 points

2 comments9 min readLW link

How to Control an LLM’s Behavior (why my P(DOOM) went down)

RogerDearnaley28 Nov 2023 19:56 UTC

64 points

30 comments11 min readLW link

[Question] Is there a word for discrimination against A.I.?

Aaron Bohannon28 Nov 2023 19:03 UTC

1 point

4 comments1 min readLW link

Update #2 to “Dominant Assurance Contract Platform”: EnsureDone

moyamo28 Nov 2023 18:02 UTC

33 points

2 comments1 min readLW link

Ethicophysics II: Politics is the Mind-Savior

MadHatter28 Nov 2023 16:27 UTC

−9 points

9 comments4 min readLW link

(bittertruths.substack.com)

Neither EA nor e/acc is what we need to build the future

jasoncrawford28 Nov 2023 16:04 UTC

0 points

22 comments3 min readLW link

(rootsofprogress.org)

Agentic Growth

Logan Kieller28 Nov 2023 15:45 UTC

1 point

0 comments3 min readLW link

(logankieller.substack.com)

AISC project: How promising is automating alignment research? (literature review)

Bogdan Ionut Cirstea28 Nov 2023 14:47 UTC

4 points

1 comment1 min readLW link

(docs.google.com)

A day in the life of a mechanistic interpretability researcher

Bill Benzon28 Nov 2023 14:45 UTC

3 points

3 comments1 min readLW link

Two sources of beyond-episode goals (Section 2.2.2 of “Scheming AIs”)

Joe Carlsmith28 Nov 2023 13:49 UTC

11 points

1 comment15 min readLW link

Self-Referential Probabilistic Logic Admits the Payor’s Lemma

Yudhister Kumar28 Nov 2023 10:27 UTC

80 points

14 comments6 min readLW link

[Question] How can I use AI without increasing AI-risk?

Yoav Ravid28 Nov 2023 10:05 UTC

18 points

6 comments1 min readLW link

A Reading From The Book Of Sequences

Screwtape28 Nov 2023 6:45 UTC

8 points

0 comments4 min readLW link

Anthropic Fall 2023 Debate Progress Update

Ansh Radhakrishnan28 Nov 2023 5:37 UTC

74 points

9 comments12 min readLW link

Apocalypse insurance, and the hardline libertarian take on AI risk

So8res28 Nov 2023 2:09 UTC

133 points

40 comments7 min readLW link 1 review

My techno-optimism [By Vitalik Buterin]

habryka27 Nov 2023 23:53 UTC

107 points

17 comments2 min readLW link

(www.lesswrong.com)

[Question] Could Germany have won World War I with high probability given the benefit of hindsight?

Roko27 Nov 2023 22:52 UTC

10 points

18 comments1 min readLW link

[Question] Could World War I have been prevented given the benefit of hindsight?

Roko27 Nov 2023 22:39 UTC

16 points

8 comments1 min readLW link

AISC 2024 - Project Summaries

NickyP27 Nov 2023 22:32 UTC

48 points

3 comments18 min readLW link

“Epistemic range of motion” and LessWrong moderation

habryka and Gabriel Alfour

27 Nov 2023 21:58 UTC

65 points

3 comments12 min readLW link

Apply to the Conceptual Boundaries Workshop for AI Safety

Chipmonk27 Nov 2023 21:04 UTC

50 points

0 comments3 min readLW link

There is no IQ for AI

Gabriel Alfour27 Nov 2023 18:21 UTC

30 points

10 comments9 min readLW link

(cognition.cafe)

Two concepts of an “episode” (Section 2.2.1 of “Scheming AIs”)

Joe Carlsmith27 Nov 2023 18:01 UTC

19 points

1 comment13 min readLW link

[Linkpost] George Mack’s Razors

trevor27 Nov 2023 17:53 UTC

38 points

8 comments3 min readLW link

(twitter.com)

On possible cross-fertilization between AI and neuroscience [Creativity]

Bill Benzon27 Nov 2023 16:50 UTC

15 points

22 comments7 min readLW link

Ethicophysics I

MadHatter27 Nov 2023 15:44 UTC

−1 points

16 comments1 min readLW link

(open.substack.com)

Sentience Institute 2023 End of Year Summary

michael_dello27 Nov 2023 12:11 UTC

11 points

0 comments5 min readLW link

(www.sentienceinstitute.org)

[Question] A Question about Corrigibility (2015)

A.H.27 Nov 2023 12:05 UTC

4 points

2 comments1 min readLW link

Appendices to the live agendas

technicalities and Stag

27 Nov 2023 11:10 UTC

16 points

4 comments1 min readLW link

Shallow review of live agendas in alignment & safety

technicalities and Stag

27 Nov 2023 11:10 UTC

332 points

69 comments29 min readLW link

Napoleon stole the Roman Inquisition archives and investigated the Galileo case

Meow P27 Nov 2023 9:41 UTC

−3 points

0 comments1 min readLW link

(www.cricetuscricetus.co.uk)

Found Paper: “FDT in an evolutionary environment”

the gears to ascension27 Nov 2023 5:27 UTC

30 points

47 comments1 min readLW link

(arxiv.org)

[Question] why did OpenAI employees sign

bhauth27 Nov 2023 5:21 UTC

49 points

23 comments1 min readLW link

Unknown Probabilities

transhumanist_atom_understander27 Nov 2023 2:30 UTC

22 points

0 comments4 min readLW link

Justification for Induction

Krantz27 Nov 2023 2:05 UTC

2 points

25 comments5 min readLW link

Situational awareness (Section 2.1 of “Scheming AIs”)

Joe Carlsmith26 Nov 2023 23:00 UTC

10 points

5 comments8 min readLW link

AXRP Episode 26 - AI Governance with Elizabeth Seger

DanielFilan26 Nov 2023 23:00 UTC

14 points

0 comments66 min readLW link

Solving Two-Sided Adverse Selection with Prediction Market Matchmaking

Saul Munn26 Nov 2023 20:10 UTC

16 points

7 comments4 min readLW link

(www.brasstacks.blog)

Wikipedia is not so great, and what can be done about it.

euserx26 Nov 2023 19:13 UTC

0 points

27 comments16 min readLW link

(forum.effectivealtruism.org)

[Question] Help me solve this problem: The basilisk isn’t real, but people are

canary_itm26 Nov 2023 17:44 UTC

−19 points

4 comments1 min readLW link

Twin Cities ACX Meetup—December 2023

Timothy M.26 Nov 2023 17:32 UTC

1 point

1 comment1 min readLW link

Spaced repetition for teaching two-year olds how to read (Interview)

Chipmonk26 Nov 2023 16:52 UTC

48 points

9 comments5 min readLW link

(chipmonk.substack.com)

Paper out now on creatine and cognitive performance

Fabienne26 Nov 2023 10:58 UTC

58 points

2 comments1 min readLW link

Why Q*, if real, might be a game changer

Shmi26 Nov 2023 6:12 UTC

5 points

6 comments1 min readLW link

Moral Reality Check (a short story)

jessicata26 Nov 2023 5:03 UTC

148 points

45 comments21 min readLW link 1 review

(unstableontology.com)

Accounting for Foregone Pay

jefftk26 Nov 2023 3:30 UTC

11 points

0 comments2 min readLW link

(www.jefftk.com)

Corrigibility or DWIM is an attractive primary goal for AGI

Seth Herd25 Nov 2023 19:37 UTC

16 points

4 comments1 min readLW link

On “slack” in training (Section 1.5 of “Scheming AIs”)

Joe Carlsmith25 Nov 2023 17:51 UTC

1 point

0 comments5 min readLW link

Announcing New Beginner-friendly Book on AI Safety and Risk

Darren McKee25 Nov 2023 15:57 UTC

64 points

2 comments1 min readLW link