All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 151617 18 19 20 21 22 23 24 25 26 27 28 29

Offering AI safety support calls for ML professionals

Vael Gates15 Feb 2024 23:48 UTC

61 points

1 comment1 min readLW link

7. Evolution and Ethics

RogerDearnaley15 Feb 2024 23:38 UTC

3 points

6 comments6 min readLW link

Mapping the semantic void III: Exploring neighbourhoods

mwatkins15 Feb 2024 23:01 UTC

13 points

0 comments10 min readLW link

Mapping the semantic void II: Above, below and between token embeddings

mwatkins15 Feb 2024 23:00 UTC

31 points

4 comments10 min readLW link

Raising children on the eve of AI

juliawise15 Feb 2024 21:28 UTC

272 points

47 comments5 min readLW link

What’s happening behind the scenes with my HowTruthful project

Bruce Lewis15 Feb 2024 18:27 UTC

7 points

0 comments3 min readLW link

Gemini 1.5 released

Cole Wyeth15 Feb 2024 18:02 UTC

19 points

3 comments1 min readLW link

(blog.google)

AI play for the next 3 years: Lemonade Insurance

Prin (Premek) Paska15 Feb 2024 13:48 UTC

2 points

4 comments1 min readLW link

(docs.google.com)

Collection of Scientific and Other Classifications

niplav15 Feb 2024 12:58 UTC

16 points

0 comments1 min readLW link

“Open Source AI” isn’t Open Source

Davidmanheim15 Feb 2024 8:59 UTC

18 points

16 comments1 min readLW link

(davidmanheim.substack.com)

Research Log, RLLMv3 (GPT2-XL, Phi-1.5 and Falcon-RW-1B)

MiguelDev15 Feb 2024 3:39 UTC

4 points

0 comments262 min readLW link

11 diceware words is enough

DanielFilan and benwr

15 Feb 2024 0:13 UTC

23 points

6 comments1 min readLW link

(threadreaderapp.com)

Searching for Searching for Search

Rubi J. Hudson14 Feb 2024 23:51 UTC

21 points

4 comments7 min readLW link

Some questions for the people at 80,000 Hours

yanni kyriacos14 Feb 2024 23:15 UTC

1 point

0 comments1 min readLW link

(forum.effectivealtruism.org)

Disrupting malicious uses of AI by state-affiliated threat actors

agucova14 Feb 2024 21:28 UTC

11 points

2 comments1 min readLW link

(openai.com)

Critiques of the AI control agenda

Jozdien14 Feb 2024 19:25 UTC

48 points

14 comments9 min readLW link

Bad business advice

Logan Kieller14 Feb 2024 17:01 UTC

12 points

2 comments3 min readLW link

(logankieller.substack.com)

Examples of governments doing good in house (or contracted) technical research

NathanBarnard14 Feb 2024 16:22 UTC

12 points

2 comments2 min readLW link

[Question] How can we legally/illegally enhance the progress of the law of accelerating returns in AI learning?

Gabi QUENE14 Feb 2024 11:06 UTC

−25 points

0 comments1 min readLW link

[Question] What experiment settles the Gary Marcus vs Geoffrey Hinton debate?

Valentin Baltadzhiev14 Feb 2024 9:06 UTC

12 points

8 comments1 min readLW link

[Question] Optimizing for Agency?

Michael Soareverix14 Feb 2024 8:31 UTC

10 points

9 comments2 min readLW link

Requirements for a Basin of Attraction to Alignment

RogerDearnaley14 Feb 2024 7:10 UTC

41 points

12 comments31 min readLW link

FTX expects to return all customer money; clawbacks may go away

Mikhail Samin14 Feb 2024 3:43 UTC

33 points

1 comment1 min readLW link

(www.nytimes.com)

Scale Was All We Needed, At First

Gabe M14 Feb 2024 1:49 UTC

295 points

34 comments8 min readLW link

(aiacumen.substack.com)

CFAR Takeaways: Andrew Critch

Raemon14 Feb 2024 1:37 UTC

217 points

64 comments5 min readLW link

Meetup In a Box: Year In Review

Czynski14 Feb 2024 1:18 UTC

26 points

1 comment4 min readLW link

An EA used deceptive messaging to advance their project; we need mechanisms to avoid deontologically dubious plans

Mikhail Samin13 Feb 2024 23:15 UTC

24 points

1 comment1 min readLW link

Useful starting code for interpretability

eggsyntax13 Feb 2024 23:13 UTC

26 points

2 comments1 min readLW link

Masterpiece

Richard_Ngo13 Feb 2024 23:10 UTC

163 points

21 comments4 min readLW link

(www.narrativeark.xyz)

A Bridge Between Utilitarianism & Stoicism

Jonathan Moregård13 Feb 2024 22:46 UTC

5 points

0 comments5 min readLW link

(honestliving.substack.com)

The “context window” analogy for human minds

Ruby13 Feb 2024 19:29 UTC

38 points

0 comments2 min readLW link

More on the Apple Vision Pro

Zvi13 Feb 2024 17:40 UTC

33 points

5 comments8 min readLW link

(thezvi.wordpress.com)

Linear White

Teja Prabhu13 Feb 2024 16:31 UTC

−3 points

3 comments3 min readLW link

(krez.expert)

Causality is Everywhere

silentbob13 Feb 2024 13:44 UTC

26 points

12 comments8 min readLW link

Technologies and Terminology: AI isn’t Software, it’s… Deepware?

Davidmanheim and abramdemski

13 Feb 2024 13:37 UTC

40 points

10 comments8 min readLW link

[Question] LessWrong Is Very Wrong: Ultimately All Social Media Platforms Are The Same

Amritesh Kumar13 Feb 2024 6:53 UTC

−16 points

2 comments1 min readLW link

Lsusr’s Rationality Dojo

lsusr13 Feb 2024 5:52 UTC

102 points

17 comments2 min readLW link

[Question] Where is the Town Square?

Gretta Duleba13 Feb 2024 3:53 UTC

46 points

8 comments1 min readLW link

My cover story in Jacobin on AI capitalism and the x-risk debates

garrison12 Feb 2024 23:34 UTC

98 points

5 comments1 min readLW link

(jacobin.com)

What is Ontology?

martinkunev12 Feb 2024 23:01 UTC

4 points

0 comments4 min readLW link

Thank you for triggering me

Cissy12 Feb 2024 20:09 UTC

6 points

1 comment6 min readLW link

(www.moremyself.xyz)

Interpreting Quantum Mechanics in Infra-Bayesian Physicalism

Yegreg12 Feb 2024 18:56 UTC

30 points

6 comments43 min readLW link

I played the AI box game as the Gatekeeper — and lost

datawitch12 Feb 2024 18:39 UTC

30 points

53 comments4 min readLW link

The Last Laugh: Exploring the Role of Humor as a Benchmark for Large Language Models

Greg Robison12 Feb 2024 18:34 UTC

4 points

6 comments11 min readLW link

Natural abstractions are observer-dependent: a conversation with John Wentworth

Martín Soto12 Feb 2024 17:28 UTC

39 points

13 comments7 min readLW link

Tort Law Can Play an Important Role in Mitigating AI Risk

Gabriel Weil12 Feb 2024 17:17 UTC

38 points

9 comments5 min readLW link

On the Proposed California SB 1047

Zvi12 Feb 2024 16:40 UTC

46 points

18 comments12 min readLW link

(thezvi.wordpress.com)

Thoughts on “The Offense-Defense Balance Rarely Changes”

Cullen12 Feb 2024 3:26 UTC

46 points

4 comments1 min readLW link

Skepticism About DeepMind’s “Grandmaster-Level” Chess Without Search

Arjun Panickssery12 Feb 2024 0:56 UTC

57 points

13 comments3 min readLW link

[Question] What are the known difficulties with this alignment approach?

tailcalled11 Feb 2024 22:52 UTC

18 points

24 comments1 min readLW link