Close the Gates to an Inhuman Future: How and why we should choose to not develop superhuman general-purpose artificial intelligence

aaguirre 9 Feb 2024 20:25 UTC

13 points

0 comments 1 min read LW link

(arxiv.org)

[Crosspost] Deep Dive: The Coming Technological Singularity—How to survive in a Post-human Era

simulacra.exe 9 Feb 2024 18:49 UTC

2 points

2 comments 9 min read LW link

The Ideal Speech Situation as a Tool for AI Ethical Reflection: A Framework for Alignment

kenneth myers 9 Feb 2024 18:40 UTC

6 points

12 comments 3 min read LW link

What’s ChatGPT’s Favorite Ice Cream Flavor? An Investigation Into Synthetic Respondents

Greg Robison 9 Feb 2024 18:38 UTC

19 points

4 comments 15 min read LW link

OpenAI wants to raise 5-7 trillion

O O 9 Feb 2024 16:15 UTC

13 points

29 comments 1 min read LW link

(decrypt.co)

[Question] Constituency-sized AI congress?

Nathan Helm-Burger 9 Feb 2024 16:01 UTC

11 points

5 comments 1 min read LW link

One True Love

Zvi 9 Feb 2024 15:10 UTC

34 points

7 comments 10 min read LW link

(thezvi.wordpress.com)

[Question] Executive function advice from people who are good at it?

TeaTieAndHat 9 Feb 2024 10:11 UTC

7 points

1 comment 1 min read LW link

[Question] Do you want to make an AI Alignment song?

Kabir Kumar 9 Feb 2024 8:22 UTC

4 points

0 comments 1 min read LW link

Skills I’d like my collaborators to have

Raemon 9 Feb 2024 8:20 UTC

106 points

9 comments 8 min read LW link

Transfer learning and generalization-qua-capability in Babbage and Davinci (or, why division is better than Spanish)

RP and agg

9 Feb 2024 7:00 UTC

50 points

6 comments 3 min read LW link

Biden-Harris Administration Announces First-Ever Consortium Dedicated to AI Safety

Ben Smith 9 Feb 2024 6:40 UTC

22 points

0 comments 1 min read LW link

(www.nist.gov)

Running the Numbers on a Heat Pump

jefftk 9 Feb 2024 3:00 UTC

30 points

12 comments 4 min read LW link

(www.jefftk.com)

[Question] How do high-trust societies form?

Shankar Sivarajan 9 Feb 2024 1:11 UTC

22 points

17 comments 1 min read LW link

[Question] How do health systems work in adequate worlds?

mukashi 9 Feb 2024 0:54 UTC

10 points

2 comments 1 min read LW link

Twin Cities ACX Meetup—February 2024

Timothy M. 8 Feb 2024 23:26 UTC

1 point

2 comments 1 min read LW link

A review of “Don’t forget the boundary problem...”

jessicata 8 Feb 2024 23:19 UTC

12 points

1 comment 12 min read LW link

(unstablerontology.substack.com)

aintelope project update

Gunnar_Zarncke 8 Feb 2024 18:32 UTC

24 points

2 comments 3 min read LW link

Updatelessness doesn’t solve most problems

Martín Soto 8 Feb 2024 17:30 UTC

130 points

45 comments 12 min read LW link

Predicting Alignment Award Winners Using ChatGPT 4

Shoshannah Tekofsky 8 Feb 2024 14:38 UTC

16 points

2 comments 11 min read LW link

AI #50: The Most Dangerous Thing

Zvi 8 Feb 2024 14:30 UTC

53 points

4 comments 24 min read LW link

(thezvi.wordpress.com)

How to develop a photographic memory 3/3

PhilosophicalSoul 8 Feb 2024 9:22 UTC

6 points

2 comments 18 min read LW link

Believing In

AnnaSalamon 8 Feb 2024 7:06 UTC

230 points

51 comments 13 min read LW link

Measuring pre-peer-review epistemic status

Jakub Smékal 8 Feb 2024 5:09 UTC

1 point

0 comments 2 min read LW link

A Chess-GPT Linear Emergent World Representation

Adam Karvonen 8 Feb 2024 4:25 UTC

105 points

14 comments 7 min read LW link

(adamkarvonen.github.io)

Domestic Production vs International Wealth Creation

100YearPants 8 Feb 2024 4:25 UTC

1 point

0 comments 1 min read LW link

Conditional prediction markets are evidential, not causal

philh 7 Feb 2024 21:52 UTC

55 points

10 comments 2 min read LW link

A Back-Of-The-Envelope Calculation On How Unlikely The Circumstantial Evidence Around Covid-19 Is

Roko 7 Feb 2024 21:49 UTC

−1 points

36 comments 5 min read LW link

Nitric oxide for covid and other viral infections

Elizabeth 7 Feb 2024 21:30 UTC

39 points

6 comments 6 min read LW link

(acesounderglass.com)

Debating with More Persuasive LLMs Leads to More Truthful Answers

Akbir Khan, John Hughes, Dan Valentine, Sam Bowman and Ethan Perez

7 Feb 2024 21:28 UTC

88 points

14 comments 9 min read LW link

(arxiv.org)

[Question] Choosing a book on causality

martinkunev 7 Feb 2024 21:16 UTC

4 points

3 comments 1 min read LW link

More Hyphenation

Arjun Panickssery 7 Feb 2024 19:43 UTC

87 points

19 comments 1 min read LW link

(arjunpanickssery.substack.com)

Reading writing advice doesn’t make writing easier

Henry Sleight 7 Feb 2024 19:14 UTC

17 points

0 comments 5 min read LW link

(open.substack.com)

[Question] What’s this 3rd secret directive of evolution called? (survive & spread & ___)

lemonhope 7 Feb 2024 14:11 UTC

10 points

11 comments 1 min read LW link

Training of superintelligence is secretly adversarial

quetzal_rainbow 7 Feb 2024 13:38 UTC

15 points

2 comments 5 min read LW link

The Math of Suspicious Coincidences

Roko 7 Feb 2024 13:32 UTC

24 points

3 comments 4 min read LW link

[Question] How to deal with the sense of demotivation that comes from thinking about determinism?

SpectrumDT 7 Feb 2024 10:53 UTC

13 points

71 comments 1 min read LW link

Quantum Darwinism, social constructs, and the scientific method

pchvykov 7 Feb 2024 7:04 UTC

6 points

12 comments 9 min read LW link

Why I think it’s net harmful to do technical safety research at AGI labs

Remmelt 7 Feb 2024 4:17 UTC

26 points

24 comments 1 min read LW link

story-based decision-making

bhauth 7 Feb 2024 2:35 UTC

89 points

11 comments 4 min read LW link

Full Driving Engagement Optional

jefftk 7 Feb 2024 2:30 UTC

14 points

0 comments 1 min read LW link

(www.jefftk.com)

How to train your own “Sleeper Agents”

evhub 7 Feb 2024 0:31 UTC

91 points

11 comments 2 min read LW link

My guess at Conjecture’s vision: triggering a narrative bifurcation

Alexandre Variengien 6 Feb 2024 19:10 UTC

75 points

12 comments 16 min read LW link

Arrogance and People Pleasing

Jonathan Moregård 6 Feb 2024 18:43 UTC

26 points

7 comments 4 min read LW link

(honestliving.substack.com)

What does davidad want from «boundaries»?

Chipmonk and davidad

6 Feb 2024 17:45 UTC

44 points

1 comment 5 min read LW link

[Question] How can I efficiently read all the Dath Ilan worldbuilding?

mike_hawke 6 Feb 2024 16:52 UTC

10 points

1 comment 1 min read LW link

Preventing model exfiltration with upload limits

ryan_greenblatt 6 Feb 2024 16:29 UTC

69 points

22 comments 14 min read LW link

Evolution is an observation, not a process

Neil 6 Feb 2024 14:49 UTC

8 points

11 comments 5 min read LW link

[Question] Why do we need an understanding of the real world to predict the next tokens in a body of text?

Valentin Baltadzhiev 6 Feb 2024 14:43 UTC

2 points

12 comments 1 min read LW link

On the Debate Between Jezos and Leahy

Zvi 6 Feb 2024 14:40 UTC

64 points

6 comments 63 min read LW link

(thezvi.wordpress.com)