All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

GÖDEL GOING DOWN

Jimdrix_Hendri6 Mar 2023 23:06 UTC

−9 points

3 comments1 min readLW link

Against ubiquitous alignment taxes

beren6 Mar 2023 19:50 UTC

57 points

10 comments2 min readLW link

Addendum: basic facts about language models during training

beren6 Mar 2023 19:24 UTC

22 points

2 comments5 min readLW link

Understanding The Roots Of Mathematics Before Finding The Roots Of A Function.

LiesLaris6 Mar 2023 18:47 UTC

2 points

0 comments1 min readLW link

Discussion: LLaMA Leak & Whistleblowing in pre-AGI era

jirahim6 Mar 2023 18:47 UTC

1 point

4 comments1 min readLW link

[Question] Are we too confident about unaligned AGI killing off humanity?

RomanS6 Mar 2023 16:19 UTC

21 points

63 comments1 min readLW link

Introducing Leap Labs, an AI interpretability startup

Jessica Rumbelow6 Mar 2023 16:16 UTC

103 points

12 comments1 min readLW link

Monthly Roundup #4: March 2023

Zvi6 Mar 2023 14:10 UTC

31 points

0 comments24 min readLW link

(thezvi.wordpress.com)

Fundamental Uncertainty: Chapter 6 - How can we be certain about the truth?

Gordon Seidoh Worley6 Mar 2023 13:52 UTC

10 points

18 comments16 min readLW link

The idea

JNS6 Mar 2023 13:42 UTC

3 points

0 comments9 min readLW link

Honesty, Openness, Trustworthiness, and Secrets

NormanPerlmutter6 Mar 2023 9:03 UTC

13 points

0 comments9 min readLW link

EA & LW Forum Weekly Summary (27th Feb − 5th Mar 2023)

Zoe Williams6 Mar 2023 3:18 UTC

12 points

0 comments1 min readLW link

The Type II Inner-Compass Theorem

Tristan Miano6 Mar 2023 2:35 UTC

−16 points

0 comments22 min readLW link

AGI’s Impact on Employment

TheUnkown 6 Mar 2023 1:56 UTC

1 point

1 comment1 min readLW link

(www.apricitas.io)

Why did you trash the old HPMOR.com?

AnnoyedReader6 Mar 2023 1:55 UTC

55 points

68 comments2 min readLW link

Cap Model Size for AI Safety

research_prime_space6 Mar 2023 1:11 UTC

0 points

4 comments1 min readLW link

What should we do about network-effect monopolies?

benkuhn6 Mar 2023 0:50 UTC

31 points

7 comments1 min readLW link

(www.benkuhn.net)

Who Aligns the Alignment Researchers?

Ben Smith5 Mar 2023 23:22 UTC

48 points

0 comments11 min readLW link

Startups are like firewood

Adam Zerner5 Mar 2023 23:09 UTC

26 points

2 comments3 min readLW link

A concerning observation from media coverage of AI industry dynamics

Justin Olive5 Mar 2023 21:38 UTC

8 points

3 comments3 min readLW link

Steven Pinker on ChatGPT and AGI (Feb 2023)

Evan R. Murphy5 Mar 2023 21:34 UTC

11 points

8 comments1 min readLW link

(news.harvard.edu)

Is it time to talk about AI doomsday prepping yet?

bokov5 Mar 2023 21:17 UTC

0 points

8 comments1 min readLW link

Coordination explosion before intelligence explosion...?

tailcalled5 Mar 2023 20:48 UTC

47 points

9 comments2 min readLW link

The Ogdoad

Tristan Miano5 Mar 2023 20:01 UTC

−15 points

1 comment37 min readLW link

[Question] What are some good ways to heighten my emotions?

oh543215 Mar 2023 18:06 UTC

5 points

5 comments1 min readLW link

Research proposal: Leveraging Jungian archetypes to create values-based models

MiguelDev5 Mar 2023 17:39 UTC

5 points

2 comments2 min readLW link

Abusing Snap Circuits IC

jefftk5 Mar 2023 17:00 UTC

19 points

3 comments3 min readLW link

(www.jefftk.com)

Do humans derive values from fictitious imputed coherence?

TsviBT5 Mar 2023 15:23 UTC

45 points

8 comments14 min readLW link

The Inner-Compass Theorem

Tristan Miano5 Mar 2023 15:21 UTC

−18 points

12 comments16 min readLW link

Halifax Monthly Meetup: AI Safety Discussion

Ideopunk5 Mar 2023 12:42 UTC

10 points

0 comments1 min readLW link

Why kill everyone?

arisAlexis5 Mar 2023 11:53 UTC

7 points

5 comments2 min readLW link

Selective, Corrective, Structural: Three Ways of Making Social Systems Work

Said Achmiz5 Mar 2023 8:45 UTC

99 points

13 comments2 min readLW link

Substitute goods for leisure are abundant

Adam Zerner5 Mar 2023 3:45 UTC

20 points

7 comments5 min readLW link

[Question] Does polyamory at a workplace turn nepotism up to eleven?

Viliam5 Mar 2023 0:57 UTC

45 points

11 comments2 min readLW link

Why We MUST Build an (aligned) Artificial Superintelligence That Takes Over Human Society—A Thought Experiment

twkaiser5 Mar 2023 0:47 UTC

−13 points

12 comments2 min readLW link

Forecasts on Moore v Harper from Samotsvety

gregjustice5 Mar 2023 0:47 UTC

7 points

0 comments1 min readLW link

(samotsvety.org)

Why Not Just… Build Weak AI Tools For AI Alignment Research?

johnswentworth5 Mar 2023 0:12 UTC

175 points

18 comments6 min readLW link

Consciousness is irrelevant—instead solve alignment by asking this question

Oliver Siegel4 Mar 2023 22:06 UTC

−10 points

6 comments1 min readLW link

More money with less risk: sell services instead of model access

lemonhope4 Mar 2023 20:51 UTC

9 points

3 comments1 min readLW link

Contra “Strong Coherence”

DragonGod4 Mar 2023 20:05 UTC

39 points

24 comments1 min readLW link

The Practitioner’s Path 2.0: A new framework for structured self-improvement

Evenflair4 Mar 2023 19:19 UTC

32 points

2 comments11 min readLW link

(guildoftherose.org)

The Benefits of Distillation in Research

Jonas Hallgren4 Mar 2023 17:45 UTC

15 points

2 comments5 min readLW link

Optimal Music Choice

mbazzani4 Mar 2023 17:26 UTC

5 points

0 comments1 min readLW link

Why don’t more people talk about ecological psychology?

Ppau4 Mar 2023 17:03 UTC

21 points

10 comments7 min readLW link

Switching to Electric Mandolin

jefftk4 Mar 2023 15:40 UTC

16 points

0 comments1 min readLW link

(www.jefftk.com)

Predictive Performance on Metaculus vs. Manifold Markets

nikos4 Mar 2023 8:10 UTC

18 points

0 comments5 min readLW link

Contra Hanson on AI Risk

Liron4 Mar 2023 8:02 UTC

36 points

23 comments8 min readLW link

Bite Sized Tasks

Johannes C. Mayer4 Mar 2023 3:31 UTC

18 points

2 comments2 min readLW link

How popular is ChatGPT? Part 2: slower growth than Pokémon GO

Richard Korzekwa 3 Mar 2023 23:40 UTC

42 points

4 comments6 min readLW link

(aiimpacts.org)

Acausal normalcy

Andrew_Critch3 Mar 2023 23:34 UTC

194 points

36 comments8 min readLW link 1 review