[Question] What are MIRI’s big achievements in AI alignment?

tailcalled · 7 Mar 2023 21:30 UTC
29 points
7 comments · 1 min read · LW link

A Brief Defense of Athleticism

Wofsen · 7 Mar 2023 20:48 UTC
46 points
5 comments · 1 min read · LW link

[Question] How “grifty” is the Foresight Institute? Are they making button soup?

Cedar · 7 Mar 2023 19:43 UTC
7 points
3 comments · 1 min read · LW link

[Question] What’s in your list of unsolved problems in AI alignment?

jacquesthibs · 7 Mar 2023 18:58 UTC
60 points
9 comments · 1 min read · LW link

Introducing AI Alignment Inc., a California public benefit corporation...

TherapistAI · 7 Mar 2023 18:47 UTC
1 point
4 comments · 1 min read · LW link

Abuse in LessWrong and rationalist communities in Bloomberg News

whistleblower67 · 7 Mar 2023 18:45 UTC
1 point
72 comments · 7 min read · LW link
(www.bloomberg.com)

Test post for formatting

Solenoid_Entity · 7 Mar 2023 17:48 UTC
0 points
2 comments · 1 min read · LW link

The Pinnacle

nem · 7 Mar 2023 17:07 UTC
11 points
0 comments · 8 min read · LW link

Podcast Transcript: Daniela and Dario Amodei on Anthropic

remember · 7 Mar 2023 16:47 UTC
46 points
2 comments · 79 min read · LW link
(futureoflife.org)

The View from 30,000 Feet: Preface to the Second EleutherAI Retrospective

7 Mar 2023 16:22 UTC
14 points
0 comments · 4 min read · LW link
(blog.eleuther.ai)

Breaking Rank (Calibration Game)

jenn · 7 Mar 2023 15:40 UTC
11 points
0 comments · 2 min read · LW link

Outrangeous (Calibration Game)

jenn · 7 Mar 2023 15:29 UTC
36 points
3 comments · 9 min read · LW link

[Linkpost] Some high-level thoughts on the DeepMind alignment team’s strategy

7 Mar 2023 11:55 UTC
128 points
13 comments · 5 min read · LW link
(drive.google.com)

Alignment works both ways

Karl von Wendt · 7 Mar 2023 10:41 UTC
23 points
21 comments · 2 min read · LW link

Google’s PaLM-E: An Embodied Multimodal Language Model

SandXbox · 7 Mar 2023 4:11 UTC
87 points
7 comments · 1 min read · LW link
(palm-e.github.io)

GÖDEL GOING DOWN

Jimdrix_Hendri · 6 Mar 2023 23:06 UTC
−9 points
3 comments · 1 min read · LW link

Against ubiquitous alignment taxes

beren · 6 Mar 2023 19:50 UTC
56 points
10 comments · 2 min read · LW link

Addendum: basic facts about language models during training

beren · 6 Mar 2023 19:24 UTC
22 points
2 comments · 5 min read · LW link

Understanding The Roots Of Mathematics Before Finding The Roots Of A Function.

LiesLaris · 6 Mar 2023 18:47 UTC
2 points
0 comments · 1 min read · LW link

Discussion: LLaMA Leak & Whistleblowing in pre-AGI era

jirahim · 6 Mar 2023 18:47 UTC
1 point
4 comments · 1 min read · LW link

[Question] Are we too confident about unaligned AGI killing off humanity?

RomanS · 6 Mar 2023 16:19 UTC
21 points
63 comments · 1 min read · LW link

Introducing Leap Labs, an AI interpretability startup

Jessica Rumbelow · 6 Mar 2023 16:16 UTC
103 points
12 comments · 1 min read · LW link

Monthly Roundup #4: March 2023

Zvi · 6 Mar 2023 14:10 UTC
31 points
0 comments · 24 min read · LW link
(thezvi.wordpress.com)

Fundamental Uncertainty: Chapter 6 - How can we be certain about the truth?

Gordon Seidoh Worley · 6 Mar 2023 13:52 UTC
10 points
18 comments · 16 min read · LW link

The idea

JNS · 6 Mar 2023 13:42 UTC
3 points
0 comments · 9 min read · LW link

Honesty, Openness, Trustworthiness, and Secrets

NormanPerlmutter · 6 Mar 2023 9:03 UTC
13 points
0 comments · 9 min read · LW link

EA & LW Forum Weekly Summary (27th Feb – 5th Mar 2023)

Zoe Williams · 6 Mar 2023 3:18 UTC
12 points
0 comments · 1 min read · LW link

The Type II Inner-Compass Theorem

Tristan Miano · 6 Mar 2023 2:35 UTC
−16 points
0 comments · 22 min read · LW link

AGI’s Impact on Employment

TheUnkown · 6 Mar 2023 1:56 UTC
1 point
1 comment · 1 min read · LW link
(www.apricitas.io)

Why did you trash the old HPMOR.com?

AnnoyedReader · 6 Mar 2023 1:55 UTC
55 points
68 comments · 2 min read · LW link

Cap Model Size for AI Safety

research_prime_space · 6 Mar 2023 1:11 UTC
0 points
4 comments · 1 min read · LW link

What should we do about network-effect monopolies?

benkuhn · 6 Mar 2023 0:50 UTC
31 points
7 comments · 1 min read · LW link
(www.benkuhn.net)

Who Aligns the Alignment Researchers?

Ben Smith · 5 Mar 2023 23:22 UTC
48 points
0 comments · 11 min read · LW link

Startups are like firewood

Adam Zerner · 5 Mar 2023 23:09 UTC
26 points
2 comments · 3 min read · LW link

A concerning observation from media coverage of AI industry dynamics

Justin Olive · 5 Mar 2023 21:38 UTC
8 points
3 comments · 3 min read · LW link

Steven Pinker on ChatGPT and AGI (Feb 2023)

Evan R. Murphy · 5 Mar 2023 21:34 UTC
11 points
8 comments · 1 min read · LW link
(news.harvard.edu)

Is it time to talk about AI doomsday prepping yet?

bokov · 5 Mar 2023 21:17 UTC
0 points
8 comments · 1 min read · LW link

Coordination explosion before intelligence explosion...?

tailcalled · 5 Mar 2023 20:48 UTC
47 points
9 comments · 2 min read · LW link

The Ogdoad

Tristan Miano · 5 Mar 2023 20:01 UTC
−15 points
1 comment · 37 min read · LW link

[Question] What are some good ways to heighten my emotions?

oh54321 · 5 Mar 2023 18:06 UTC
5 points
5 comments · 1 min read · LW link

Research proposal: Leveraging Jungian archetypes to create values-based models

MiguelDev · 5 Mar 2023 17:39 UTC
5 points
2 comments · 2 min read · LW link

Abusing Snap Circuits IC

jefftk · 5 Mar 2023 17:00 UTC
19 points
3 comments · 3 min read · LW link
(www.jefftk.com)

Do humans derive values from fictitious imputed coherence?

TsviBT · 5 Mar 2023 15:23 UTC
45 points
8 comments · 14 min read · LW link

The Inner-Compass Theorem

Tristan Miano · 5 Mar 2023 15:21 UTC
−18 points
12 comments · 16 min read · LW link

Halifax Monthly Meetup: AI Safety Discussion

Ideopunk · 5 Mar 2023 12:42 UTC
10 points
0 comments · 1 min read · LW link

Why kill everyone?

arisAlexis · 5 Mar 2023 11:53 UTC
7 points
5 comments · 2 min read · LW link

Selective, Corrective, Structural: Three Ways of Making Social Systems Work

Said Achmiz · 5 Mar 2023 8:45 UTC
99 points
13 comments · 2 min read · LW link

Substitute goods for leisure are abundant

Adam Zerner · 5 Mar 2023 3:45 UTC
20 points
7 comments · 5 min read · LW link

[Question] Does polyamory at a workplace turn nepotism up to eleven?

Viliam · 5 Mar 2023 0:57 UTC
45 points
11 comments · 2 min read · LW link

Why We MUST Build an (aligned) Artificial Superintelligence That Takes Over Human Society—A Thought Experiment

twkaiser · 5 Mar 2023 0:47 UTC
−13 points
12 comments · 2 min read · LW link