A ranking scale for how severe the side effects of solutions to AI x-risk are

Christopher King

8 Mar 2023 22:53 UTC

3 points

0 comments 2 min read LW link

Progress links and tweets, 2023-03-08

jasoncrawford

8 Mar 2023 20:37 UTC

16 points

0 comments 1 min read LW link

(rootsofprogress.org)

Project “MIRI as a Service”

RomanS

8 Mar 2023 19:22 UTC

42 points

4 comments 1 min read LW link

2022 Survey Results

Screwtape

8 Mar 2023 19:16 UTC

48 points

8 comments 20 min read LW link

Use the Nato Alphabet

Cedar

8 Mar 2023 19:14 UTC

6 points

10 comments 1 min read LW link

LessWrong needs a sage mechanic

lc

8 Mar 2023 18:57 UTC

34 points

5 comments 1 min read LW link

[Question] Mathematical models of Ethics

Victors

8 Mar 2023 17:40 UTC

4 points

2 comments 1 min read LW link

Against LLM Reductionism

Erich_Grunewald

8 Mar 2023 15:52 UTC

140 points

17 comments 18 min read LW link

(www.erichgrunewald.com)

Agency, LLMs and AI Safety—A First Pass

Giulio

8 Mar 2023 15:42 UTC

2 points

0 comments 4 min read LW link

(www.giuliostarace.com)

Why Uncontrollable AI Looks More Likely Than Ever

otto.barten and Roman_Yampolskiy

8 Mar 2023 15:41 UTC

18 points

0 comments 4 min read LW link

(time.com)

Universal Modelers

George3d6

8 Mar 2023 15:39 UTC

6 points

4 comments 20 min read LW link

(epistem.ink)

The Kids are Not Okay

Zvi

8 Mar 2023 13:30 UTC

85 points

43 comments 32 min read LW link

(thezvi.wordpress.com)

Alignment Targets and The Natural Abstraction Hypothesis

Stephen Fowler

8 Mar 2023 11:45 UTC

10 points

0 comments 3 min read LW link

Computer Input Sucks—A Brain Dump

Johannes C. Mayer

8 Mar 2023 11:06 UTC

14 points

11 comments 3 min read LW link

Under-Appreciated Ways to Use Flashcards—Part II

Florence Hinder

8 Mar 2023 9:54 UTC

25 points

6 comments 4 min read LW link

(blog.thoughtsaver.com)

Squeezing foundations research assistance out of formal logic narrow AI.

Donald Hobson

8 Mar 2023 9:38 UTC

16 points

1 comment 2 min read LW link

Monthly Shorts 1&2/23

Celer

8 Mar 2023 7:10 UTC

9 points

0 comments 2 min read LW link

(keller.substack.com)

Chapter 1: Pursuing Understanding

Xavier Shrier

8 Mar 2023 6:40 UTC

2 points

0 comments 10 min read LW link

[Question] Is religion locally correct for consequentialists in some instances?

Robert Feinstein

8 Mar 2023 4:02 UTC

4 points

8 comments 1 min read LW link

A Polemic

Wofsen

8 Mar 2023 3:51 UTC

−15 points

1 comment 1 min read LW link

AI Safety in a World of Vulnerable Machine Learning Systems

AdamGleave and EuanMcLean

8 Mar 2023 2:40 UTC

70 points

28 comments 29 min read LW link

(far.ai)

[Question] Educating people about rationality: where are we?

plurple

8 Mar 2023 1:59 UTC

5 points

3 comments 1 min read LW link

[Question] What are MIRI’s big achievements in AI alignment?

tailcalled

7 Mar 2023 21:30 UTC

29 points

7 comments 1 min read LW link

A Brief Defense of Athleticism

Wofsen

7 Mar 2023 20:48 UTC

46 points

5 comments 1 min read LW link

[Question] How “grifty” is the Foresight Institute? Are they making button soup?

Cedar

7 Mar 2023 19:43 UTC

7 points

3 comments 1 min read LW link

[Question] What’s in your list of unsolved problems in AI alignment?

jacquesthibs

7 Mar 2023 18:58 UTC

60 points

9 comments 1 min read LW link

Introducing AI Alignment Inc., a California public benefit corporation...

TherapistAI

7 Mar 2023 18:47 UTC

1 point

4 comments 1 min read LW link

Abuse in LessWrong and rationalist communities in Bloomberg News

whistleblower67

7 Mar 2023 18:45 UTC

1 point

72 comments 7 min read LW link

(www.bloomberg.com)

Test post for formatting

Solenoid_Entity

7 Mar 2023 17:48 UTC

0 points

2 comments 1 min read LW link

The Pinnacle

nem

7 Mar 2023 17:07 UTC

11 points

0 comments 8 min read LW link

Podcast Transcript: Daniela and Dario Amodei on Anthropic

remember

7 Mar 2023 16:47 UTC

46 points

2 comments 79 min read LW link

(futureoflife.org)

The View from 30,000 Feet: Preface to the Second EleutherAI Retrospective

StellaAthena, Curtis Huebner and Shivanshu Purohit

7 Mar 2023 16:22 UTC

14 points

0 comments 4 min read LW link

(blog.eleuther.ai)

Breaking Rank (Calibration Game)

jenn

7 Mar 2023 15:40 UTC

11 points

0 comments 2 min read LW link

Outrangeous (Calibration Game)

jenn

7 Mar 2023 15:29 UTC

36 points

3 comments 9 min read LW link

[Linkpost] Some high-level thoughts on the DeepMind alignment team’s strategy

Vika and Rohin Shah

7 Mar 2023 11:55 UTC

128 points

13 comments 5 min read LW link

(drive.google.com)

Alignment works both ways

Karl von Wendt

7 Mar 2023 10:41 UTC

23 points

21 comments 2 min read LW link

Google’s PaLM-E: An Embodied Multimodal Language Model

SandXbox

7 Mar 2023 4:11 UTC

87 points

7 comments 1 min read LW link

(palm-e.github.io)

GÖDEL GOING DOWN

Jimdrix_Hendri

6 Mar 2023 23:06 UTC

−9 points

3 comments 1 min read LW link

Against ubiquitous alignment taxes

beren

6 Mar 2023 19:50 UTC

56 points

10 comments 2 min read LW link

Addendum: basic facts about language models during training

beren

6 Mar 2023 19:24 UTC

22 points

2 comments 5 min read LW link

Understanding The Roots Of Mathematics Before Finding The Roots Of A Function.

LiesLaris

6 Mar 2023 18:47 UTC

2 points

0 comments 1 min read LW link

Discussion: LLaMA Leak & Whistleblowing in pre-AGI era

jirahim

6 Mar 2023 18:47 UTC

1 point

4 comments 1 min read LW link

[Question] Are we too confident about unaligned AGI killing off humanity?

RomanS

6 Mar 2023 16:19 UTC

21 points

63 comments 1 min read LW link

Introducing Leap Labs, an AI interpretability startup

Jessica Rumbelow

6 Mar 2023 16:16 UTC

103 points

12 comments 1 min read LW link

Monthly Roundup #4: March 2023

Zvi

6 Mar 2023 14:10 UTC

31 points

0 comments 24 min read LW link

(thezvi.wordpress.com)

Fundamental Uncertainty: Chapter 6 - How can we be certain about the truth?

Gordon Seidoh Worley

6 Mar 2023 13:52 UTC

10 points

18 comments 16 min read LW link

The idea

JNS

6 Mar 2023 13:42 UTC

3 points

0 comments 9 min read LW link

Honesty, Openness, Trustworthiness, and Secrets

NormanPerlmutter

6 Mar 2023 9:03 UTC

13 points

0 comments 9 min read LW link

EA & LW Forum Weekly Summary (27th Feb − 5th Mar 2023)

Zoe Williams

6 Mar 2023 3:18 UTC

12 points

0 comments 1 min read LW link

The Type II Inner-Compass Theorem

Tristan Miano

6 Mar 2023 2:35 UTC

−16 points

0 comments 22 min read LW link