AGIs may value intrinsic rewards more than extrinsic ones | catubc | Nov 17, 2022, 9:49 PM | 8 points | 6 comments | 4 min read | LW link
LLMs may capture key components of human agency | catubc | Nov 17, 2022, 8:14 PM | 27 points | 0 comments | 4 min read | LW link
Mastodon Replies as Comments | jefftk | Nov 17, 2022, 8:10 PM | 20 points | 0 comments | 1 min read | LW link | (www.jefftk.com)
Announcing the Progress Forum | jasoncrawford | Nov 17, 2022, 7:26 PM | 83 points | 9 comments | 1 min read | LW link
[Question] What kind of bias is this? | Daniel Samuel | Nov 17, 2022, 6:44 PM | 3 points | 2 comments | 1 min read | LW link
AI Forecasting Research Ideas | Jsevillamol | Nov 17, 2022, 5:37 PM | 21 points | 2 comments | LW link
Results from the interpretability hackathon | Esben Kran and Neel Nanda | Nov 17, 2022, 2:51 PM | 81 points | 0 comments | 6 min read | LW link | (alignmentjam.com)
Covid 11/17/22: Slow Recovery | Zvi | Nov 17, 2022, 2:50 PM | 33 points | 3 comments | 4 min read | LW link | (thezvi.wordpress.com)
Sadly, FTX | Zvi | Nov 17, 2022, 2:30 PM | 133 points | 18 comments | 47 min read | LW link | (thezvi.wordpress.com)
Deontology and virtue ethics as “effective theories” of consequentialist ethics | Jan_Kulveit | Nov 17, 2022, 2:11 PM | 68 points | 9 comments | LW link | 1 review
The Ground Truth Problem (Or, Why Evaluating Interpretability Methods Is Hard) | Jessica Rumbelow | Nov 17, 2022, 11:06 AM | 27 points | 2 comments | 2 min read | LW link
[Question] [Personal Question] Can anyone help me navigate this potentially painful interpersonal dynamic rationally? | SlainLadyMondegreen | Nov 17, 2022, 8:53 AM | 9 points | 3 comments | 4 min read | LW link
Massive Scaling Should be Frowned Upon | harsimony | Nov 17, 2022, 8:43 AM | 4 points | 6 comments | 5 min read | LW link
[Question] Why are profitable companies laying off staff? | Yair Halberstadt | Nov 17, 2022, 6:19 AM | 15 points | 10 comments | 1 min read | LW link
Discussion: Was SBF a naive utilitarian, or a sociopath? | Nicholas / Heather Kross | Nov 17, 2022, 2:52 AM | 0 points | 4 comments | LW link
Kelsey Piper’s recent interview of SBF | agucova | Nov 16, 2022, 8:30 PM | 51 points | 29 comments | LW link
The Echo Principle | Jonathan Moregård | Nov 16, 2022, 8:09 PM | 4 points | 0 comments | 3 min read | LW link | (honestliving.substack.com)
[Question] Is there some reason LLMs haven’t seen broader use? | tailcalled | Nov 16, 2022, 8:04 PM | 25 points | 27 comments | 1 min read | LW link
When should we be surprised that an invention took “so long”? | jasoncrawford | Nov 16, 2022, 8:04 PM | 32 points | 11 comments | 4 min read | LW link | (rootsofprogress.org)
Questions about Value Lock-in, Paternalism, and Empowerment | Sam F. Brown | Nov 16, 2022, 3:33 PM | 13 points | 2 comments | 12 min read | LW link | (sambrown.eu)
If Professional Investors Missed This... | jefftk | Nov 16, 2022, 3:00 PM | 37 points | 18 comments | 3 min read | LW link | (www.jefftk.com)
Disagreement with bio anchors that lead to shorter timelines | Marius Hobbhahn | Nov 16, 2022, 2:40 PM | 75 points | 17 comments | 7 min read | LW link | 1 review
Current themes in mechanistic interpretability research | Lee Sharkey, Sid Black and beren | Nov 16, 2022, 2:14 PM | 89 points | 2 comments | 12 min read | LW link
Unpacking “Shard Theory” as Hunch, Question, Theory, and Insight | Jacy Reese Anthis | Nov 16, 2022, 1:54 PM | 31 points | 9 comments | 2 min read | LW link
Miracles and why not to believe them | mruwnik | Nov 16, 2022, 12:07 PM | 4 points | 0 comments | 2 min read | LW link
[Question] How do people do remote research collaborations effectively? | Krieger | Nov 16, 2022, 11:51 AM | 8 points | 0 comments | 1 min read | LW link
Method of statements: an alternative to taboo | Q Home | Nov 16, 2022, 10:57 AM | 7 points | 0 comments | 41 min read | LW link
The two conceptions of Active Inference: an intelligence architecture and a theory of agency | Roman Leventov | Nov 16, 2022, 9:30 AM | 17 points | 0 comments | 4 min read | LW link
Developer experience for the motivation | Adam Zerner | Nov 16, 2022, 7:12 AM | 49 points | 7 comments | 4 min read | LW link
Progress links and tweets, 2022-11-15 | jasoncrawford | Nov 16, 2022, 3:21 AM | 9 points | 0 comments | 2 min read | LW link | (rootsofprogress.org)
EA & LW Forums Weekly Summary (7th Nov − 13th Nov 22′) | Zoe Williams | Nov 16, 2022, 3:04 AM | 19 points | 0 comments | LW link
The FTX Saga—Simplified | Annapurna | Nov 16, 2022, 2:42 AM | 44 points | 10 comments | 7 min read | LW link | (jorgevelez.substack.com)
Utilitarianism and the idea of a “rational agent” are fundamentally inconsistent with reality | banev | Nov 16, 2022, 12:19 AM | −4 points | 1 comment | 1 min read | LW link
[Question] Is the speed of training large models going to increase significantly in the near future due to Cerebras Andromeda? | Amal | Nov 15, 2022, 10:50 PM | 13 points | 11 comments | 1 min read | LW link
[Question] What is our current best infohazard policy for AGI (safety) research? | Roman Leventov | Nov 15, 2022, 10:33 PM | 12 points | 2 comments | 1 min read | LW link
ACX/SSC Meetup 1 pm Sunday Nov 20 | svfritz | Nov 15, 2022, 8:39 PM | 2 points | 0 comments | 1 min read | LW link
SBF x LoL | Nicholas / Heather Kross | Nov 15, 2022, 8:24 PM | 17 points | 6 comments | LW link
Some research ideas in forecasting | Jsevillamol | Nov 15, 2022, 7:47 PM | 35 points | 2 comments | LW link
Strategy of Inner Conflict | Jonathan Moregård | Nov 15, 2022, 7:38 PM | 9 points | 4 comments | 6 min read | LW link | (honestliving.substack.com)
The limited upside of interpretability | Peter S. Park | Nov 15, 2022, 6:46 PM | 13 points | 11 comments | LW link
Why bet Kelly? | AlexMennen | Nov 15, 2022, 6:12 PM | 32 points | 14 comments | 5 min read | LW link
Entropy Scaling And Intrinsic Memory | Alexander Gietelink Oldenziel and Adam Shai | Nov 15, 2022, 6:11 PM | 20 points | 5 comments | 5 min read | LW link
[Question] Will nanotech/biotech be what leads to AI doom? | tailcalled | Nov 15, 2022, 5:38 PM | 4 points | 9 comments | 2 min read | LW link
Value Formation: An Overarching Model | Thane Ruthenis | 15 Nov 2022, 17:16 UTC | 34 points | 20 comments | 34 min read | LW link
Internal communication framework | rosehadshar and Nora_Ammann | 15 Nov 2022, 12:41 UTC | 38 points | 13 comments | 12 min read | LW link
Better Mastodon Aliases | jefftk | 15 Nov 2022, 12:10 UTC | 14 points | 3 comments | 1 min read | LW link | (www.jefftk.com)
The economy as an analogy for advanced AI systems | rosehadshar and particlemania | 15 Nov 2022, 11:16 UTC | 28 points | 0 comments | 5 min read | LW link
We need better prediction markets | eigen | 15 Nov 2022, 4:54 UTC | 9 points | 8 comments | 1 min read | LW link
Preventing, reversing, and addressing data leakage: some thoughts | VipulNaik | 15 Nov 2022, 2:09 UTC | 14 points | 4 comments | 25 min read | LW link
Winners of the AI Safety Nudge Competition | Marc Carauleanu | 15 Nov 2022, 1:06 UTC | 4 points | 0 comments | LW link