All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

A Library and Tutorial for Factored Cognition with Language Models

stuhlmueller, justin_dan and goodgravy

Sep 28, 2022, 6:15 PM

47 points

0 comments1 min readLW link

Reward IS the Optimization Target

CarnSep 28, 2022, 5:59 PM

−2 points

3 comments5 min readLW link

AI Safety Endgame Stories

Ivan VendrovSep 28, 2022, 4:58 PM

31 points

11 comments11 min readLW link

Will Values and Competition Decouple?

intersticeSep 28, 2022, 4:27 PM

15 points

11 comments17 min readLW link

Georgism in Space

harsimonySep 28, 2022, 4:05 PM

42 points

12 comments4 min readLW link

(harsimony.wordpress.com)

QAPR 3: interpretability-guided training of neural nets

Quintin PopeSep 28, 2022, 4:02 PM

58 points

2 comments10 min readLW link

Strange Loops—Self-Reference from Number Theory to AI

ojorgensenSep 28, 2022, 2:10 PM

19 points

6 comments18 min readLW link

Why I think strong general AI is coming soon

porbySep 28, 2022, 5:40 AM

337 points

141 comments34 min readLW link 1 review

About Q Home

Q HomeSep 28, 2022, 4:56 AM

11 points

4 comments1 min readLW link

[Linkpost] “Intensity and frequency of extreme novel epidemics” by Mariani et al. (2021)

T431Sep 28, 2022, 3:31 AM

10 points

0 comments LW link

Threat-Resistant Bargaining Megapost: Introducing the ROSE Value

DiffractorSep 28, 2022, 1:20 AM

162 points

19 comments53 min readLW link 2 reviews

7 traps that (we think) new alignment researchers often fall into

Orpheus16 and Thomas Larsen

Sep 27, 2022, 11:13 PM

176 points

10 comments4 min readLW link

Failure modes in a shard theory alignment plan

Thomas KwaSep 27, 2022, 10:34 PM

26 points

2 comments7 min readLW link

[Question] Is a PhD necessary to contribute meaningfully to a field?

TrudosKudosSep 27, 2022, 9:27 PM

4 points

7 comments1 min readLW link

Why we’re not founding a human-data-for-alignment org

L Rudolf L and Matt Putz

Sep 27, 2022, 8:14 PM

88 points

6 comments29 min readLW link

(forum.effectivealtruism.org)

A Poorly Planned Loft Bed

jefftkSep 27, 2022, 5:50 PM

9 points

2 comments1 min readLW link

(www.jefftk.com)

Wise Crowd & Democratic Spirit

Hristo ZaykovSep 27, 2022, 5:45 PM

1 point

0 comments2 min readLW link

(www.hristo.blog)

Soft skills for meetups

mingyuanSep 27, 2022, 5:26 PM

49 points

3 comments5 min readLW link

[Question] Enriching Youtube content recommendations

Martín SotoSep 27, 2022, 4:54 PM

8 points

4 comments1 min readLW link

The Onion Test for Personal and Institutional Honesty

chanamessinger and Andrew_Critch

Sep 27, 2022, 3:26 PM

163 points

31 comments3 min readLW link 3 reviews

Book review: “The Heart of the Brain: The Hypothalamus and Its Hormones”

Steven ByrnesSep 27, 2022, 1:20 PM

65 points

3 comments18 min readLW link

My Thoughts on the ML Safety Course

zeshenSep 27, 2022, 1:15 PM

50 points

3 comments17 min readLW link

Summary of ML Safety Course

zeshenSep 27, 2022, 1:05 PM

7 points

0 comments6 min readLW link

Probabilistic reasoning for description and experience

Q HomeSep 27, 2022, 10:57 AM

0 points

0 comments26 min readLW link

A Prince, a Pauper, Power, Panama

Alok SinghSep 27, 2022, 7:10 AM

10 points

0 comments1 min readLW link

(alok.github.io)

Double Asteroid Redirection Test succeeds

sanxiynSep 27, 2022, 6:37 AM

19 points

5 comments1 min readLW link

(twitter.com)

[Question] How would I know if a PhD is the right career path?

Bob GuranSep 27, 2022, 5:49 AM

4 points

4 comments1 min readLW link

Review of Examine.com’s vitamin write-ups

Elizabeth and Martin Bernstorff

Sep 26, 2022, 11:40 PM

60 points

1 comment5 min readLW link

(acesounderglass.com)

D&D.Sci September 2022 Evaluation and Ruleset

abstractapplicSep 26, 2022, 10:19 PM

30 points

5 comments3 min readLW link

[MLSN #5]: Prize Compilation

Dan HSep 26, 2022, 9:55 PM

15 points

1 comment2 min readLW link

Loss of Alignment is not the High-Order Bit for AI Risk

yieldthoughtSep 26, 2022, 9:16 PM

14 points

18 comments2 min readLW link

Inverse Scaling Prize: Round 1 Winners

Ethan Perez and Ian McKenzie

Sep 26, 2022, 7:57 PM

93 points

16 comments4 min readLW link

(irmckenzie.co.uk)

[Question] Does the existence of shared human values imply alignment is “easy”?

MorpheusSep 26, 2022, 6:01 PM

7 points

15 comments1 min readLW link

Meetup: Madison, WI (Oct 8)

svfritzSep 26, 2022, 5:55 PM

1 point

0 comments1 min readLW link

Ambiguity in Prediction Market Resolution is Harmful

aphyerSep 26, 2022, 4:22 PM

69 points

17 comments5 min readLW link

Framery Phone Booth CO2 Accumulation

jefftk26 Sep 2022 16:10 UTC

25 points

0 comments1 min readLW link

(www.jefftk.com)

[Question] How can I remove the launch button from my LW home page?

sudo26 Sep 2022 15:15 UTC

8 points

4 comments1 min readLW link

Brief Notes on Transformers

Adam Jermyn26 Sep 2022 14:46 UTC

48 points

3 comments2 min readLW link

You are Underestimating The Likelihood That Convergent Instrumental Subgoals Lead to Aligned AGI

Mark Neyer26 Sep 2022 14:22 UTC

3 points

6 comments3 min readLW link

Climate-contingent Finance, and A Generalized Mechanism for X-Risk Reduction Financing

John Nay26 Sep 2022 13:23 UTC

0 points

2 comments LW link

Self-Control Secrets of the Puritan Masters

David Hugh-Jones26 Sep 2022 9:04 UTC

67 points

3 comments5 min readLW link

(wyclif.substack.com)

How I buy things when Lightcone wants them fast

Bird Concept26 Sep 2022 5:02 UTC

224 points

21 comments8 min readLW link

Oren’s Field Guide of Bad AGI Outcomes

Eris Discordia26 Sep 2022 4:06 UTC

0 points

0 comments1 min readLW link

On Generality

Eris Discordia26 Sep 2022 4:06 UTC

2 points

0 comments5 min readLW link

Planning a Loft Bed

jefftk26 Sep 2022 0:10 UTC

15 points

15 comments2 min readLW link

(www.jefftk.com)

Becoming Black Boxish

vitaliya25 Sep 2022 23:35 UTC

16 points

0 comments2 min readLW link

Announcing Balsa Research

Zvi25 Sep 2022 22:50 UTC

235 points

64 comments2 min readLW link 1 review

(thezvi.wordpress.com)

[Question] How to learn: Struggle VS Lookup-Table?

Nicholas / Heather Kross25 Sep 2022 21:58 UTC

16 points

2 comments2 min readLW link

An Unexpected GPT-3 Decision in a Simple Gamble

casualphysicsenjoyer25 Sep 2022 16:46 UTC

8 points

4 comments1 min readLW link

“Agency” needs nuance

Evie Cottrell25 Sep 2022 7:40 UTC

23 points

1 comment14 min readLW link