All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 151617 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Thoughts on Hardware limits to Prevent AGI?

jrincayc15 Oct 2023 23:45 UTC

4 points

1 comment9 min readLW link

[Question] Training a RL Model with Continuous State & Action Space in a Real-World Scenario

Alexander Ries15 Oct 2023 22:59 UTC

0 points

0 comments1 min readLW link

On Frequentism and Bayesian Dogma

DanielFilan and Adrià Garriga-alonso

15 Oct 2023 22:23 UTC

59 points

27 comments6 min readLW link

More or Fewer Fights over Principles and Values?

Ben Pace and Vaniver

15 Oct 2023 21:35 UTC

24 points

10 comments14 min readLW link

Mapping ChatGPT’s ontological landscape, gradients and choices [interpretability]

Bill Benzon15 Oct 2023 20:12 UTC

1 point

0 comments18 min readLW link

The Hidden Perils of Hydrogen

Yudhister Kumar15 Oct 2023 19:51 UTC

17 points

3 comments3 min readLW link

(ykumar.org)

Arguments for optimism on AI Alignment (I don’t endorse this version, will reupload a new version soon.)

Noosphere8915 Oct 2023 14:51 UTC

28 points

130 comments25 min readLW link

Hyperreals in a Nutshell

Yudhister Kumar15 Oct 2023 14:23 UTC

35 points

27 comments5 min readLW link

(ykumar.org)

Discovering Latent Knowledge in the Human Brain: Part 1 – Clarifying the concepts of belief and knowledge

Joseph Emerson15 Oct 2023 9:02 UTC

5 points

0 comments12 min readLW link

[Question] Rationalist horror movies

Elizabeth15 Oct 2023 7:42 UTC

46 points

35 comments1 min readLW link

Unity Gridworlds

WillPetillo15 Oct 2023 4:36 UTC

9 points

0 comments1 min readLW link

In memory of Louise Glück

Joe Carlsmith15 Oct 2023 2:59 UTC

41 points

1 comment8 min readLW link

Book Review: Invisible China

Yudhister Kumar14 Oct 2023 21:51 UTC

4 points

0 comments4 min readLW link

(ykumar.org)

Book Review: Radical Markets

Yudhister Kumar14 Oct 2023 21:41 UTC

11 points

0 comments15 min readLW link

(ykumar.org)

[Question] One-on-one tutoring for any subject

yakimoff14 Oct 2023 20:58 UTC

8 points

5 comments1 min readLW link

The Puritans would one-box: evidential decision theory in the 17th century

Jacob G-W14 Oct 2023 20:23 UTC

86 points

5 comments3 min readLW link

(jacobgw.com)

Natural Abstraction: Convergent Preferences Over Information Structures

paulom14 Oct 2023 18:34 UTC

13 points

1 comment36 min readLW link

ChatGPT tells 20 versions of its prototypical story, with a short note on method

Bill Benzon14 Oct 2023 15:27 UTC

6 points

0 comments5 min readLW link

Will no one rid me of this turbulent pest?

Metacelsus14 Oct 2023 15:27 UTC

154 points

23 comments10 min readLW link

(denovo.substack.com)

Which Anaesthetic To Choose?

dadadarren14 Oct 2023 14:55 UTC

10 points

15 comments1 min readLW link

Is the Wave non-disparagement thingy okay?

Ruby, Linch and Auckland

14 Oct 2023 5:31 UTC

29 points

13 comments11 min readLW link

The Gods of Straight Lines

Richard_Ngo14 Oct 2023 4:10 UTC

64 points

13 comments5 min readLW link

(www.narrativeark.xyz)

Eight Magic Lamps

Richard_Ngo14 Oct 2023 4:10 UTC

39 points

0 comments6 min readLW link

(www.narrativeark.xyz)

RSPs are pauses done right

evhub14 Oct 2023 4:06 UTC

164 points

70 comments7 min readLW link

Dishonorable Gossip and Going Crazy

Ben Pace and Unreal

14 Oct 2023 4:00 UTC

29 points

31 comments23 min readLW link

Disentangling Our Terminal and Instrumental Values

PeterMcCluskey14 Oct 2023 3:35 UTC

11 points

1 comment4 min readLW link

(bayesianinvestor.com)

Global Pause AI Protest 10/21

Holly_Elmore, Joseph Miller and joepio

14 Oct 2023 3:20 UTC

5 points

0 comments1 min readLW link

[Question] Literature On Existential Risk From Atmospheric Contamination?

Yitz13 Oct 2023 22:27 UTC

6 points

3 comments1 min readLW link

How to partition teams to move fast? Debating “low-dimensional cuts”

jacobjacob and kave

13 Oct 2023 21:43 UTC

41 points

2 comments11 min readLW link

Gothenburg LW / ACX meetup

Stefan13 Oct 2023 21:39 UTC

2 points

0 comments1 min readLW link

Meta-Regulations

Sable13 Oct 2023 21:23 UTC

18 points

5 comments10 min readLW link

(affablyevil.substack.com)

Hiring: Lighthaven Events & Venue Lead

Raemon13 Oct 2023 21:02 UTC

68 points

2 comments4 min readLW link

Prediction markets covered in the NYT podcast “Hard Fork”

Austin Chen13 Oct 2023 18:43 UTC

56 points

6 comments1 min readLW link

(www.nytimes.com)

[Paper] All’s Fair In Love And Love: Copy Suppression in GPT-2 Small

CallumMcDougall, Arthur Conmy, starship006, Tom McGrath and Neel Nanda

13 Oct 2023 18:32 UTC

82 points

4 comments8 min readLW link

[Question] Intelligence Enhancement (Monthly Thread) 13 Oct 2023

Nicholas / Heather Kross13 Oct 2023 17:28 UTC

52 points

40 comments1 min readLW link

FLI podcast series, “Imagine A World”, about aspirational futures with AGI

Jackson Wagner13 Oct 2023 16:07 UTC

9 points

0 comments4 min readLW link

To open-source or to not open-source, that is (an oversimplification of) the question.

Justin Bullock13 Oct 2023 15:10 UTC

12 points

5 comments5 min readLW link

Combination Lock Boxes

jefftk13 Oct 2023 12:50 UTC

17 points

9 comments1 min readLW link

(www.jefftk.com)

Circle of Support (Oct 14th @ 10am PST)

Alexei13 Oct 2023 9:24 UTC

19 points

1 comment1 min readLW link

[Question] How can the world handle the HAMAS situation?

Annapurna13 Oct 2023 9:15 UTC

5 points

43 comments1 min readLW link

UVic AI Ethics Conference

TristanTrim and Leo Mckee-Reid

13 Oct 2023 7:31 UTC

3 points

1 comment1 min readLW link

LW UI features you might not have tried

Elizabeth13 Oct 2023 3:04 UTC

46 points

6 comments1 min readLW link

Revisiting Guide Dogs and Blindness Prevention

jefftk13 Oct 2023 2:30 UTC

22 points

0 comments2 min readLW link

(www.jefftk.com)

Paper: Understanding and Controlling a Maze-Solving Policy Network

TurnTrout, Ulisse Mini, peligrietzer, mrinank_sharma, Austin Meek, Monte M and lisathiergart

13 Oct 2023 1:38 UTC

70 points

0 comments1 min readLW link

(arxiv.org)

OPTIC: Announcing Intercollegiate Forecasting Tournaments in SF, DC, Boston

Saul Munn, Jingyi Wang and Tom Shlomi

13 Oct 2023 1:36 UTC

6 points

0 comments1 min readLW link

Progress links digest, 2023-10-12: Dyson sphere thermodynamics and a cure for cavities

jasoncrawford13 Oct 2023 0:41 UTC

14 points

1 comment10 min readLW link

(rootsofprogress.org)

What do Marginal Grants at EAIF Look Like? Funding Priorities and Grantmaking Thresholds at the EA Infrastructure Fund

Linch12 Oct 2023 21:40 UTC

20 points

0 comments1 min readLW link

unRLHF—Efficiently undoing LLM safeguards

Pranav Gade, Jeffrey Ladish and Simon Lermen

12 Oct 2023 19:58 UTC

117 points

15 comments20 min readLW link

LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B

Simon Lermen and Jeffrey Ladish

12 Oct 2023 19:58 UTC

151 points

29 comments14 min readLW link

[Question] Looking for reading recommendations: Theories of right/justice that safeguard against having one’s job automated?

bulKlub12 Oct 2023 19:40 UTC

−1 points

1 comment1 min readLW link