All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 234 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

VC Theory Overview

Joar Skalse2 Jul 2023 22:45 UTC

11 points

2 comments11 min readLW link

Sources of evidence in Alignment

Martín Soto2 Jul 2023 20:38 UTC

20 points

0 comments11 min readLW link

Quantitative cruxes in Alignment

Martín Soto2 Jul 2023 20:38 UTC

19 points

0 comments23 min readLW link

Going Crazy and Getting Better Again

Evenstar2 Jul 2023 18:55 UTC

139 points

13 comments7 min readLW link 1 review

Shall We Throw A Huge Party Before AGI Bids Us Adieu?

GeorgeMan2 Jul 2023 17:56 UTC

−1 points

6 comments1 min readLW link

Why it’s so hard to talk about Consciousness

Rafael Harth2 Jul 2023 15:56 UTC

131 points

159 comments9 min readLW link 1 review

How Smart Are Humans?

Joar Skalse2 Jul 2023 15:46 UTC

9 points

19 comments2 min readLW link

Through a panel, darkly: a case study in internet BS detection

jchan2 Jul 2023 13:40 UTC

22 points

7 comments3 min readLW link

LLMs, Batches, and Emergent Episodic Memory

Lao Mein2 Jul 2023 7:55 UTC

5 points

4 comments1 min readLW link

Negativity enhances positivity

Adam Zerner2 Jul 2023 2:47 UTC

12 points

7 comments2 min readLW link

faster latent diffusion

bhauth2 Jul 2023 1:30 UTC

10 points

8 comments2 min readLW link

(www.bhauth.com)

Using (Uninterpretable) LLMs to Generate Interpretable AI Code

Joar Skalse2 Jul 2023 1:01 UTC

13 points

12 comments3 min readLW link

Grant applications and grand narratives

Elizabeth2 Jul 2023 0:16 UTC

191 points

22 comments6 min readLW link

An Introduction, an Overview of my personal resources, and how one might make use of them

ProofBySonnet1 Jul 2023 21:00 UTC

4 points

6 comments3 min readLW link

My “2.9 trauma limit”

Raemon1 Jul 2023 19:32 UTC

193 points

31 comments7 min readLW link

Alpha

Erich_Grunewald1 Jul 2023 16:05 UTC

65 points

2 comments14 min readLW link

(www.erichgrunewald.com)

Forum Karma: view stats and find highly-rated comments for any LW user

Max H1 Jul 2023 15:36 UTC

60 points

16 comments2 min readLW link

(forumkarma.com)

[ASoT] GPT2 Steering & The Tuned Lens

Ulisse Mini1 Jul 2023 14:12 UTC

23 points

0 comments2 min readLW link

[Linkpost] A shared linguistic space for transmitting our thoughts from brain to brain in natural conversations

Bogdan Ionut Cirstea1 Jul 2023 13:57 UTC

17 points

2 comments1 min readLW link

Elements of Computational Philosophy, Vol. I: Truth

Paul Bricman and Tom Feeney

1 Jul 2023 11:44 UTC

12 points

6 comments1 min readLW link

(compphil.github.io)

Micro Habits that Improve One’s Day

silentbob1 Jul 2023 10:53 UTC

62 points

9 comments5 min readLW link

Ateliers: But what is an Atelier?

Stephen Fowler1 Jul 2023 5:57 UTC

4 points

2 comments10 min readLW link

Predicting: Quick Start

duck_master1 Jul 2023 3:43 UTC

9 points

3 comments14 min readLW link

EA/LW/SSC Argentina Group!

daviddelauba1 Jul 2023 2:47 UTC

1 point

0 comments1 min readLW link

Despedida a Pablo Stafforini

daviddelauba1 Jul 2023 2:44 UTC

1 point

0 comments1 min readLW link

Horizontal and Vertical Integration

Jeffrey Heninger1 Jul 2023 1:15 UTC

17 points

1 comment2 min readLW link

Inflection AI announces $1.3 billion of funding led by current investors, Microsoft, and NVIDIA

SandXbox30 Jun 2023 21:32 UTC

7 points

0 comments1 min readLW link

(inflection.ai)

Introduction

Robert Kralisch, Eris, teahorse and Sohaib Imran

30 Jun 2023 20:45 UTC

7 points

0 comments2 min readLW link

Inherently Interpretable Architectures

Robert Kralisch, teahorse, Eris and Sohaib Imran

30 Jun 2023 20:43 UTC

4 points

0 comments7 min readLW link

Positive Attractors

Robert Kralisch, teahorse, Eris and Sohaib Imran

30 Jun 2023 20:43 UTC

6 points

0 comments13 min readLW link

Agency from a causal perspective

tom4everitt, mattmacdermott, James Fox, Francis Rhys Ward and Jonathan Richens

30 Jun 2023 17:37 UTC

39 points

5 comments6 min readLW link

Little attention seems to be on discouraging hardware progress

RussellThor30 Jun 2023 10:14 UTC

5 points

3 comments1 min readLW link

Introducing EffiSciences’ AI Safety Unit

WCargo, Charbel-Raphaël and Florent_Berthet

30 Jun 2023 7:44 UTC

68 points

0 comments12 min readLW link

Contra Anton 🏴‍☠️ on Kolmogorov complexity and recursive self improvement

DaemonicSigil30 Jun 2023 5:15 UTC

25 points

12 comments2 min readLW link

Foom Liability

PeterMcCluskey30 Jun 2023 3:55 UTC

21 points

10 comments6 min readLW link

(bayesianinvestor.com)

I Think Eliezer Should Go on Glenn Beck

Lao Mein30 Jun 2023 3:12 UTC

29 points

21 comments1 min readLW link

Bengio’s FAQ on Catastrophic AI Risks

Vaniver29 Jun 2023 23:04 UTC

39 points

0 comments1 min readLW link

(yoshuabengio.org)

AGI & War

Calecute29 Jun 2023 22:20 UTC

9 points

1 comment1 min readLW link

Biosafety Regulations (BMBL) and their relevance for AI

Štěpán Los29 Jun 2023 19:22 UTC

4 points

0 comments4 min readLW link

Nature Releases A Stupid Editorial On AI Risk

omnizoid29 Jun 2023 19:00 UTC

2 points

1 comment3 min readLW link

AI Safety without Alignment: How humans can WIN against AI

vicchain29 Jun 2023 17:53 UTC

1 point

1 comment2 min readLW link

Challenge proposal: smallest possible self-hardening backdoor for RLHF

Christopher King29 Jun 2023 16:56 UTC

7 points

0 comments2 min readLW link

AI #18: The Great Debate Debate

Zvi29 Jun 2023 16:20 UTC

47 points

9 comments52 min readLW link

(thezvi.wordpress.com)

Bruce Sterling on the AI mania of 2023

Mitchell_Porter29 Jun 2023 5:00 UTC

25 points

1 comment1 min readLW link

(www.newsweek.com)

Cheat sheet of AI X-risk

momom229 Jun 2023 4:28 UTC

19 points

1 comment7 min readLW link

Anthropically Blind: the anthropic shadow is reflectively inconsistent

Christopher King29 Jun 2023 2:36 UTC

43 points

40 comments10 min readLW link

One path to coherence: conditionalization

porby29 Jun 2023 1:08 UTC

28 points

4 comments4 min readLW link

AXRP announcement: Survey, Store Closing, Patreon

DanielFilan28 Jun 2023 23:40 UTC

14 points

0 comments1 min readLW link

Metaphors for AI, and why I don’t like them

boazbarak28 Jun 2023 22:47 UTC

33 points

18 comments12 min readLW link

Transforming Democracy: A Unique Funding Opportunity for US Federal Approval Voting

Aaron Hamlin28 Jun 2023 22:07 UTC

25 points

6 comments2 min readLW link