All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 192021 22 23 24 25 26 27 28 29 30 31

The Gallery for Painting Transformations—A GPT-3 Analogy

Robert_AIZI19 Jan 2023 23:32 UTC

1 point

0 comments6 min readLW link

(aizi.substack.com)

AGI safety field building projects I’d like to see

Severin T. Seehrich19 Jan 2023 22:40 UTC

68 points

28 comments9 min readLW link

Extensionality and the univalence axiom of type theory

Thomas Kehrenberg19 Jan 2023 22:36 UTC

6 points

2 comments16 min readLW link

The spiritual benefits of material progress

jasoncrawford19 Jan 2023 21:35 UTC

24 points

15 comments7 min readLW link

(rootsofprogress.org)

Announcing Cavendish Labs

derikk and agg

19 Jan 2023 20:15 UTC

59 points

5 comments2 min readLW link

(forum.effectivealtruism.org)

Thoughts on refusing harmful requests to large language models

William_S19 Jan 2023 19:49 UTC

32 points

4 comments2 min readLW link

MA RMV Overloaded

jefftk19 Jan 2023 16:40 UTC

16 points

0 comments2 min readLW link

(www.jefftk.com)

“Heretical Thoughts on AI” by Eli Dourado

DragonGod19 Jan 2023 16:11 UTC

146 points

38 comments3 min readLW link

(www.elidourado.com)

Covid 1/19/23: Flipped Numbers

Zvi19 Jan 2023 13:30 UTC

19 points

4 comments4 min readLW link

(thezvi.wordpress.com)

List of technical AI safety exercises and projects

JakubK19 Jan 2023 9:35 UTC

41 points

5 comments1 min readLW link

(docs.google.com)

Group-level Consequences of Psychological Problems

adamShimi and Gabriel Alfour

19 Jan 2023 9:27 UTC

28 points

3 comments2 min readLW link

6-paragraph AI risk intro for MAISI

JakubK19 Jan 2023 9:22 UTC

11 points

0 comments2 min readLW link

(www.maisi.club)

200 COP in MI: Studying Learned Features in Language Models

Neel Nanda19 Jan 2023 3:48 UTC

24 points

2 comments30 min readLW link

Amazon closing AmazonSmile to focus its philanthropic giving to programs with greater impact

Gordon Seidoh Worley19 Jan 2023 1:15 UTC

10 points

8 comments1 min readLW link

Gradient Filtering

Jozdien and janus

18 Jan 2023 20:09 UTC

55 points

16 comments13 min readLW link

[Cross-post] Is the Fermi Paradox due to the Flaw of Averages?

Aryeh Englander, Lonnie Chrisman and Yaakov T

18 Jan 2023 19:22 UTC

41 points

27 comments15 min readLW link

(lumina.com)

First Three Episodes of The Filan Cabinet

DanielFilan18 Jan 2023 19:20 UTC

17 points

1 comment1 min readLW link

[Question] Best Questions To Vet Potential Ai-Safety Applicants

jacksonjezion18 Jan 2023 19:01 UTC

6 points

1 comment1 min readLW link

[Question] Looking for a specific group of people

FriggenRedChickenMan18 Jan 2023 19:00 UTC

15 points

21 comments1 min readLW link

A problem with group epistemics

Mckay Jensen18 Jan 2023 17:06 UTC

4 points

4 comments3 min readLW link

(quevivasbien.github.io)

Why you should learn sign language

Noah Topper18 Jan 2023 17:03 UTC

53 points

23 comments7 min readLW link

(naivebayes.substack.com)

Flying With Covid

jefftk18 Jan 2023 17:00 UTC

44 points

29 comments3 min readLW link

(www.jefftk.com)

Prototype of Using GPT-3 to Generate Textbook-length Content

Rafael Cosman18 Jan 2023 14:25 UTC

2 points

8 comments40 min readLW link

(github.com)

How many people are working (directly) on reducing existential risk from AI?

Benjamin Hilton18 Jan 2023 8:46 UTC

20 points

1 comment1 min readLW link

EA & LW Forum Summaries (9th Jan to 15th Jan 23′)

Zoe Williams18 Jan 2023 7:29 UTC

17 points

0 comments1 min readLW link

OpenAI’s Alignment Plan is not S.M.A.R.T.

Søren Elverlin18 Jan 2023 6:39 UTC

9 points

19 comments4 min readLW link

[Question] Formal definition of Ontology Mismatch?

NathanBarnard18 Jan 2023 5:52 UTC

6 points

0 comments1 min readLW link

[Question] Transformer Mech Interp: Any visualizations?

Joyee Chen18 Jan 2023 4:32 UTC

3 points

0 comments1 min readLW link

Neural networks generalize because of this one weird trick

Jesse Hoogland18 Jan 2023 0:10 UTC

179 points

29 comments53 min readLW link 1 review

(www.jessehoogland.com)

Progress links and tweets, 2023-01-17

jasoncrawford17 Jan 2023 21:31 UTC

13 points

3 comments2 min readLW link

(rootsofprogress.org)

Quotes Worth Talking About

akaTrickster17 Jan 2023 21:26 UTC

−1 points

0 comments3 min readLW link

Building a transhumanist future: 15 years of hplusroadmap, now Discord

kanzure17 Jan 2023 21:17 UTC

42 points

1 comment1 min readLW link

(twitter.com)

Ad Fraud Detection Prediction Market

jefftk17 Jan 2023 18:10 UTC

17 points

0 comments2 min readLW link

(www.jefftk.com)

Collin Burns on Alignment Research And Discovering Latent Knowledge Without Supervision

Michaël Trazzi17 Jan 2023 17:21 UTC

25 points

5 comments4 min readLW link

(theinsideview.ai)

Lessons learned and review of the AI Safety Nudge Competition

Marc Carauleanu17 Jan 2023 17:13 UTC

3 points

0 comments1 min readLW link

Five Reasons to Lie

Dzoldzaya17 Jan 2023 16:53 UTC

0 points

19 comments3 min readLW link

On AI and Interest Rates

Zvi17 Jan 2023 15:00 UTC

79 points

13 comments8 min readLW link

(thezvi.wordpress.com)

Language models can generate superior text compared to their input

ChristianKl17 Jan 2023 10:57 UTC

48 points

28 comments1 min readLW link

Löbian emotional processing of emergent cooperation: an example

Andrew_Critch17 Jan 2023 5:59 UTC

23 points

0 comments8 min readLW link

Preparing for AI-assisted alignment research: we need data!

CBiddulph17 Jan 2023 3:28 UTC

31 points

3 comments1 min readLW link

Tesla Model 3 Review

jefftk17 Jan 2023 1:10 UTC

18 points

15 comments4 min readLW link

(www.jefftk.com)

[Question] Should AI writers be prohibited in education?

Eleni Angelou17 Jan 2023 0:42 UTC

6 points

2 comments1 min readLW link

What can thought-experiments do?

Cleo Nardo17 Jan 2023 0:35 UTC

16 points

3 comments5 min readLW link

Experiment Idea: RL Agents Evading Learned Shutdownability

Leon Lang16 Jan 2023 22:46 UTC

31 points

7 comments17 min readLW link

(docs.google.com)

Consequentialists: One-Way Pattern Traps

David Udell16 Jan 2023 20:48 UTC

59 points

3 comments14 min readLW link

Book Review: Worlds of Flow

remember16 Jan 2023 20:17 UTC

83 points

3 comments9 min readLW link

For the Record: DL ∩ ASI = ∅

maximkazhenkov16 Jan 2023 19:04 UTC

12 points

13 comments2 min readLW link

[Question] What determines female romantic “market value”?

anon_girl16 Jan 2023 18:45 UTC

16 points

50 comments1 min readLW link

Status conscious

avantika.mehra16 Jan 2023 17:44 UTC

2 points

0 comments5 min readLW link

Confusing the ideal for the necessary

adamShimi16 Jan 2023 17:29 UTC

79 points

6 comments1 min readLW link

(epistemologicalvigilance.substack.com)