[Question] Monotonous Work

Gideon Bauer · 2 Feb 2023 21:35 UTC
1 point
0 comments · 1 min read · LW link

Is AI risk assessment too anthropocentric?

Craig Mattson · 2 Feb 2023 21:34 UTC
3 points
6 comments · 1 min read · LW link

Halifax Monthly Meetup: Introduction to Effective Altruism

Ideopunk · 2 Feb 2023 21:10 UTC
10 points
0 comments · 1 min read · LW link

Conditioning Predictive Models: Outer alignment via careful conditioning

2 Feb 2023 20:28 UTC
72 points
15 comments · 57 min read · LW link

Conditioning Predictive Models: Large language models as predictors

2 Feb 2023 20:28 UTC
88 points
4 comments · 13 min read · LW link

Normative vs Descriptive Models of Agency

mattmacdermott · 2 Feb 2023 20:28 UTC
26 points
5 comments · 4 min read · LW link

Andrew Huberman on How to Optimize Sleep

Leon Lang · 2 Feb 2023 20:17 UTC
37 points
6 comments · 6 min read · LW link

[Question] How can I help inflammation-based nerve damage be temporary?

Optimization Process · 2 Feb 2023 19:20 UTC
17 points
4 comments · 1 min read · LW link

More findings on maximal data dimension

Marius Hobbhahn · 2 Feb 2023 18:33 UTC
27 points
1 comment · 11 min read · LW link

Heritability, Behaviorism, and Within-Lifetime RL

Steven Byrnes · 2 Feb 2023 16:34 UTC
39 points
3 comments · 4 min read · LW link

Covid 2/2/23: The Emergency Ends on 5/11

Zvi · 2 Feb 2023 14:00 UTC
22 points
6 comments · 7 min read · LW link
(thezvi.wordpress.com)

You are probably not a good alignment researcher, and other blatant lies

junk heap homotopy · 2 Feb 2023 13:55 UTC
83 points
16 comments · 2 min read · LW link

Don’t Judge a Tool by its Average Output

silentbob · 2 Feb 2023 13:42 UTC
5 points
2 comments · 4 min read · LW link

Epoch Impact Report 2022

Jsevillamol · 2 Feb 2023 13:09 UTC
16 points
0 comments · 1 min read · LW link

You Don’t Exist, Duncan

Duncan Sabien (Deactivated) · 2 Feb 2023 8:37 UTC
244 points
107 comments · 9 min read · LW link

Temporally Layered Architecture for Adaptive, Distributed and Continuous Control

Roman Leventov · 2 Feb 2023 6:29 UTC
6 points
4 comments · 1 min read · LW link
(arxiv.org)

Research agenda: Formalizing abstractions of computations

Erik Jenner · 2 Feb 2023 4:29 UTC
92 points
10 comments · 31 min read · LW link

Progress links and tweets, 2023-02-01

jasoncrawford · 2 Feb 2023 2:25 UTC
10 points
0 comments · 1 min read · LW link
(rootsofprogress.org)

Retrospective on the AI Safety Field Building Hub

Vael Gates · 2 Feb 2023 2:06 UTC
30 points
0 comments · 1 min read · LW link

How to export Android Chrome tabs to an HTML file in Linux (as of February 2023)

Adam Scherlis · 2 Feb 2023 2:03 UTC
7 points
3 comments · 2 min read · LW link
(adam.scherlis.com)

Hacked Account Spam

jefftk · 2 Feb 2023 1:50 UTC
13 points
5 comments · 1 min read · LW link
(www.jefftk.com)

A simple technique to reduce negative rumination

cranberry_bear · 2 Feb 2023 1:33 UTC
9 points
0 comments · 1 min read · LW link

A Brief Overview of AI Safety/Alignment Orgs, Fields, Researchers, and Resources for ML Researchers

Austin Witte · 2 Feb 2023 1:02 UTC
18 points
1 comment · 2 min read · LW link

Interviews with 97 AI Researchers: Quantitative Analysis

2 Feb 2023 1:01 UTC
23 points
0 comments · 7 min read · LW link

“AI Risk Discussions” website: Exploring interviews from 97 AI Researchers

2 Feb 2023 1:00 UTC
43 points
1 comment · 1 min read · LW link

Predicting researcher interest in AI alignment

Vael Gates · 2 Feb 2023 0:58 UTC
25 points
0 comments · 1 min read · LW link

Focus on the places where you feel shocked everyone’s dropping the ball

So8res · 2 Feb 2023 0:27 UTC
421 points
60 comments · 4 min read · LW link

Exercise is Good, Actually

Gordon Seidoh Worley · 2 Feb 2023 0:09 UTC
91 points
27 comments · 3 min read · LW link

Product safety is a poor model for AI governance

Richard Korzekwa · 1 Feb 2023 22:40 UTC
36 points
0 comments · 5 min read · LW link
(aiimpacts.org)

Hinton: “mortal” efficient analog hardware may be learned-in-place, uncopyable

the gears to ascension · 1 Feb 2023 22:19 UTC
12 points
3 comments · 1 min read · LW link

Can we “cure” cancer?

jasoncrawford · 1 Feb 2023 22:03 UTC
41 points
31 comments · 2 min read · LW link
(rootsofprogress.org)

Eli Lifland on Navigating the AI Alignment Landscape

ozziegooen · 1 Feb 2023 21:17 UTC
9 points
1 comment · 31 min read · LW link
(quri.substack.com)

Schizophrenia as a deficiency in long-range cortex-to-cortex communication

Steven Byrnes · 1 Feb 2023 19:32 UTC
35 points
36 comments · 11 min read · LW link

AI Safety Arguments: An Interactive Guide

Lukas Trötzmüller · 1 Feb 2023 19:26 UTC
20 points
0 comments · 3 min read · LW link

More findings on Memorization and double descent

Marius Hobbhahn · 1 Feb 2023 18:26 UTC
53 points
2 comments · 19 min read · LW link

Language Models can be Utility-Maximising Agents

Raymond D · 1 Feb 2023 18:13 UTC
22 points
1 comment · 2 min read · LW link

Trends in the dollar training cost of machine learning systems

Ben Cottier · 1 Feb 2023 14:48 UTC
23 points
0 comments · 2 min read · LW link
(epochai.org)

Polis: Why and How to Use it

brook · 1 Feb 2023 14:03 UTC
5 points
0 comments · 1 min read · LW link

Subitisation of Self

vitaliya · 1 Feb 2023 9:18 UTC
14 points
4 comments · 2 min read · LW link

Directed Babbling

Yudhister Kumar · 1 Feb 2023 9:10 UTC
20 points
1 comment · 3 min read · LW link
(www.ykumar.org)

Voting Results for the 2021 Review

Raemon · 1 Feb 2023 8:02 UTC
66 points
10 comments · 38 min read · LW link

Abstraction As Symmetry and Other Thoughts

Numendil · 1 Feb 2023 6:25 UTC
28 points
9 comments · 2 min read · LW link

The effect of horizon length on scaling laws

Jacob_Hilton · 1 Feb 2023 3:59 UTC
23 points
2 comments · 1 min read · LW link
(arxiv.org)

Contra Dance Lengths

jefftk · 1 Feb 2023 3:30 UTC
9 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Aiming for Convergence Is Like Discouraging Betting

Zack_M_Davis · 1 Feb 2023 0:03 UTC
60 points
17 comments · 11 min read · LW link

On value in humans, other animals, and AI

Michele Campolo · 31 Jan 2023 23:33 UTC
3 points
17 comments · 5 min read · LW link

Criticism of the main framework in AI alignment

Michele Campolo · 31 Jan 2023 23:01 UTC
19 points
2 comments · 6 min read · LW link

Nice Clothes are Good, Actually

Gordon Seidoh Worley · 31 Jan 2023 19:22 UTC
71 points
28 comments · 4 min read · LW link

[Linkpost] Human-narrated audio version of “Is Power-Seeking AI an Existential Risk?”

Joe Carlsmith · 31 Jan 2023 19:21 UTC
12 points
1 comment · 1 min read · LW link

No Really, Attention is ALL You Need—Attention can do feedforward networks

Robert_AIZI · 31 Jan 2023 18:48 UTC
29 points
7 comments · 6 min read · LW link
(aizi.substack.com)