All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 131415 16 17 18 19 20 21 22 23 24 25 26 27 28

[Question] Is InstructGPT Following Instructions in Other Languages Surprising?

DragonGod13 Feb 2023 23:26 UTC

39 points

15 comments1 min readLW link

LLM Basics: Embedding Spaces—Transformer Token Vectors Are Not Points in Space

NickyP13 Feb 2023 18:52 UTC

79 points

11 comments15 min readLW link

4 ways to think about democratizing AI [GovAI Linkpost]

Akash13 Feb 2023 18:06 UTC

24 points

4 comments1 min readLW link

(www.governance.ai)

Does the AGPL Work?

jefftk13 Feb 2023 14:20 UTC

13 points

12 comments2 min readLW link

(www.jefftk.com)

H5N1

Zvi13 Feb 2023 12:50 UTC

101 points

1 comment9 min readLW link

(thezvi.wordpress.com)

Enjoy LessWrong in ebook format

Bart Bussmann13 Feb 2023 11:53 UTC

53 points

2 comments1 min readLW link

Morphological intelligence, superhuman empathy, and ethical arbitration

Roman Leventov13 Feb 2023 10:25 UTC

1 point

0 comments2 min readLW link

South Bay ACX/LW Meetup

IS13 Feb 2023 6:08 UTC

3 points

0 comments1 min readLW link

Idea: Network modularity and interpretability by sexual reproduction

qbolec12 Feb 2023 23:06 UTC

3 points

3 comments1 min readLW link

The End of Anonymity Online

Spiorad12 Feb 2023 21:23 UTC

3 points

9 comments2 min readLW link

Matt Clancy AMA on the Progress Forum

jasoncrawford12 Feb 2023 20:23 UTC

17 points

0 comments1 min readLW link

(progressforum.org)

Latent variables for prediction markets: motivation, technical guide, and design considerations

tailcalled12 Feb 2023 17:54 UTC

100 points

18 comments23 min readLW link 1 review

The conceptual Doppelgänger problem

TsviBT12 Feb 2023 17:23 UTC

12 points

5 comments4 min readLW link

How Cardioid Are Cardioids?

jefftk12 Feb 2023 16:20 UTC

9 points

0 comments2 min readLW link

(www.jefftk.com)

How many of these jobs will have a 15% or more drop in employment plausibly attributable to AI by 2031?

tailcalled12 Feb 2023 15:40 UTC

12 points

5 comments1 min readLW link

(manifold.markets)

Human-AI collaborative writing

DirectedEvolution12 Feb 2023 14:57 UTC

20 points

2 comments5 min readLW link

RaD-AI workshop

Ram Rachum12 Feb 2023 12:46 UTC

3 points

0 comments1 min readLW link

Elements of Rationalist Discourse

Rob Bensinger12 Feb 2023 7:58 UTC

223 points

49 comments3 min readLW link 1 review

Conflict Theory of Bounded Distrust

Zack_M_Davis12 Feb 2023 5:30 UTC

108 points

30 comments3 min readLW link 1 review

Why almost every RL agent does learned optimization

Lee Sharkey12 Feb 2023 4:58 UTC

32 points

3 comments5 min readLW link

How I Learn From Textbooks

DirectedEvolution12 Feb 2023 4:45 UTC

24 points

3 comments8 min readLW link

Top YouTube channel Veritasium releases video on Sleeping Beauty Problem

Alex_Altair11 Feb 2023 20:36 UTC

25 points

22 comments1 min readLW link

(www.youtube.com)

Shortening Timelines: There’s No Buffer Anymore

Jeff Rose11 Feb 2023 19:53 UTC

10 points

5 comments1 min readLW link

We Found An Neuron in GPT-2

Joseph Miller and Clement Neo

11 Feb 2023 18:27 UTC

143 points

23 comments7 min readLW link

(clementneo.com)

The Practitioner’s Path 2.0: the Pragmatist Archetype

Evenflair11 Feb 2023 15:48 UTC

21 points

0 comments2 min readLW link

(guildoftherose.org)

The Illusion of Simplicity: Monetary Policy as a Problem of Complexity and Alignment

Edward P. Könings11 Feb 2023 15:04 UTC

8 points

0 comments8 min readLW link

(edwardknings.substack.com)

In Defense of Chatbot Romance

Kaj_Sotala11 Feb 2023 14:30 UTC

123 points

52 comments11 min readLW link

(kajsotala.fi)

Threatening to do the impossible: A solution to spurious counterfactuals for functional decision theory via proof theory

Christopher King11 Feb 2023 7:57 UTC

5 points

4 comments5 min readLW link

Rationality-related things I don’t know as of 2023

Adam Zerner11 Feb 2023 6:04 UTC

64 points

59 comments3 min readLW link

A note on ‘semiotic physics’

metasemi11 Feb 2023 5:12 UTC

11 points

13 comments6 min readLW link

Inequality Penalty: Morality in Many Worlds

Shmi11 Feb 2023 4:08 UTC

11 points

17 comments6 min readLW link

The Importance of AI Alignment, explained in 5 points

Daniel_Eth11 Feb 2023 2:56 UTC

33 points

2 comments1 min readLW link

Acting Normal is Good, Actually

Gordon Seidoh Worley10 Feb 2023 23:35 UTC

14 points

5 comments3 min readLW link

[S] D&D.Sci: All the D8a. Allllllll of it.

aphyer10 Feb 2023 21:14 UTC

43 points

17 comments6 min readLW link

A Different Kind of Ark: My failed attempt to build a bridge between universes

ChrisM10 Feb 2023 20:49 UTC

2 points

2 comments6 min readLW link

(www.vesselproject.io)

Prizes for the 2021 Review

Raemon10 Feb 2023 19:47 UTC

69 points

2 comments4 min readLW link

A proposed method for forecasting transformative AI

Matthew Barnett10 Feb 2023 19:34 UTC

121 points

21 comments10 min readLW link

The best way so far to explain AI risk: The Precipice (p. 137-149)

trevor10 Feb 2023 19:33 UTC

50 points

2 comments17 min readLW link

Is this a weak pivotal act: creating nanobots that eat evil AGIs (but nothing else)?

Christopher King10 Feb 2023 19:26 UTC

0 points

3 comments1 min readLW link

Why I’m not working on {debate, RRM, ELK, natural abstractions}

Steven Byrnes10 Feb 2023 19:22 UTC

71 points

19 comments9 min readLW link

Conditioning Predictive Models: Open problems, Conclusion, and Appendix

evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson and kcwoolverton

10 Feb 2023 19:21 UTC

36 points

3 comments11 min readLW link

Jobs that can help with the most important century

HoldenKarnofsky10 Feb 2023 18:20 UTC

24 points

0 comments19 min readLW link

(www.cold-takes.com)

[Question] Is it a coincidence that GPT-3 requires roughly the same amount of compute as is necessary to emulate the human brain?

RomanS10 Feb 2023 16:26 UTC

11 points

10 comments1 min readLW link

Contra: Changing Role Terms

jefftk10 Feb 2023 15:00 UTC

8 points

0 comments3 min readLW link

(www.jefftk.com)

Cyborgism

NicholasKees and janus

10 Feb 2023 14:47 UTC

337 points

46 comments35 min readLW link

FLI Podcast: Connor Leahy on AI Progress, Chimps, Memes, and Markets (Part 1/3)

remember and Andrea_Miotti

10 Feb 2023 13:55 UTC

39 points

0 comments43 min readLW link

Many important technologies start out as science fiction before becoming real

trevor10 Feb 2023 9:36 UTC

28 points

2 comments2 min readLW link

[Question] What’s actually going on in the “mind” of the model when we fine-tune GPT-3 to InstructGPT?

rpglover6410 Feb 2023 7:57 UTC

18 points

3 comments1 min readLW link

Mechanism Design for AI Safety—Agenda Creation Retreat

Rubi J. Hudson10 Feb 2023 3:05 UTC

24 points

2 comments1 min readLW link

[Question] On utility functions

jodaru10 Feb 2023 1:22 UTC

11 points

10 comments1 min readLW link