Language Models Model Us · eggsyntax · 17 May 2024 21:00 UTC · 81 points · 19 comments · 7 min read · LW link
Instruction-following AGI is easier and more likely than value aligned AGI · Seth Herd · 15 May 2024 19:38 UTC · 35 points · 18 comments · 12 min read · LW link
Ilya Sutskever and Jan Leike resign from OpenAI [updated] · Zach Stein-Perlman · 15 May 2024 0:45 UTC · 230 points · 84 comments · 2 min read · LW link
DeepMind’s “Frontier Safety Framework” is weak and unambitious · Zach Stein-Perlman · 18 May 2024 3:00 UTC · 118 points · 10 comments · 4 min read · LW link
Using GPT-3 for preventing conflict during messaging — a pitch for an app · Eli_ · 17 Mar 2022 11:02 UTC · 22 points · 17 comments · 3 min read · LW link
Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems · Joar Skalse · 17 May 2024 19:13 UTC · 47 points · 2 comments · 2 min read · LW link
“If we go extinct due to misaligned AI, at least nature will continue, right? … right?” · plex · 18 May 2024 14:09 UTC · 40 points · 9 comments · 2 min read · LW link (aisafety.info)
Scientific Notation Options · jefftk · 18 May 2024 15:10 UTC · 19 points · 7 comments · 1 min read · LW link (www.jefftk.com)
[Question] In the context of AI interp. What is a feature exactly? · f3mi · 14 May 2024 13:46 UTC · 9 points · 1 comment · 1 min read · LW link
[Question] Is acausal extortion possible? · sisyphus · 11 Nov 2022 19:48 UTC · −20 points · 36 comments · 3 min read · LW link
Do you believe in hundred dollar bills lying on the ground? Consider humming · Elizabeth · 16 May 2024 0:00 UTC · 128 points · 11 comments · 6 min read · LW link (acesounderglass.com)
Einstein’s Arrogance · Eliezer Yudkowsky · 25 Sep 2007 1:29 UTC · 155 points · 90 comments · 3 min read · LW link
Why you should learn a musical instrument · cata · 15 May 2024 20:36 UTC · 48 points · 23 comments · 3 min read · LW link
How much AI inference can we do? · Benjamin_Todd · 14 May 2024 15:10 UTC · 16 points · 6 comments · 5 min read · LW link (benjamintodd.substack.com)
A Dozen Ways to Get More Dakka · Davidmanheim · 8 Apr 2024 4:45 UTC · 97 points · 9 comments · 3 min read · LW link
What Are Non-Zero-Sum Games?—A Primer · James Stephen Brown · 18 May 2024 9:19 UTC · 4 points · 1 comment · 3 min read · LW link
[Crosspost] Introducing the Save State Paradox · Suzie. EXE · 18 May 2024 17:00 UTC · 1 point · 0 comments · 7 min read · LW link
Advice for Activists from the History of Environmentalism · Jeffrey Heninger · 16 May 2024 18:40 UTC · 75 points · 5 comments · 6 min read · LW link (blog.aiimpacts.org)
Davidad’s Bold Plan for Alignment: An In-Depth Explanation · Charbel-Raphaël and Gabin · 19 Apr 2023 16:09 UTC · 154 points · 32 comments · 21 min read · LW link
What Do We Mean By “Rationality”? · Eliezer Yudkowsky · 16 Mar 2009 22:33 UTC · 333 points · 18 comments · 6 min read · LW link