All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 11 121314 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

[Question] Does a LLM have a utility function?

DagonDec 9, 2022, 5:19 PM

17 points

11 comments1 min readLW link

Monthly Roundup #1

ZviDec 9, 2022, 5:10 PM

31 points

6 comments21 min readLW link

(thezvi.wordpress.com)

Working towards AI alignment is better

Johannes C. MayerDec 9, 2022, 3:39 PM

8 points

2 comments2 min readLW link

You can still fetch the coffee today if you’re dead tomorrow

davidadDec 9, 2022, 2:06 PM

96 points

19 comments5 min readLW link

ChatGPT’s Misalignment Isn’t What You Think

stavrosDec 9, 2022, 11:11 AM

3 points

12 comments1 min readLW link

ML Safety at NeurIPS & Paradigmatic AI Safety? MLAISU W49

Esben Kran and Steinthal

Dec 9, 2022, 10:38 AM

19 points

0 comments4 min readLW link

(newsletter.apartresearch.com)

[Question] What are your thoughts on the future of AI-assisted software development?

RomanHaukssonDec 9, 2022, 10:04 AM

4 points

4 comments1 min readLW link

Fear mitigated the nuclear threat, can it do the same to AGI risks?

Igor IvanovDec 9, 2022, 10:04 AM

6 points

8 comments5 min readLW link

Setting the Zero Point

Duncan Sabien (Inactive)Dec 9, 2022, 6:06 AM

90 points

43 comments20 min readLW link 1 review

Systems of Survival

VaniverDec 9, 2022, 5:13 AM

63 points

5 comments5 min readLW link

[Question] Do You Have an Internal Monologue?

belkarxDec 9, 2022, 3:04 AM

23 points

7 comments1 min readLW link

[Question] How is the “sharp left turn defined”?

Chris_LeongDec 9, 2022, 12:04 AM

14 points

4 comments1 min readLW link

Linkpost for a generalist algorithmic learner: capable of carrying out sorting, shortest paths, string matching, convex hull finding in one network

lovetheusersDec 9, 2022, 12:02 AM

7 points

1 comment1 min readLW link

(twitter.com)

[Question] Where’s the economic incentive for wokism coming from?

ValentineDec 8, 2022, 11:28 PM

12 points

105 comments1 min readLW link

I Believe we are in a Hardware Overhang

nemDec 8, 2022, 11:18 PM

8 points

0 comments1 min readLW link

Of pumpkins, the Falcon Heavy, and Groucho Marx: High-Level discourse structure in ChatGPT

Bill BenzonDec 8, 2022, 10:25 PM

2 points

0 comments8 min readLW link

How Many Lives Does X-Risk Work Save From Nonexistence On Average?

Jordan ArelDec 8, 2022, 9:57 PM

4 points

5 comments14 min readLW link

AI Safety Seems Hard to Measure

HoldenKarnofskyDec 8, 2022, 7:50 PM

71 points

6 comments14 min readLW link

(www.cold-takes.com)

Playing shell games with definitions

weverkaDec 8, 2022, 7:35 PM

9 points

3 comments1 min readLW link

Notes on OpenAI’s alignment plan

Alex FlintDec 8, 2022, 7:13 PM

40 points

5 comments7 min readLW link

Relevant to natural abstractions: Euclidean Symmetry Equivariant Machine Learning—Overview, Applications, and Open Questions

the gears to ascensionDec 8, 2022, 6:01 PM

8 points

0 comments1 min readLW link

(youtu.be)

I’ve started publishing the novel I wrote to promote EA

Timothy UnderwoodDec 8, 2022, 5:30 PM

10 points

3 comments1 min readLW link

Neural networks biased towards geometrically simple functions?

DavidHolmesDec 8, 2022, 4:16 PM

16 points

2 comments3 min readLW link

If Wentworth is right about natural abstractions, it would be bad for alignment

Wuschel SchulzDec 8, 2022, 3:19 PM

29 points

5 comments4 min readLW link

Covid 12/8/22: Another Winter Wave

ZviDec 8, 2022, 2:40 PM

23 points

8 comments11 min readLW link

(thezvi.wordpress.com)

Why I’m Sceptical of Foom

DragonGodDec 8, 2022, 10:01 AM

20 points

36 comments3 min readLW link

Take 7: You should talk about “the human’s utility function” less.

Charlie SteinerDec 8, 2022, 8:14 AM

50 points

22 comments2 min readLW link

Machine Learning Consent

jefftkDec 8, 2022, 3:50 AM

38 points

14 comments3 min readLW link

(www.jefftk.com)

Riffing on the agent type

QuinnDec 8, 2022, 12:19 AM

21 points

3 comments4 min readLW link

[Question] Looking for ideas of public assets (stocks, funds, ETFs) that I can invest in to have a chance at profiting from the mass adoption and commercialization of AI technology

AnnapurnaDec 7, 2022, 10:35 PM

15 points

9 comments1 min readLW link

A Fallibilist Wordview

Toni MUENDELDec 7, 2022, 8:59 PM

−13 points

2 comments13 min readLW link

Thoughts on AGI organizations and capabilities work

Rob Bensinger and So8res

Dec 7, 2022, 7:46 PM

102 points

17 comments5 min readLW link

How to Think About Climate Models and How to Improve Them

clansDec 7, 2022, 7:37 PM

7 points

0 comments2 min readLW link

(locationtbd.home.blog)

The novelty quotient

River LewisDec 7, 2022, 5:16 PM

4 points

7 comments2 min readLW link

(heytraveler.substack.com)

ChatGPT: “An error occurred. If this issue persists...”

Bill BenzonDec 7, 2022, 3:41 PM

5 points

11 comments3 min readLW link

Take 6: CAIS is actually Orwellian.

Charlie SteinerDec 7, 2022, 1:50 PM

12 points

8 comments2 min readLW link

Peter Thiel on Technological Stagnation and Out of Touch Rationalists

Matt GoldenbergDec 7, 2022, 1:15 PM

9 points

26 comments1 min readLW link

(youtu.be)

[Link] Wavefunctions: from Linear Algebra to Spinors

senDec 7, 2022, 12:44 PM

11 points

12 comments1 min readLW link

(paperclip.substack.com)

Why I like Zulip instead of Slack or Discord

Alok SinghDec 7, 2022, 9:28 AM

31 points

10 comments1 min readLW link

Bioweapons, and ChatGPT (another vulnerability story)

BeeblebroxDec 7, 2022, 7:27 AM

−5 points

0 comments2 min readLW link

Where to be an AI Safety Professor

scasperDec 7, 2022, 7:09 AM

31 points

12 comments2 min readLW link

[Question] Are there any tools to convert LW sequences to PDF or any other file format?

quetzal_rainbowDec 7, 2022, 5:28 AM

2 points

2 comments1 min readLW link

Manifold Markets community meetup

Sinclair ChenDec 7, 2022, 3:25 AM

4 points

0 comments1 min readLW link

“Attention Passengers”: not for Signs

jefftkDec 7, 2022, 2:00 AM

27 points

10 comments1 min readLW link

(www.jefftk.com)

[ASoT] Probability Infects Concepts it Touches

Ulisse MiniDec 7, 2022, 1:48 AM

10 points

4 comments1 min readLW link

Simple Way to Prevent Power-Seeking AI

research_prime_spaceDec 7, 2022, 12:26 AM

12 points

1 comment1 min readLW link

In defense of probably wrong mechanistic models

evhubDec 6, 2022, 11:24 PM

55 points

10 comments2 min readLW link

AI Safety in a Vulnerable World: Requesting Feedback on Preliminary Thoughts

Jordan ArelDec 6, 2022, 10:35 PM

4 points

2 comments3 min readLW link

ChatGPT and the Human Race

Ben ReillyDec 6, 2022, 9:38 PM

6 points

1 comment3 min readLW link

[Question] How do finite factored sets compare with phase space?

Alex_AltairDec 6, 2022, 8:05 PM

15 points

1 comment1 min readLW link