Humanitarian Phase Transition needed before Technological Singularity

Dr_What · 7 Apr 2023 23:17 UTC
−9 points
5 comments · 2 min read · LW link

[Question] Thoughts about Hugging Face?

Ariel Kwiatkowski · 7 Apr 2023 23:17 UTC
7 points
0 comments · 1 min read · LW link

[Question] Is it correct to frame alignment as “programming a good philosophy of meaning”?

Util · 7 Apr 2023 23:16 UTC
2 points
3 comments · 1 min read · LW link

Select Agent Specifications as Natural Abstractions

lukemarks · 7 Apr 2023 23:16 UTC
19 points
3 comments · 5 min read · LW link

n=3 AI Risk Quick Math and Reasoning

lionhearted (Sebastian Marshall) · 7 Apr 2023 20:27 UTC
6 points
3 comments · 4 min read · LW link

[Question] What are good alternatives to Predictionbook for personal prediction tracking? Edited: I originally thought it was down but it was just 500 until I though of clearing cookies.

sortega · 7 Apr 2023 19:18 UTC
4 points
4 comments · 1 min read · LW link

Environments for Measuring Deception, Resource Acquisition, and Ethical Violations

Dan H · 7 Apr 2023 18:40 UTC
51 points
2 comments · 2 min read · LW link
(arxiv.org)

Superintelligence Is Not Omniscience

Jeffrey Heninger · 7 Apr 2023 16:30 UTC
15 points
20 comments · 7 min read · LW link
(aiimpacts.org)

An ‘AGI Emergency Eject Criteria’ consensus could be really useful.

tcelferact · 7 Apr 2023 16:21 UTC
5 points
0 comments · 1 min read · LW link

Reliability, Security, and AI risk: Notes from infosec textbook chapter 1

Akash · 7 Apr 2023 15:47 UTC
34 points
1 comment · 4 min read · LW link

Pre-registering a study

Robert_AIZI · 7 Apr 2023 15:46 UTC
10 points
0 comments · 6 min read · LW link
(aizi.substack.com)

Live discussion at Eastercon

Douglas_Reay · 7 Apr 2023 15:25 UTC
5 points
0 comments · 1 min read · LW link

[Question] ChatGTP “Writing ” News Stories for The Guardian?

jmh · 7 Apr 2023 12:16 UTC
1 point
4 comments · 1 min read · LW link

Storyteller’s convention, 2223 A.D.

plex · 7 Apr 2023 11:54 UTC
8 points
0 comments · 2 min read · LW link

Stampy’s AI Safety Info - New Distillations #1 [March 2023]

markov · 7 Apr 2023 11:06 UTC
42 points
0 comments · 2 min read · LW link
(aisafety.info)

Beren’s “Deconfusing Direct vs Amortised Optimisation”

DragonGod · 7 Apr 2023 8:57 UTC
52 points
10 comments · 3 min read · LW link

Goal alignment without alignment on epistemology, ethics, and science is futile

Roman Leventov · 7 Apr 2023 8:22 UTC
20 points
2 comments · 2 min read · LW link

Polio Lab Leak Caught with Wastewater Sampling

Cullen · 7 Apr 2023 1:06 UTC
82 points
3 comments · 1 min read · LW link

Catching the Eye of Sauron

Casey B. · 7 Apr 2023 0:40 UTC
221 points
68 comments · 4 min read · LW link

[Question] How to parallelize “inherently” serial theory work?

Nicholas / Heather Kross · 7 Apr 2023 0:08 UTC
16 points
6 comments · 1 min read · LW link

If Alignment is Hard, then so is Self-Improvement

PavleMiha · 7 Apr 2023 0:08 UTC
21 points
20 comments · 1 min read · LW link

Anthropic is further accelerating the Arms Race?

sapphire · 6 Apr 2023 23:29 UTC
82 points
22 comments · 1 min read · LW link
(techcrunch.com)

Suggestion for safe AI structure (Curated Transparent Decisions)

Kane Gregory · 6 Apr 2023 22:00 UTC
5 points
6 comments · 3 min read · LW link

10 reasons why lists of 10 reasons might be a winning strategy

trevor · 6 Apr 2023 21:24 UTC
109 points
7 comments · 1 min read · LW link

A Defense of Utilitarianism

Pareto Optimal · 6 Apr 2023 21:09 UTC
−3 points
2 comments · 5 min read · LW link
(paretooptimal.substack.com)

One Does Not Simply Replace the Humans

JerkyTreats · 6 Apr 2023 20:56 UTC
9 points
3 comments · 4 min read · LW link
(www.lesswrong.com)

[Question] Where to begin in ML/AI?

Jake the Student · 6 Apr 2023 20:45 UTC
9 points
4 comments · 1 min read · LW link

Misgeneralization as a misnomer

So8res · 6 Apr 2023 20:43 UTC
129 points
22 comments · 4 min read · LW link

You can use GPT-4 to create prompt injections against GPT-4

WitchBOT · 6 Apr 2023 20:39 UTC
87 points
7 comments · 2 min read · LW link

AI scares and changing public beliefs

Seth Herd · 6 Apr 2023 18:51 UTC
45 points
21 comments · 6 min read · LW link

AISafety.world is a map of the AIS ecosystem

Hamish Doodles · 6 Apr 2023 18:37 UTC
79 points
0 comments · 1 min read · LW link

I asked my senator to slow AI

Omid · 6 Apr 2023 18:18 UTC
21 points
5 comments · 2 min read · LW link

Pause AI Development?

PeterMcCluskey · 6 Apr 2023 17:23 UTC
11 points
0 comments · 2 min read · LW link
(bayesianinvestor.com)

Use these three heuristic imperatives to solve alignment

G · 6 Apr 2023 16:20 UTC
−17 points
4 comments · 1 min read · LW link

Eliezer on The Lunar Society podcast

Max H · 6 Apr 2023 16:18 UTC
40 points
5 comments · 1 min read · LW link
(www.dwarkeshpatel.com)

Do we get better or worse at adapting to change?

jasoncrawford · 6 Apr 2023 14:42 UTC
12 points
2 comments · 3 min read · LW link
(rootsofprogress.org)

Is it true that only a chatbot encouraged a man to commit suicide?

Jeroen De Ryck · 6 Apr 2023 14:10 UTC
6 points
0 comments · 4 min read · LW link
(www.vrt.be)

A Fresh FAQ on GiveWiki and Impact Markets Generally

Dawn Drescher · 6 Apr 2023 14:02 UTC
−1 points
0 comments · 1 min read · LW link
(impactmarkets.substack.com)

AI #6: Agents of Change

Zvi · 6 Apr 2023 14:00 UTC
79 points
13 comments · 47 min read · LW link
(thezvi.wordpress.com)

Stupid Questions - April 2023

ChristianKl · 6 Apr 2023 13:07 UTC
17 points
44 comments · 1 min read · LW link

(Yet Another) Map for AI Risk Discussion

chronolitus · 6 Apr 2023 11:55 UTC
1 point
0 comments · 2 min read · LW link

The Computational Anatomy of Human Values

beren · 6 Apr 2023 10:33 UTC
70 points
30 comments · 30 min read · LW link

[Question] Is “Recursive Self-Improvement” Relevant in the Deep Learning Paradigm?

DragonGod · 6 Apr 2023 7:13 UTC
32 points
36 comments · 7 min read · LW link

Revisiting the Horizon Length Hypothesis

Pablo Villalobos · 6 Apr 2023 6:39 UTC
23 points
4 comments · 3 min read · LW link

Monthly Shorts 3/23

Celer · 6 Apr 2023 6:20 UTC
7 points
1 comment · 4 min read · LW link
(keller.substack.com)

Dual-Useness is a Ratio

jimrandomh · 6 Apr 2023 5:46 UTC
35 points
2 comments · 1 min read · LW link

[Question] What’s the deal with Effective Accelerationism (e/acc)?

RomanHauksson · 6 Apr 2023 4:03 UTC
23 points
9 comments · 2 min read · LW link

No Summer Harvest: Why AI Development Won’t Pause

Stephen Fowler · 6 Apr 2023 3:53 UTC
14 points
17 comments · 12 min read · LW link

Yoshua Bengio: “Slowing down development of AI systems passing the Turing test”

Roman Leventov · 6 Apr 2023 3:31 UTC
49 points
2 comments · 5 min read · LW link
(yoshuabengio.org)

Unaligned stable loops emerge at scale

Michael Tontchev · 6 Apr 2023 2:15 UTC
9 points
8 comments · 4 min read · LW link