Eval­u­at­ing Ev­i­dence Re­con­struc­tions of Mock Crimes -Sub­mis­sion 2

Alan E Dunne24 May 2023 22:17 UTC
−1 points
1 comment3 min readLW link

[Linkpost] In­ter­pretabil­ity Dreams

DanielFilan24 May 2023 21:08 UTC
39 points
2 comments2 min readLW link
(transformer-circuits.pub)

Rishi Su­nak men­tions “ex­is­ten­tial threats” in talk with OpenAI, Deep­Mind, An­thropic CEOs

24 May 2023 21:06 UTC
34 points
1 comment1 min readLW link
(www.gov.uk)

If you’re not a morn­ing per­son, con­sider quit­ting allergy pills

Brendan Long24 May 2023 20:11 UTC
8 points
3 comments1 min readLW link

Adum­bra­tions on AGI from an outsider

nicholashalden24 May 2023 17:41 UTC
57 points
44 comments8 min readLW link
(nicholashalden.home.blog)

Open Thread With Ex­per­i­men­tal Fea­ture: Reactions

jimrandomh24 May 2023 16:46 UTC
101 points
189 comments3 min readLW link

A re­jec­tion of the Orthog­o­nal­ity Thesis

ArisC24 May 2023 16:37 UTC
−2 points
11 comments2 min readLW link
(medium.com)

Aligned AI via mon­i­tor­ing ob­jec­tives in Au­toGPT-like systems

Paul Colognese24 May 2023 15:59 UTC
27 points
4 comments4 min readLW link

The Office of Science and Tech­nol­ogy Policy put out a re­quest for in­for­ma­tion on A.I.

HiroSakuraba24 May 2023 13:33 UTC
59 points
4 comments1 min readLW link
(www.whitehouse.gov)

ChatGPT (May 2023) on De­sign­ing Friendly Superintelligence

Mitchell_Porter24 May 2023 10:47 UTC
5 points
0 comments1 min readLW link
(singularitypolitics.wordpress.com)

No—AI is just as en­ergy-effi­cient as your brain.

Maxwell Clarke24 May 2023 2:30 UTC
11 points
7 comments1 min readLW link

[Question] What pro­jects and efforts are there to pro­mote AI safety re­search?

Christopher King24 May 2023 0:33 UTC
4 points
0 comments1 min readLW link

My May 2023 pri­ori­ties for AI x-safety: more em­pa­thy, more unifi­ca­tion of con­cerns, and less vil­ifi­ca­tion of OpenAI

Andrew_Critch24 May 2023 0:02 UTC
268 points
39 comments8 min readLW link

AI Safety Newslet­ter #7: Dis­in­for­ma­tion, Gover­nance Recom­men­da­tions for AI labs, and Se­nate Hear­ings on AI

23 May 2023 21:47 UTC
25 points
0 comments6 min readLW link
(newsletter.safe.ai)

The Po­lar­ity Prob­lem [Draft]

23 May 2023 21:05 UTC
24 points
3 comments44 min readLW link

Progress links and tweets, 2023-05-23

jasoncrawford23 May 2023 20:15 UTC
16 points
0 comments1 min readLW link
(rootsofprogress.org)

Yoshua Ben­gio: How Rogue AIs may Arise

harfe23 May 2023 18:28 UTC
92 points
12 comments18 min readLW link
(yoshuabengio.org)

‘Fun­da­men­tal’ vs ‘ap­plied’ mechanis­tic in­ter­pretabil­ity research

Lee Sharkey23 May 2023 18:26 UTC
65 points
6 comments3 min readLW link

Co­er­cion is an adap­ta­tion to scarcity; trust is an adap­ta­tion to abundance

Richard_Ngo23 May 2023 18:14 UTC
90 points
11 comments4 min readLW link

[Question] Is “brit­tle al­ign­ment” good enough?

the8thbit23 May 2023 17:35 UTC
9 points
5 comments3 min readLW link

Will Ar­tifi­cial Su­per­in­tel­li­gence Kill Us?

James_Miller23 May 2023 16:27 UTC
33 points
2 comments22 min readLW link

Phone Num­ber Jingle

jefftk23 May 2023 15:20 UTC
11 points
12 comments1 min readLW link
(www.jefftk.com)

GPT4 is ca­pa­ble of writ­ing de­cent long-form sci­ence fic­tion (with the right prompts)

RomanS23 May 2023 13:41 UTC
22 points
28 comments65 min readLW link

[Question] Do hu­mans still provide value in cor­re­spon­dence chess?

Jonathan Paulson23 May 2023 12:15 UTC
24 points
16 comments1 min readLW link

[Linkpost] The AGI Show podcast

Soroush Pour23 May 2023 9:52 UTC
4 points
0 comments1 min readLW link

Data and “to­kens” a 30 year old hu­man “trains” on

Jose Miguel Cruz y Celis23 May 2023 5:34 UTC
15 points
15 comments1 min readLW link

How I learned to stop wor­ry­ing and love skill trees

junk heap homotopy23 May 2023 4:08 UTC
81 points
3 comments1 min readLW link

T-Shirt Size Distribution

jefftk23 May 2023 2:40 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

AI self-im­prove­ment is possible

bhauth23 May 2023 2:32 UTC
18 points
3 comments8 min readLW link

Wor­ry­ing less about acausal extortion

Raemon23 May 2023 2:08 UTC
41 points
11 comments13 min readLW link

Self-lead­er­ship and self-love dis­solve anger and trauma

Richard_Ngo22 May 2023 22:30 UTC
70 points
7 comments5 min readLW link

A Man­i­fold mar­ket no­tice: Binance

Scrooge Mcduck22 May 2023 22:24 UTC
15 points
13 comments1 min readLW link

I don’t want to talk about AI

KirstenH22 May 2023 21:23 UTC
34 points
11 comments2 min readLW link
(ealifestyles.substack.com)

Ac­ti­va­tion ad­di­tions in a small resi­d­ual network

Garrett Baker22 May 2023 20:28 UTC
22 points
4 comments3 min readLW link

[Linkpost] “Gover­nance of su­per­in­tel­li­gence” by OpenAI

Daniel_Eth22 May 2023 20:15 UTC
67 points
20 comments1 min readLW link

Two Pie­ces of Ad­vice About How to Re­mem­ber Things

omnizoid22 May 2023 18:10 UTC
13 points
3 comments4 min readLW link

Why I Believe LLMs Do Not Have Hu­man-like Emotions

OneManyNone22 May 2023 15:46 UTC
13 points
6 comments7 min readLW link

AI Safety in China: Part 2

Lao Mein22 May 2023 14:50 UTC
95 points
28 comments2 min readLW link

Con­jec­ture in­ter­nal sur­vey: AGI timelines and prob­a­bil­ity of hu­man ex­tinc­tion from ad­vanced AI

Maris Sala22 May 2023 14:31 UTC
155 points
5 comments3 min readLW link
(www.conjecture.dev)

Papers, Please #1: Var­i­ous Papers on Em­ploy­ment, Wages and Productivity

Zvi22 May 2023 12:00 UTC
42 points
2 comments8 min readLW link
(thezvi.wordpress.com)

In Defense of «The Army of Jakoths»

MikkW22 May 2023 11:59 UTC
−14 points
10 comments4 min readLW link

Speed of in­for­ma­tion in­put is a bot­tle­neck for rationality

MikkW22 May 2023 10:24 UTC
13 points
0 comments4 min readLW link

Distil­la­tion of Neu­rotech and Align­ment Work­shop Jan­uary 2023

22 May 2023 7:17 UTC
51 points
9 comments14 min readLW link

The Treach­er­ous Turn is finished! (AI-takeover-themed table­top RPG)

Daniel Kokotajlo22 May 2023 5:49 UTC
55 points
5 comments2 min readLW link
(thetreacherousturn.ai)

The Stan­ley Parable: Mak­ing philos­o­phy fun

Nathan112322 May 2023 2:15 UTC
6 points
3 comments3 min readLW link

Sea Monsters

Adam Zerner22 May 2023 0:58 UTC
28 points
11 comments5 min readLW link

The Army of Jakoths (a parable)

MikkW21 May 2023 22:48 UTC
−6 points
0 comments1 min readLW link

A&I (Rihanna ‘S&M’ par­ody lyrics)

nahoj21 May 2023 22:34 UTC
−2 points
0 comments2 min readLW link

Four Bat­tle­grounds: Power in the Age of Ar­tifi­cial In­tel­li­gence (Book re­view)

PeterMcCluskey21 May 2023 21:19 UTC
25 points
0 comments4 min readLW link
(bayesianinvestor.com)

Gen­der Vec­tors in ROME’s La­tent Space

Xodarap21 May 2023 18:46 UTC
14 points
2 comments3 min readLW link