The Gallery for Painting Transformations—A GPT-3 Analogy

Robert_AIZI · 19 Jan 2023 23:32 UTC
1 point
0 comments · 6 min read · LW link
(aizi.substack.com)

AGI safety field building projects I’d like to see

Severin T. Seehrich · 19 Jan 2023 22:40 UTC
68 points
28 comments · 9 min read · LW link

Extensionality and the univalence axiom of type theory

Thomas Kehrenberg · 19 Jan 2023 22:36 UTC
6 points
1 comment · 16 min read · LW link

The spiritual benefits of material progress

jasoncrawford · 19 Jan 2023 21:35 UTC
24 points
15 comments · 7 min read · LW link
(rootsofprogress.org)

Announcing Cavendish Labs

19 Jan 2023 20:15 UTC
59 points
5 comments · 2 min read · LW link
(forum.effectivealtruism.org)

Thoughts on refusing harmful requests to large language models

William_S · 19 Jan 2023 19:49 UTC
32 points
4 comments · 2 min read · LW link

MA RMV Overloaded

jefftk · 19 Jan 2023 16:40 UTC
16 points
0 comments · 2 min read · LW link
(www.jefftk.com)

“Heretical Thoughts on AI” by Eli Dourado

DragonGod · 19 Jan 2023 16:11 UTC
145 points
38 comments · 3 min read · LW link
(www.elidourado.com)

Covid 1/19/23: Flipped Numbers

Zvi · 19 Jan 2023 13:30 UTC
19 points
4 comments · 4 min read · LW link
(thezvi.wordpress.com)

List of technical AI safety exercises and projects

JakubK · 19 Jan 2023 9:35 UTC
41 points
5 comments · 1 min read · LW link
(docs.google.com)

Group-level Consequences of Psychological Problems

19 Jan 2023 9:27 UTC
28 points
3 comments · 2 min read · LW link

6-paragraph AI risk intro for MAISI

JakubK · 19 Jan 2023 9:22 UTC
11 points
0 comments · 2 min read · LW link
(www.maisi.club)

200 COP in MI: Studying Learned Features in Language Models

Neel Nanda · 19 Jan 2023 3:48 UTC
24 points
2 comments · 30 min read · LW link

Amazon closing AmazonSmile to focus its philanthropic giving to programs with greater impact

Gordon Seidoh Worley · 19 Jan 2023 1:15 UTC
10 points
8 comments · 1 min read · LW link

Gradient Filtering

18 Jan 2023 20:09 UTC
54 points
16 comments · 13 min read · LW link

[Cross-post] Is the Fermi Paradox due to the Flaw of Averages?

18 Jan 2023 19:22 UTC
41 points
27 comments · 15 min read · LW link
(lumina.com)

First Three Episodes of The Filan Cabinet

DanielFilan · 18 Jan 2023 19:20 UTC
17 points
1 comment · 1 min read · LW link

[Question] Best Questions To Vet Potential Ai-Safety Applicants

jacksonjezion · 18 Jan 2023 19:01 UTC
6 points
1 comment · 1 min read · LW link

[Question] Looking for a specific group of people

FriggenRedChickenMan · 18 Jan 2023 19:00 UTC
15 points
21 comments · 1 min read · LW link

A problem with group epistemics

Mckay Jensen · 18 Jan 2023 17:06 UTC
4 points
4 comments · 3 min read · LW link
(quevivasbien.github.io)

Why you should learn sign language

Noah Topper · 18 Jan 2023 17:03 UTC
53 points
23 comments · 7 min read · LW link
(naivebayes.substack.com)

Flying With Covid

jefftk · 18 Jan 2023 17:00 UTC
44 points
29 comments · 3 min read · LW link
(www.jefftk.com)

Prototype of Using GPT-3 to Generate Textbook-length Content

Rafael Cosman · 18 Jan 2023 14:25 UTC
2 points
8 comments · 40 min read · LW link
(github.com)

How many people are working (directly) on reducing existential risk from AI?

Benjamin Hilton · 18 Jan 2023 8:46 UTC
20 points
1 comment · 1 min read · LW link

EA & LW Forum Summaries (9th Jan to 15th Jan 23′)

Zoe Williams · 18 Jan 2023 7:29 UTC
17 points
0 comments · 1 min read · LW link

OpenAI’s Alignment Plan is not S.M.A.R.T.

Søren Elverlin · 18 Jan 2023 6:39 UTC
9 points
19 comments · 4 min read · LW link

[Question] Formal definition of Ontology Mismatch?

NathanBarnard · 18 Jan 2023 5:52 UTC
6 points
0 comments · 1 min read · LW link

[Question] Transformer Mech Interp: Any visualizations?

Joyee Chen · 18 Jan 2023 4:32 UTC
3 points
0 comments · 1 min read · LW link

Neural networks generalize because of this one weird trick

Jesse Hoogland · 18 Jan 2023 0:10 UTC
171 points
28 comments · 53 min read · LW link
(www.jessehoogland.com)

Progress links and tweets, 2023-01-17

jasoncrawford · 17 Jan 2023 21:31 UTC
13 points
3 comments · 2 min read · LW link
(rootsofprogress.org)

Quotes Worth Talking About

akaTrickster · 17 Jan 2023 21:26 UTC
−1 points
0 comments · 3 min read · LW link

Building a transhumanist future: 15 years of hplusroadmap, now Discord

kanzure · 17 Jan 2023 21:17 UTC
42 points
1 comment · 1 min read · LW link
(twitter.com)

Ad Fraud Detection Prediction Market

jefftk · 17 Jan 2023 18:10 UTC
17 points
0 comments · 2 min read · LW link
(www.jefftk.com)

Collin Burns on Alignment Research And Discovering Latent Knowledge Without Supervision

Michaël Trazzi · 17 Jan 2023 17:21 UTC
25 points
5 comments · 4 min read · LW link
(theinsideview.ai)

Lessons learned and review of the AI Safety Nudge Competition

Marc Carauleanu · 17 Jan 2023 17:13 UTC
3 points
0 comments · 1 min read · LW link

Five Reasons to Lie

Dzoldzaya · 17 Jan 2023 16:53 UTC
0 points
19 comments · 3 min read · LW link

On AI and Interest Rates

Zvi · 17 Jan 2023 15:00 UTC
79 points
13 comments · 8 min read · LW link
(thezvi.wordpress.com)

Language models can generate superior text compared to their input

ChristianKl · 17 Jan 2023 10:57 UTC
48 points
28 comments · 1 min read · LW link

Löbian emotional processing of emergent cooperation: an example

Andrew_Critch · 17 Jan 2023 5:59 UTC
23 points
0 comments · 8 min read · LW link

Preparing for AI-assisted alignment research: we need data!

CBiddulph · 17 Jan 2023 3:28 UTC
31 points
3 comments · 1 min read · LW link

Tesla Model 3 Review

jefftk · 17 Jan 2023 1:10 UTC
18 points
15 comments · 4 min read · LW link
(www.jefftk.com)

[Question] Should AI writers be prohibited in education?

Eleni Angelou · 17 Jan 2023 0:42 UTC
6 points
2 comments · 1 min read · LW link

What can thought-experiments do?

Cleo Nardo · 17 Jan 2023 0:35 UTC
16 points
3 comments · 5 min read · LW link

Experiment Idea: RL Agents Evading Learned Shutdownability

Leon Lang · 16 Jan 2023 22:46 UTC
31 points
7 comments · 17 min read · LW link
(docs.google.com)

Consequentialists: One-Way Pattern Traps

David Udell · 16 Jan 2023 20:48 UTC
59 points
3 comments · 14 min read · LW link

Book Review: Worlds of Flow

remember · 16 Jan 2023 20:17 UTC
83 points
3 comments · 9 min read · LW link

For the Record: DL ∩ ASI = ∅

maximkazhenkov · 16 Jan 2023 19:04 UTC
12 points
13 comments · 2 min read · LW link

[Question] What determines female romantic “market value”?

anon_girl · 16 Jan 2023 18:45 UTC
16 points
50 comments · 1 min read · LW link

Status conscious

avantika.mehra · 16 Jan 2023 17:44 UTC
2 points
0 comments · 5 min read · LW link

Confusing the ideal for the necessary

adamShimi · 16 Jan 2023 17:29 UTC
79 points
6 comments · 1 min read · LW link
(epistemologicalvigilance.substack.com)