Gra­di­ent Filtering

18 Jan 2023 20:09 UTC
55 points
16 comments13 min readLW link

[Cross-post] Is the Fermi Para­dox due to the Flaw of Aver­ages?

18 Jan 2023 19:22 UTC
41 points
27 comments15 min readLW link
(lumina.com)

First Three Epi­sodes of The Filan Cabinet

DanielFilan18 Jan 2023 19:20 UTC
17 points
1 comment1 min readLW link

[Question] Best Ques­tions To Vet Po­ten­tial Ai-Safety Applicants

jacksonjezion18 Jan 2023 19:01 UTC
6 points
1 comment1 min readLW link

[Question] Look­ing for a spe­cific group of people

FriggenRedChickenMan18 Jan 2023 19:00 UTC
15 points
21 comments1 min readLW link

A prob­lem with group epistemics

Mckay Jensen18 Jan 2023 17:06 UTC
4 points
4 comments3 min readLW link
(quevivasbien.github.io)

Why you should learn sign language

Noah Topper18 Jan 2023 17:03 UTC
53 points
23 comments7 min readLW link
(naivebayes.substack.com)

Fly­ing With Covid

jefftk18 Jan 2023 17:00 UTC
44 points
29 comments3 min readLW link
(www.jefftk.com)

Pro­to­type of Us­ing GPT-3 to Gen­er­ate Text­book-length Content

Rafael Cosman18 Jan 2023 14:25 UTC
2 points
8 comments40 min readLW link
(github.com)

How many peo­ple are work­ing (di­rectly) on re­duc­ing ex­is­ten­tial risk from AI?

Benjamin Hilton18 Jan 2023 8:46 UTC
20 points
1 comment1 min readLW link

EA & LW Fo­rum Sum­maries (9th Jan to 15th Jan 23′)

Zoe Williams18 Jan 2023 7:29 UTC
17 points
0 comments1 min readLW link

OpenAI’s Align­ment Plan is not S.M.A.R.T.

Søren Elverlin18 Jan 2023 6:39 UTC
9 points
19 comments4 min readLW link

[Question] For­mal defi­ni­tion of On­tol­ogy Mis­match?

NathanBarnard18 Jan 2023 5:52 UTC
6 points
0 comments1 min readLW link

[Question] Trans­former Mech In­terp: Any vi­su­al­iza­tions?

Joyee Chen18 Jan 2023 4:32 UTC
3 points
0 comments1 min readLW link

Neu­ral net­works gen­er­al­ize be­cause of this one weird trick

Jesse Hoogland18 Jan 2023 0:10 UTC
179 points
29 comments53 min readLW link1 review
(www.jessehoogland.com)

Progress links and tweets, 2023-01-17

jasoncrawford17 Jan 2023 21:31 UTC
13 points
3 comments2 min readLW link
(rootsofprogress.org)

Quotes Worth Talk­ing About

akaTrickster17 Jan 2023 21:26 UTC
−1 points
0 comments3 min readLW link

Build­ing a tran­shu­man­ist fu­ture: 15 years of hplus­roadmap, now Discord

kanzure17 Jan 2023 21:17 UTC
42 points
1 comment1 min readLW link
(twitter.com)

Ad Fraud De­tec­tion Pre­dic­tion Market

jefftk17 Jan 2023 18:10 UTC
17 points
0 comments2 min readLW link
(www.jefftk.com)

Col­lin Burns on Align­ment Re­search And Dis­cov­er­ing La­tent Knowl­edge Without Supervision

Michaël Trazzi17 Jan 2023 17:21 UTC
25 points
5 comments4 min readLW link
(theinsideview.ai)

Les­sons learned and re­view of the AI Safety Nudge Competition

Marc Carauleanu17 Jan 2023 17:13 UTC
3 points
0 comments1 min readLW link

Five Rea­sons to Lie

Dzoldzaya17 Jan 2023 16:53 UTC
0 points
19 comments3 min readLW link

On AI and In­ter­est Rates

Zvi17 Jan 2023 15:00 UTC
79 points
13 comments8 min readLW link
(thezvi.wordpress.com)

Lan­guage mod­els can gen­er­ate su­pe­rior text com­pared to their input

ChristianKl17 Jan 2023 10:57 UTC
48 points
28 comments1 min readLW link

Löbian emo­tional pro­cess­ing of emer­gent co­op­er­a­tion: an example

Andrew_Critch17 Jan 2023 5:59 UTC
23 points
0 comments8 min readLW link

Prepar­ing for AI-as­sisted al­ign­ment re­search: we need data!

CBiddulph17 Jan 2023 3:28 UTC
31 points
3 comments1 min readLW link

Tesla Model 3 Review

jefftk17 Jan 2023 1:10 UTC
18 points
15 comments4 min readLW link
(www.jefftk.com)

[Question] Should AI writ­ers be pro­hibited in ed­u­ca­tion?

Eleni Angelou17 Jan 2023 0:42 UTC
6 points
2 comments1 min readLW link

What can thought-ex­per­i­ments do?

Cleo Nardo17 Jan 2023 0:35 UTC
16 points
3 comments5 min readLW link

Ex­per­i­ment Idea: RL Agents Evad­ing Learned Shutdownability

Leon Lang16 Jan 2023 22:46 UTC
31 points
7 comments17 min readLW link
(docs.google.com)

Con­se­quen­tial­ists: One-Way Pat­tern Traps

David Udell16 Jan 2023 20:48 UTC
59 points
3 comments14 min readLW link

Book Re­view: Wor­lds of Flow

remember16 Jan 2023 20:17 UTC
83 points
3 comments9 min readLW link

For the Record: DL ∩ ASI = ∅

maximkazhenkov16 Jan 2023 19:04 UTC
12 points
13 comments2 min readLW link

[Question] What de­ter­mines fe­male ro­man­tic “mar­ket value”?

anon_girl16 Jan 2023 18:45 UTC
16 points
50 comments1 min readLW link

Sta­tus conscious

avantika.mehra16 Jan 2023 17:44 UTC
2 points
0 comments5 min readLW link

Con­fus­ing the ideal for the necessary

adamShimi16 Jan 2023 17:29 UTC
79 points
6 comments1 min readLW link
(epistemologicalvigilance.substack.com)

Tyler Cowen AMA on the Progress Forum

jasoncrawford16 Jan 2023 17:23 UTC
19 points
0 comments1 min readLW link
(progressforum.org)

Reflec­tions on Trust­ing Trust & AI

Itay Yona16 Jan 2023 6:36 UTC
10 points
1 comment3 min readLW link
(mentaleap.ai)

Is “Earn­ing to Give” a Bad Frame­work?

clans16 Jan 2023 5:35 UTC
2 points
4 comments6 min readLW link
(locationtbd.home.blog)

Why you ask the sig­nifi­cance ques­tion why

Slider16 Jan 2023 3:44 UTC
6 points
0 comments1 min readLW link

In­vest­ment, Work, and Vi­sion: Who is re­spon­si­ble for cre­at­ing value?

Sable16 Jan 2023 1:57 UTC
0 points
10 comments8 min readLW link
(affablyevil.substack.com)

Con­clu­sion and Bibliog­ra­phy for “Un­der­stand­ing the diffu­sion of large lan­guage mod­els”

Ben Cottier16 Jan 2023 1:46 UTC
4 points
0 comments1 min readLW link

Ques­tions for fur­ther in­ves­ti­ga­tion of AI diffusion

Ben Cottier16 Jan 2023 1:46 UTC
4 points
0 comments1 min readLW link

Im­pli­ca­tions of large lan­guage model diffu­sion for AI governance

Ben Cottier16 Jan 2023 1:45 UTC
7 points
0 comments1 min readLW link

Publi­ca­tion de­ci­sions for large lan­guage mod­els, and their impacts

Ben Cottier16 Jan 2023 1:44 UTC
4 points
0 comments1 min readLW link

Drivers of large lan­guage model diffu­sion: in­cre­men­tal re­search, pub­lic­ity, and cascades

Ben Cottier16 Jan 2023 1:44 UTC
4 points
0 comments1 min readLW link

The repli­ca­tion and em­u­la­tion of GPT-3

Ben Cottier16 Jan 2023 1:40 UTC
4 points
0 comments1 min readLW link

GPT-3-like mod­els are now much eas­ier to ac­cess and de­ploy than to develop

Ben Cottier16 Jan 2023 1:39 UTC
12 points
3 comments1 min readLW link

Back­ground for “Un­der­stand­ing the diffu­sion of large lan­guage mod­els”

Ben Cottier16 Jan 2023 1:38 UTC
4 points
0 comments1 min readLW link

Un­der­stand­ing the diffu­sion of large lan­guage mod­els: summary

Ben Cottier16 Jan 2023 1:37 UTC
26 points
1 comment1 min readLW link