Smar­tyHead­erCode: anoma­lous to­kens for GPT3.5 and GPT-4

AdamYedidia15 Apr 2023 22:35 UTC
71 points
18 comments6 min readLW link

Open-source LLMs may prove Bostrom’s vuln­er­a­ble world hypothesis

Roope Ahvenharju15 Apr 2023 19:16 UTC
1 point
1 comment1 min readLW link

[linkpost] Elon Musk plans AI start-up to ri­val OpenAI

Hatfield15 Apr 2023 19:06 UTC
11 points
11 comments1 min readLW link
(www.ft.com)

FLI re­port: Poli­cy­mak­ing in the Pause

Zach Stein-Perlman15 Apr 2023 17:01 UTC
9 points
3 comments1 min readLW link
(futureoflife.org)

Reflec­tive jour­nal en­tries us­ing GPT-4 and Ob­sidian that de­mand less willpower.

Solenoid_Entity15 Apr 2023 12:45 UTC
56 points
24 comments7 min readLW link

An ex­am­ple ele­va­tor pitch for AI doom

laserfiche15 Apr 2023 12:29 UTC
2 points
5 comments1 min readLW link

AI as Con­tact with our Col­lec­tive Unconscious

Scott Broock15 Apr 2023 2:11 UTC
−4 points
6 comments4 min readLW link

The Truth About False

Thoth Hermes15 Apr 2023 1:01 UTC
−21 points
4 comments17 min readLW link
(thothhermes.substack.com)

The ‘ pe­ter­todd’ phenomenon

mwatkins15 Apr 2023 0:59 UTC
192 points
49 comments38 min readLW link

[Question] Con­cave Utility Question

Scott Garrabrant15 Apr 2023 0:14 UTC
55 points
36 comments2 min readLW link

List of re­quests for an AI slow­down/​halt.

Cleo Nardo14 Apr 2023 23:55 UTC
46 points
6 comments1 min readLW link

[linkpost] “What Are Rea­son­able AI Fears?” by Robin Han­son, 2023-04-23

Arjun Panickssery14 Apr 2023 23:26 UTC
26 points
16 comments1 min readLW link

“Do X be­cause de­ci­sion the­ory” ~= “Do X be­cause bayes the­o­rem”

lc14 Apr 2023 20:57 UTC
39 points
1 comment2 min readLW link

LLMs and hal­lu­ci­na­tion, like white on rice?

Bill Benzon14 Apr 2023 19:53 UTC
5 points
0 comments3 min readLW link

GPT-4 is eas­ily con­trol­led/​ex­ploited with tricky de­ci­sion the­o­retic dilem­mas.

scasper14 Apr 2023 19:39 UTC
6 points
4 comments2 min readLW link

On Car­ing about our AI Progeny

PeterMcCluskey14 Apr 2023 19:32 UTC
22 points
5 comments1 min readLW link
(bayesianinvestor.com)

Moder­a­tion notes re: re­cent Said/​Dun­can threads

Raemon14 Apr 2023 18:06 UTC
50 points
560 comments2 min readLW link

What we’ve learned so far from our tech­nolog­i­cal temp­ta­tions project

Richard Korzekwa 14 Apr 2023 17:46 UTC
15 points
4 comments11 min readLW link
(aiimpacts.org)

[Question] How does con­scious­ness in­ter­act with ar­chi­tec­ture?

FinalFormal214 Apr 2023 15:56 UTC
5 points
3 comments1 min readLW link

Iqisa: A Library For Han­dling Fore­cast­ing Datasets

niplav14 Apr 2023 15:16 UTC
27 points
0 comments1 min readLW link

What’s this prob­a­bil­ity you’re re­port­ing?

EOC and SCP
14 Apr 2023 15:07 UTC
19 points
10 comments3 min readLW link

Nav­i­gat­ing AI Risks (NAIR) #1: Slow­ing Down AI

simeon_c14 Apr 2023 14:35 UTC
11 points
3 comments1 min readLW link
(navigatingairisks.substack.com)

[Question] What would the FLI mora­to­rium ac­tu­ally do?

ChristianKl14 Apr 2023 13:14 UTC
17 points
7 comments1 min readLW link

Re­search Re­port: In­cor­rect­ness Cascades

Robert_AIZI14 Apr 2023 12:49 UTC
19 points
0 comments10 min readLW link
(aizi.substack.com)

The self-un­al­ign­ment problem

14 Apr 2023 12:10 UTC
150 points
24 comments10 min readLW link

AI Safety Europe Re­treat 2023 Retrospective

Magdalena Wache14 Apr 2023 9:05 UTC
43 points
0 comments2 min readLW link

[Question] What’s the differ­ence be­tween Wis­dom and Ra­tion­al­ity?

Yoav Ravid14 Apr 2023 6:22 UTC
8 points
4 comments1 min readLW link

Shap­ley Value At­tri­bu­tion in Chain of Thought

leogao14 Apr 2023 5:56 UTC
103 points
7 comments4 min readLW link

A fresh­man year dur­ing the AI midgame: my ap­proach to the next year

Buck14 Apr 2023 0:38 UTC
152 points
14 comments1 min readLW link

Against AI Un­der­stand­ing and Sen­tience: Large Lan­guage Models, Mean­ing, and the Pat­terns of Hu­man Lan­guage Use

Jonathan Yan13 Apr 2023 23:29 UTC
−1 points
0 comments1 min readLW link
(philsci-archive.pitt.edu)

At­tributes of suc­cess­ful professors

electroswing13 Apr 2023 20:38 UTC
13 points
8 comments5 min readLW link

Fi­nan­cial Times: We must slow down the race to God-like AI

trevor13 Apr 2023 19:55 UTC
113 points
17 comments16 min readLW link
(www.ft.com)

R0 Is Not Counterfactual

jefftk13 Apr 2023 19:50 UTC
33 points
9 comments2 min readLW link
(www.jefftk.com)

Sub­scripts for Probabilities

niplav13 Apr 2023 18:32 UTC
67 points
9 comments5 min readLW link

The Virus—Short Story

Michael Soareverix13 Apr 2023 18:18 UTC
4 points
0 comments4 min readLW link

First ACX Brno Meetup

adekcz13 Apr 2023 17:42 UTC
2 points
0 comments1 min readLW link

Pol­lut­ing the agen­tic commons

hamandcheese13 Apr 2023 17:42 UTC
7 points
4 comments2 min readLW link
(www.secondbest.ca)

Cam­bridge LW Meetup: When Science Isn’t Enough

13 Apr 2023 17:36 UTC
2 points
0 comments1 min readLW link

Even if hu­man & AI al­ign­ment are just as easy, we are screwed

Matthew_Opitz13 Apr 2023 17:32 UTC
35 points
5 comments5 min readLW link

In­tro to On­to­ge­netic Curriculum

Eris13 Apr 2023 17:15 UTC
19 points
1 comment2 min readLW link

Was Homer a stochas­tic par­rot? Mean­ing in liter­ary texts and LLMs

Bill Benzon13 Apr 2023 16:44 UTC
7 points
4 comments3 min readLW link

AI #7: Free Agency

Zvi13 Apr 2023 16:20 UTC
33 points
12 comments47 min readLW link
(thezvi.wordpress.com)

Nav­i­gat­ing the Open-Source AI Land­scape: Data, Fund­ing, and Safety

13 Apr 2023 15:29 UTC
32 points
7 comments11 min readLW link
(forum.effectivealtruism.org)

On AutoGPT

Zvi13 Apr 2023 12:30 UTC
248 points
47 comments20 min readLW link
(thezvi.wordpress.com)

Iden­ti­fy­ing se­man­tic neu­rons, mechanis­tic cir­cuits & in­ter­pretabil­ity web apps

13 Apr 2023 11:59 UTC
18 points
0 comments8 min readLW link

Try­ing Agen­tGPT, an Au­toGPT variant

Gunnar_Zarncke13 Apr 2023 10:13 UTC
10 points
9 comments1 min readLW link

An­nounc­ing Epoch’s dash­board of key trends and figures in Ma­chine Learning

Jsevillamol13 Apr 2023 7:33 UTC
35 points
7 comments1 min readLW link
(epochai.org)

[Question] What is the best source to ex­plain short AI timelines to a skep­ti­cal per­son?

trevor13 Apr 2023 4:29 UTC
12 points
12 comments1 min readLW link

“Aligned” foun­da­tion mod­els don’t im­ply al­igned systems

Max H13 Apr 2023 4:13 UTC
39 points
11 comments5 min readLW link

[Question] Us­ing ChatGPT for mem­ory re­con­soli­da­tion?

warrenjordan13 Apr 2023 1:27 UTC
3 points
2 comments1 min readLW link