TinyStories: Small Language Models That Still Speak Coherent English

Ulisse Mini · 28 May 2023 22:23 UTC
66 points
8 comments · 2 min read · LW link
(arxiv.org)

“Membranes” is better terminology than “boundaries” alone

28 May 2023 22:16 UTC
30 points
12 comments · 3 min read · LW link

The king token

p.b. · 28 May 2023 19:18 UTC
17 points
0 comments · 4 min read · LW link

Language Agents Reduce the Risk of Existential Catastrophe

28 May 2023 19:10 UTC
39 points
14 comments · 26 min read · LW link

Devil’s Advocate: Adverse Selection Against Conscientiousness

lionhearted (Sebastian Marshall) · 28 May 2023 17:53 UTC
10 points
2 comments · 1 min read · LW link

Reacts now enabled on 100% of posts, though still just experimenting

Ruby · 28 May 2023 5:36 UTC
88 points
73 comments · 2 min read · LW link

My AI Alignment Research Agenda and Threat Model, right now (May 2023)

Nicholas / Heather Kross · 28 May 2023 3:23 UTC
25 points
0 comments · 6 min read · LW link
(www.thinkingmuchbetter.com)

Kelly betting vs expectation maximization

MorgneticField · 28 May 2023 1:54 UTC
35 points
33 comments · 5 min read · LW link

Why and When Interpretability Work is Dangerous

Nicholas / Heather Kross · 28 May 2023 0:27 UTC
20 points
9 comments · 8 min read · LW link
(www.thinkingmuchbetter.com)

Twin Cities ACX Meetup—June 2023

Timothy M. · 27 May 2023 20:11 UTC
1 point
1 comment · 1 min read · LW link

Project Idea: Challenge Groups for Alignment Researchers

Adam Zerner · 27 May 2023 20:10 UTC
13 points
0 comments · 1 min read · LW link

Introspective Bayes

False Name · 27 May 2023 19:35 UTC
−3 points
2 comments · 16 min read · LW link

Should Rational Animations invite viewers to read content on LessWrong?

Writer · 27 May 2023 19:26 UTC
40 points
9 comments · 3 min read · LW link

Who are the Experts on Cryonics?

Mati_Roy · 27 May 2023 19:24 UTC
30 points
9 comments · 1 min read · LW link
(biostasis.substack.com)

AI and Planet Earth are incompatible.

archeon · 27 May 2023 18:59 UTC
−4 points
2 comments · 1 min read · LW link

South Bay ACX/LW Meetup

IS · 27 May 2023 17:25 UTC
2 points
0 comments · 1 min read · LW link

Hands-On Experience Is Not Magic

Thane Ruthenis · 27 May 2023 16:57 UTC
21 points
14 comments · 5 min read · LW link

Is Deontological AI Safe? [Feedback Draft]

27 May 2023 16:39 UTC
19 points
15 comments · 20 min read · LW link

San Francisco ACX Meetup “First Saturday” June 3, 1 pm

guenael · 27 May 2023 13:58 UTC
1 point
0 comments · 1 min read · LW link

Papers on protein design

alexlyzhov · 27 May 2023 1:18 UTC
9 points
0 comments · 3 min read · LW link

D&D.Sci 5E: Return of the League of Defenders

aphyer · 26 May 2023 20:39 UTC
42 points
11 comments · 3 min read · LW link

Seeking (Paid) Case Studies on Standards

HoldenKarnofsky · 26 May 2023 17:58 UTC
69 points
9 comments · 11 min read · LW link

Conditional Prediction with Zero-Sum Training Solves Self-Fulfilling Prophecies

26 May 2023 17:44 UTC
88 points
13 comments · 24 min read · LW link

Request: stop advancing AI capabilities

So8res · 26 May 2023 17:42 UTC
153 points
24 comments · 1 min read · LW link

Bandgaps, Brains, and Bioweapons: The limitations of computational science and what it means for AGI

titotal · 26 May 2023 15:57 UTC
36 points
20 comments · 1 min read · LW link

The American Information Revolution in Global Perspective

jasoncrawford · 26 May 2023 12:39 UTC
16 points
1 comment · 5 min read · LW link
(rootsofprogress.org)

Helio-Selenic Laser Telescope (in SPACE!?)

Alexander Gietelink Oldenziel · 26 May 2023 11:24 UTC
8 points
2 comments · 4 min read · LW link

[Question] Why is violence against AI labs a taboo?

ArisC · 26 May 2023 8:00 UTC
−21 points
63 comments · 1 min read · LW link

Where do you lie on two axes of world manipulability?

Max H · 26 May 2023 3:04 UTC
30 points
15 comments · 3 min read · LW link

Some thoughts on automating alignment research

Lukas Finnveden · 26 May 2023 1:50 UTC
30 points
4 comments · 6 min read · LW link

[Question] What’s your viewpoint on the likelihood of GPT-5 being able to autonomously create, train, and implement an AI superior to GPT-5?

Super AGI · 26 May 2023 1:43 UTC
7 points
15 comments · 1 min read · LW link

Before smart AI, there will be many mediocre or specialized AIs

Lukas Finnveden · 26 May 2023 1:38 UTC
57 points
10 comments · 9 min read · LW link · 1 review

how humans are aligned

bhauth · 26 May 2023 0:09 UTC
14 points
3 comments · 1 min read · LW link

[Question] What vegan food resources have you found useful?

Elizabeth · 25 May 2023 22:46 UTC
29 points
6 comments · 1 min read · LW link

Mob and Bailey

Screwtape · 25 May 2023 22:14 UTC
78 points
16 comments · 7 min read · LW link

Look At What’s In Front Of You (Conclusion to The Nuts and Bolts of Naturalism)

LoganStrohl · 25 May 2023 19:00 UTC
50 points
1 comment · 2 min read · LW link

[Market] Will AI xrisk seem to be handled seriously by the end of 2026?

tailcalled · 25 May 2023 18:51 UTC
15 points
2 comments · 1 min read · LW link
(manifold.markets)

[Question] What should my college major be if I want to do AI alignment research?

metachirality · 25 May 2023 18:23 UTC
8 points
7 comments · 1 min read · LW link

Is behavioral safety “solved” in non-adversarial conditions?

Robert_AIZI · 25 May 2023 17:56 UTC
26 points
8 comments · 2 min read · LW link
(aizi.substack.com)

Book Review: How Minds Change

bc4026bd4aaa5b7fe · 25 May 2023 17:55 UTC
310 points
52 comments · 15 min read · LW link

Self-administered EMDR without a therapist is very useful for a lot of things!

EternallyBlissful · 25 May 2023 17:54 UTC
49 points
12 comments · 11 min read · LW link

RecurrentGPT: a loom-type tool with a twist

mishka · 25 May 2023 17:09 UTC
10 points
0 comments · 3 min read · LW link
(arxiv.org)

The Genie in the Bottle: An Introduction to AI Alignment and Risk

Snorkelfarsan · 25 May 2023 16:30 UTC
5 points
1 comment · 25 min read · LW link

AI #13: Potential Algorithmic Improvements

Zvi · 25 May 2023 15:40 UTC
45 points
4 comments · 67 min read · LW link
(thezvi.wordpress.com)

Solving the Mechanistic Interpretability challenges: EIS VII Challenge 2

25 May 2023 15:37 UTC
71 points
1 comment · 13 min read · LW link

Malthusian Competition (not as bad as it seems)

Logan Zoellner · 25 May 2023 15:30 UTC
6 points
11 comments · 2 min read · LW link

You Don’t Always Need Indexes

jefftk · 25 May 2023 14:20 UTC
22 points
6 comments · 1 min read · LW link
(www.jefftk.com)

Theories of Biological Inspiration

Eric Zhang · 25 May 2023 13:07 UTC
7 points
3 comments · 1 min read · LW link

Evaluating strategic reasoning in GPT models

phelps-sg · 25 May 2023 11:51 UTC
4 points
1 comment · 8 min read · LW link

Requirements for a STEM-capable AGI Value Learner (my Case for Less Doom)

RogerDearnaley · 25 May 2023 9:26 UTC
33 points
4 comments · 15 min read · LW link