“Just hiring peo­ple” is some­times still ac­tu­ally possible

lcAug 5, 2022, 9:44 PM
38 points
11 comments5 min readLW link

The need for certainty

Thomas McMurtryAug 5, 2022, 8:18 PM
2 points
0 comments4 min readLW link

Rant on Prob­lem Fac­tor­iza­tion for Alignment

johnswentworthAug 5, 2022, 7:23 PM
102 points
53 comments6 min readLW link

Coun­ter­fac­tu­als are Con­fus­ing be­cause of an On­tolog­i­cal Shift

Chris_LeongAug 5, 2022, 7:03 PM
17 points
35 comments2 min readLW link

Orange county ACX/​Less-Wrong dis­cus­sion group and hang-out. (or­ange county)

Michael MichalchikAug 5, 2022, 6:25 PM
2 points
0 comments1 min readLW link

Gears-Level Un­der­stand­ing, De­liber­ate Perfor­mance, The Strate­gic Level

CFAR!DuncanAug 5, 2022, 5:11 PM
30 points
3 comments5 min readLW link

[Question] COVID-19 Group Test­ing Post-mortem?

gwernAug 5, 2022, 4:32 PM
72 points
6 comments2 min readLW link

Where are the red lines for AI?

Karl von WendtAug 5, 2022, 9:34 AM
26 points
10 comments6 min readLW link

Bridg­ing Ex­pected Utility Max­i­miza­tion and Optimization

Daniel HerrmannAug 5, 2022, 8:18 AM
25 points
5 comments14 min readLW link

Deon­tol­ogy and Tool AI

Nathan1123Aug 5, 2022, 5:20 AM
4 points
5 comments6 min readLW link

An at­tempt to un­der­stand the Com­plex­ity of Values

Dalton MaberyAug 5, 2022, 4:43 AM
3 points
0 comments5 min readLW link

$20K In Boun­ties for AI Safety Public Materials

Aug 5, 2022, 2:52 AM
71 points
9 comments6 min readLW link

Two Kids Crosswise

jefftkAug 5, 2022, 2:40 AM
16 points
3 comments1 min readLW link
(www.jefftk.com)

The Fal­ling Drill

ScrewtapeAug 5, 2022, 12:08 AM
46 points
3 comments2 min readLW link

Con­ver­gence Towards World-Models: A Gears-Level Model

Thane RuthenisAug 4, 2022, 11:31 PM
38 points
1 comment13 min readLW link

Cam­bist Booking

ScrewtapeAug 4, 2022, 10:40 PM
20 points
3 comments4 min readLW link

Cal­ibra­tion Trivia

ScrewtapeAug 4, 2022, 10:31 PM
12 points
9 comments4 min readLW link

Monthly Shorts 7/​22

CelerAug 4, 2022, 10:30 PM
5 points
0 comments3 min readLW link
(keller.substack.com)

The Prag­mas­cope Idea

johnswentworthAug 4, 2022, 9:52 PM
59 points
20 comments3 min readLW link

Run­ning a Ba­sic Meetup

ScrewtapeAug 4, 2022, 9:49 PM
20 points
1 comment2 min readLW link

Fiber arts, mys­te­ri­ous do­dec­a­he­drons, and wait­ing on “Eureka!”

eukaryoteAug 4, 2022, 8:37 PM
124 points
15 comments9 min readLW link1 review
(eukaryotewritesblog.com)

[Question] Would “Man­hat­tan Pro­ject” style be benefi­cial or dele­te­ri­ous for AI Align­ment?

Valentin2026Aug 4, 2022, 7:12 PM
5 points
1 comment1 min readLW link

[Question] AI al­ign­ment: Would a lazy self-preser­va­tion in­stinct be suffi­cient?

BrainFrogAug 4, 2022, 5:53 PM
−1 points
4 comments1 min readLW link

So­cratic Duck­ing, OODA Loops, Frame-by-Frame Debugging

CFAR!DuncanAug 4, 2022, 5:44 PM
26 points
1 comment5 min readLW link

What do ML re­searchers think about AI in 2022?

KatjaGraceAug 4, 2022, 3:40 PM
221 points
33 comments3 min readLW link
(aiimpacts.org)

In­ter­pretabil­ity isn’t Free

Joel BurgetAug 4, 2022, 3:02 PM
10 points
1 comment2 min readLW link

Covid 8/​4/​22: Rebound

ZviAug 4, 2022, 11:20 AM
36 points
0 comments11 min readLW link
(thezvi.wordpress.com)

High Reli­a­bil­ity Orgs, and AI Companies

RaemonAug 4, 2022, 5:45 AM
86 points
7 comments12 min readLW link1 review

Sur­prised by ELK re­port’s coun­terex­am­ple to De­bate, IDA

Evan R. MurphyAug 4, 2022, 2:12 AM
18 points
0 comments5 min readLW link

Clap­ping Lower

jefftkAug 4, 2022, 2:10 AM
38 points
7 comments1 min readLW link
(www.jefftk.com)

[Question] How do I know if my first post should be a post, or a ques­tion?

Nathan1123Aug 4, 2022, 1:46 AM
3 points
4 comments1 min readLW link

Three pillars for avoid­ing AGI catas­tro­phe: Tech­ni­cal al­ign­ment, de­ploy­ment de­ci­sions, and coordination

LintzAAug 3, 2022, 11:15 PM
24 points
0 comments11 min readLW link

Pre­cur­sor check­ing for de­cep­tive alignment

evhubAug 3, 2022, 10:56 PM
24 points
0 comments14 min readLW link

Trans­former lan­guage mod­els are do­ing some­thing more general

NumendilAug 3, 2022, 9:13 PM
53 points
6 comments2 min readLW link

[Question] Some doubts about Non Su­per­in­tel­li­gent AIs

aditya malikAug 3, 2022, 7:55 PM
0 points
4 comments1 min readLW link

An­nounc­ing Squig­gle: Early Access

ozziegooenAug 3, 2022, 7:48 PM
51 points
7 comments7 min readLW link
(forum.effectivealtruism.org)

Sur­vey: What (de)mo­ti­vates you about AI risk?

Daniel_FriedrichAug 3, 2022, 7:17 PM
1 point
0 comments1 min readLW link
(forms.gle)

Ex­ter­nal­ized rea­son­ing over­sight: a re­search di­rec­tion for lan­guage model alignment

tameraAug 3, 2022, 12:03 PM
135 points
23 comments6 min readLW link

Open & Wel­come Thread—Aug/​Sep 2022

ThomasAug 3, 2022, 10:22 AM
9 points
32 comments1 min readLW link

[Question] How does one rec­og­nize in­for­ma­tion and differ­en­ti­ate it from noise?

M. Y. ZuoAug 3, 2022, 3:57 AM
4 points
29 comments1 min readLW link

Law-Fol­low­ing AI 4: Don’t Rely on Vi­car­i­ous Liability

CullenAug 2, 2022, 11:26 PM
5 points
2 comments3 min readLW link

Two-year up­date on my per­sonal AI timelines

Ajeya CotraAug 2, 2022, 11:07 PM
293 points
60 comments16 min readLW link

What are the Red Flags for Neu­ral Net­work Suffer­ing? - Seeds of Science call for reviewers

rogersbaconAug 2, 2022, 10:37 PM
24 points
6 comments1 min readLW link

Againstness

CFAR!DuncanAug 2, 2022, 7:29 PM
50 points
8 comments9 min readLW link

(Sum­mary) Se­quence High­lights—Think­ing Bet­ter on Purpose

qazzquimbyAug 2, 2022, 5:45 PM
33 points
3 comments11 min readLW link

Progress links and tweets, 2022-08-02

jasoncrawfordAug 2, 2022, 5:03 PM
9 points
0 comments1 min readLW link
(rootsofprogress.org)

[Question] I want to donate some money (not much, just what I can af­ford) to AGI Align­ment re­search, to what­ever or­ga­ni­za­tion has the best chance of mak­ing sure that AGI goes well and doesn’t kill us all. What are my best op­tions, where can I make the most differ­ence per dol­lar?

lumenwritesAug 2, 2022, 12:08 PM
15 points
9 comments1 min readLW link

Think­ing with­out pri­ors?

Q HomeAug 2, 2022, 9:17 AM
7 points
0 comments9 min readLW link

[Question] Would quan­tum im­mor­tal­ity mean sub­jec­tive im­mor­tal­ity?

n0ahAug 2, 2022, 4:54 AM
2 points
10 comments1 min readLW link

Turbocharging

CFAR!DuncanAug 2, 2022, 12:01 AM
52 points
4 comments9 min readLW link