Align­ment be­ing im­pos­si­ble might be bet­ter than it be­ing re­ally difficult

Martín SotoJul 25, 2022, 11:57 PM
13 points
2 comments2 min readLW link

[Question] How op­ti­mistic should we be about AI figur­ing out how to in­ter­pret it­self?

oh54321Jul 25, 2022, 10:09 PM
3 points
1 comment1 min readLW link

Pro­tec­tion­ism in One Coun­try: How In­dus­trial Policy Worked in Canada

Davis KedroskyJul 25, 2022, 10:08 PM
5 points
0 comments16 min readLW link
(daviskedrosky.substack.com)

Mis­takes as agency

pchvykovJul 25, 2022, 4:17 PM
12 points
8 comments4 min readLW link

My Bit­coin Th­e­sis @2022 - Part 1

aysajanJul 25, 2022, 3:49 PM
7 points
6 comments13 min readLW link

The Reader’s Guide to Op­ti­mal Mone­tary Policy

Ege ErdilJul 25, 2022, 3:10 PM
57 points
10 comments14 min readLW link

AGI Safety Needs Peo­ple With All Skil­lsets!

Severin T. SeehrichJul 25, 2022, 1:32 PM
28 points
0 comments2 min readLW link

[Question] Is there any ev­i­dence that hand­wash­ing does any­thing to pre­vent COVID?

mukashiJul 25, 2022, 7:34 AM
4 points
3 comments1 min readLW link

Open­ing Ses­sion Tips & Advice

CFAR!DuncanJul 25, 2022, 3:57 AM
95 points
3 comments14 min readLW link1 review

How much should we worry about mesa-op­ti­miza­tion challenges?

sudoJul 25, 2022, 3:56 AM
4 points
13 comments2 min readLW link

[Question] Does agent foun­da­tions cover all fu­ture ML sys­tems?

Jonas HallgrenJul 25, 2022, 1:17 AM
4 points
0 comments1 min readLW link

Unify­ing Bar­gain­ing No­tions (1/​2)

DiffractorJul 25, 2022, 12:28 AM
210 points
41 comments16 min readLW link

Re­ward is not the op­ti­miza­tion target

TurnTroutJul 25, 2022, 12:03 AM
376 points
123 comments10 min readLW link3 reviews

Brain­storm of things that could force an AI team to burn their lead

So8resJul 24, 2022, 11:58 PM
134 points
8 comments13 min readLW link

Find­ing Skele­tons on Rashomon Ridge

Jul 24, 2022, 10:31 PM
30 points
2 comments7 min readLW link

Gather­ing In­for­ma­tion you won’t use di­rectly is of­ten useful

Johannes C. MayerJul 24, 2022, 9:21 PM
6 points
1 comment1 min readLW link

[Question] Im­pact of ” ‘Let’s think step by step’ is all you need”?

yrimonJul 24, 2022, 8:59 PM
20 points
2 comments1 min readLW link

The Most Im­por­tant Cen­tury: The Animation

Jul 24, 2022, 8:58 PM
46 points
2 comments20 min readLW link
(youtu.be)

Hiring Pro­gram­mers in Academia

jefftkJul 24, 2022, 8:20 PM
36 points
19 comments2 min readLW link
(www.jefftk.com)

Less Wrong Bu­dapest July 30th Meetup

Richard HorvathJul 24, 2022, 7:07 PM
2 points
0 comments1 min readLW link

Re­la­tion­ship be­tween sub­jec­tive ex­pe­rience and in­tel­li­gence?

Q HomeJul 24, 2022, 9:10 AM
5 points
4 comments9 min readLW link

Dou­ble Crux

CFAR!DuncanJul 24, 2022, 6:34 AM
61 points
9 comments11 min readLW link

Ex­am­ple Meetup Description

JuliusJul 24, 2022, 5:38 AM
6 points
0 comments2 min readLW link

Eaves­drop­ping on Aliens: A Data De­cod­ing Challenge

anonymousaisafetyJul 24, 2022, 4:35 AM
49 points
9 comments4 min readLW link

In­for­ma­tion the­o­retic model anal­y­sis may not lend much in­sight, but we may have been do­ing them wrong!

Garrett BakerJul 24, 2022, 12:42 AM
7 points
0 comments10 min readLW link

What’s next for in­stru­men­tal ra­tio­nal­ity?

Andrew_CritchJul 23, 2022, 10:55 PM
63 points
7 comments1 min readLW link

Easy guide for run­ning a lo­cal Ra­tion­al­ity meetup

nsokolskyJul 23, 2022, 10:52 PM
13 points
1 comment6 min readLW link

Cu­rat­ing “The Epistemic Se­quences” (list v.0.1)

Andrew_CritchJul 23, 2022, 10:17 PM
65 points
12 comments7 min readLW link

Room Opening

jefftkJul 23, 2022, 9:00 PM
8 points
3 comments4 min readLW link
(www.jefftk.com)

A Bias Against Altruism

Lone PineJul 23, 2022, 8:44 PM
58 points
30 comments2 min readLW link

What En­vi­ron­ment Prop­er­ties Select Agents For World-Model­ing?

Thane RuthenisJul 23, 2022, 7:27 PM
25 points
1 comment12 min readLW link

Which sin­gu­lar­ity schools plus the no sin­gu­lar­ity school was right?

Noosphere89Jul 23, 2022, 3:16 PM
9 points
26 comments9 min readLW link

Ba­sic Post Scarcity Q&A

lorepieriJul 23, 2022, 1:43 PM
3 points
0 comments1 min readLW link
(lorenzopieri.com)

Ro­bust­ness to Scal­ing Down: More Im­por­tant Than I Thought

adamShimiJul 23, 2022, 11:40 AM
38 points
5 comments3 min readLW link

Eat­ing Boogers

George3d6Jul 23, 2022, 11:20 AM
17 points
5 comments6 min readLW link
(www.epistem.ink)

On Akra­sia, Habits and Re­ward Maximization

AiyenJul 23, 2022, 8:34 AM
14 points
1 comment6 min readLW link

Which val­ues are sta­ble un­der on­tol­ogy shifts?

Richard_NgoJul 23, 2022, 2:40 AM
75 points
48 comments3 min readLW link
(thinkingcomplete.blogspot.com)

Try­ing out Prompt Eng­ineer­ing on TruthfulQA

Megan KinnimentJul 23, 2022, 2:04 AM
10 points
0 comments8 min readLW link

Con­nor Leahy on Dy­ing with Dig­nity, EleutherAI and Conjecture

Michaël TrazziJul 22, 2022, 6:44 PM
195 points
29 comments14 min readLW link
(theinsideview.ai)

Wy­clif’s Dust: the miss­ing chapter

David Hugh-JonesJul 22, 2022, 6:27 PM
9 points
0 comments4 min readLW link
(wyclif.substack.com)

Mak­ing DALL-E Count

DirectedEvolutionJul 22, 2022, 9:11 AM
23 points
12 comments4 min readLW link

One-day ap­plied ra­tio­nal­ity work­shop in Ber­lin Aug 29 (af­ter LWCW)

Duncan Sabien (Inactive)Jul 22, 2022, 7:58 AM
30 points
5 comments2 min readLW link

In­ter­nal Dou­ble Crux

CFAR!DuncanJul 22, 2022, 4:34 AM
93 points
15 comments12 min readLW link

Con­di­tion­ing Gen­er­a­tive Models with Restrictions

Adam JermynJul 21, 2022, 8:33 PM
18 points
4 comments8 min readLW link

Our Ex­ist­ing Solu­tions to AGI Align­ment (semi-safe)

Michael SoareverixJul 21, 2022, 7:00 PM
12 points
1 comment3 min readLW link

Chang­ing the world through slack & hobbies

Steven ByrnesJul 21, 2022, 6:11 PM
261 points
13 comments10 min readLW link

Which per­son­al­ities do we find in­tol­er­able?

weathersystemsJul 21, 2022, 3:56 PM
10 points
3 comments6 min readLW link

YouTubeTV and Spoilers

ZviJul 21, 2022, 1:50 PM
16 points
6 comments8 min readLW link
(thezvi.wordpress.com)

Covid 7/​21/​22: Fea­tur­ing ASPR

ZviJul 21, 2022, 1:50 PM
27 points
0 comments14 min readLW link
(thezvi.wordpress.com)

[Question] How much to op­ti­mize for the short-timelines sce­nario?

SoerenMindJul 21, 2022, 10:47 AM
20 points
3 comments1 min readLW link