Play­ing Without Affordances

Alex HollowAug 18, 2022, 11:53 AM
11 points
0 comments1 min readLW link
(alexhollow.wordpress.com)

Goal-di­rect­ed­ness: rel­a­tivis­ing complexity

Morgan_RogersAug 18, 2022, 9:48 AM
3 points
0 comments11 min readLW link

What’s up with the bad Meta pro­jects?

YitzAug 18, 2022, 5:34 AM
42 points
29 comments1 min readLW link

An­nounc­ing En­cul­tured AI: Build­ing a Video Game

Aug 18, 2022, 2:16 AM
103 points
26 comments4 min readLW link

Detroit ACX Septem­ber Meetup

MattArnoldAug 18, 2022, 12:48 AM
1 point
0 comments1 min readLW link

Matt Ygle­sias on AI Policy

Grant DemareeAug 17, 2022, 11:57 PM
25 points
1 comment1 min readLW link
(www.slowboring.com)

Spoons and My­ofas­cial Trig­ger Points

vitaliyaAug 17, 2022, 10:54 PM
5 points
3 comments1 min readLW link

Con­crete Ad­vice for Form­ing In­side Views on AI Safety

Neel NandaAug 17, 2022, 10:02 PM
30 points
6 comments10 min readLW link

Progress links and tweets, 2022-08-17

jasoncrawfordAug 17, 2022, 9:27 PM
11 points
0 comments2 min readLW link
(rootsofprogress.org)

Con­di­tion­ing, Prompts, and Fine-Tuning

Adam JermynAug 17, 2022, 8:52 PM
38 points
9 comments4 min readLW link

The Core of the Align­ment Prob­lem is...

Aug 17, 2022, 8:07 PM
76 points
10 comments9 min readLW link

[Question] Could the simu­la­tion ar­gu­ment also ap­ply to dreams?

Nathan1123Aug 17, 2022, 7:55 PM
6 points
4 comments3 min readLW link

In­ter­pretabil­ity Tools Are an At­tack Channel

Thane RuthenisAug 17, 2022, 6:47 PM
42 points
14 comments1 min readLW link

Hu­man Mimicry Mainly Works When We’re Already Close

johnswentworthAug 17, 2022, 6:41 PM
82 points
16 comments5 min readLW link

Thoughts on ‘List of Lethal­ities’

Alex Lawsen Aug 17, 2022, 6:33 PM
27 points
0 comments10 min readLW link

The longest train­ing run

Aug 17, 2022, 5:18 PM
71 points
12 comments9 min readLW link
(epochai.org)

Spoiler-Free Re­view: Across the Obelisk

ZviAug 17, 2022, 2:30 PM
17 points
0 comments6 min readLW link
(thezvi.wordpress.com)

Au­ton­omy as tak­ing re­spon­si­bil­ity for refer­ence maintenance

Ramana KumarAug 17, 2022, 12:50 PM
61 points
3 comments5 min readLW link

Du­pli­cat­ing Ras­berry Pi Images

jefftkAug 17, 2022, 12:10 PM
9 points
4 comments4 min readLW link
(www.jefftk.com)

ACX Meetup—Amsterdam

Pierre VandenbergheAug 17, 2022, 9:56 AM
2 points
1 comment1 min readLW link

In­suffi­cient aware­ness of how ev­ery­thing sucks

FlaglandbaseAug 17, 2022, 8:01 AM
−13 points
5 comments1 min readLW link

Mesa-op­ti­miza­tion for goals defined only within a train­ing en­vi­ron­ment is dangerous

Rubi J. HudsonAug 17, 2022, 3:56 AM
6 points
2 comments4 min readLW link

ACX /​ SSC Meetup Singapore

DGAug 17, 2022, 2:08 AM
2 points
1 comment1 min readLW link

That-time-of-year As­tral Codex Ten Meetup

Ben SmithAug 17, 2022, 12:02 AM
3 points
2 comments1 min readLW link

SSC Reno Meetup

StevenAug 16, 2022, 11:37 PM
1 point
3 comments1 min readLW link

My thoughts on di­rect work (and join­ing LessWrong)

RobertMAug 16, 2022, 6:53 PM
58 points
4 comments6 min readLW link

We can make the fu­ture a mil­lion years from now go bet­ter [video]

WriterAug 16, 2022, 1:03 PM
7 points
1 comment6 min readLW link
(youtu.be)

The Open So­ciety and Its Ene­mies: Sum­mary and Thoughts

mattoAug 16, 2022, 11:44 AM
12 points
4 comments17 min readLW link

An in­tro­duc­tion to sig­nal­ling theory

MvolzAug 16, 2022, 9:37 AM
17 points
1 comment5 min readLW link

Un­der­stand­ing differ­ences be­tween hu­mans and in­tel­li­gence-in-gen­eral to build safe AGI

Florian_DietzAug 16, 2022, 8:27 AM
7 points
8 comments1 min readLW link

Against pop­u­la­tion ethics

jasoncrawfordAug 16, 2022, 5:19 AM
29 points
39 comments3 min readLW link

De­cep­tion as the op­ti­mal: mesa-op­ti­miz­ers and in­ner al­ign­ment

Eleni AngelouAug 16, 2022, 4:49 AM
11 points
0 comments5 min readLW link

Crowd­sourc­ing Anki Decks

ArdenAug 16, 2022, 2:53 AM
1 point
0 comments1 min readLW link

What Makes an Idea Un­der­stand­able? On Ar­chi­tec­turally and Cul­turally Nat­u­ral Ideas.

Aug 16, 2022, 2:09 AM
21 points
2 comments16 min readLW link

Dwarves & D.Sci: Data Fortress Eval­u­a­tion & Ruleset

aphyerAug 16, 2022, 12:15 AM
26 points
10 comments8 min readLW link

I’m mildly skep­ti­cal that blind­ness pre­vents schizophrenia

Steven ByrnesAug 15, 2022, 11:36 PM
83 points
9 comments4 min readLW link

What’s Gen­eral-Pur­pose Search, And Why Might We Ex­pect To See It In Trained ML Sys­tems?

johnswentworthAug 15, 2022, 10:48 PM
156 points
18 comments10 min readLW link

“What Mis­takes Are You Mak­ing Right Now?”

David UdellAug 15, 2022, 9:19 PM
13 points
2 comments1 min readLW link

On Prefer­ence Ma­nipu­la­tion in Re­ward Learn­ing Processes

Felix HofstätterAug 15, 2022, 7:32 PM
8 points
0 comments4 min readLW link

Cam­bist Book­ing: Dis­cussing What We Value

ScrewtapeAug 15, 2022, 6:24 PM
5 points
1 comment1 min readLW link

Cap­i­tal and in­equal­ity

NathanBarnardAug 15, 2022, 5:23 PM
7 points
2 comments5 min readLW link

[Question] Are there prac­ti­cal ex­er­cises for de­vel­op­ing the Scout mind­set?

ChristianKlAug 15, 2022, 5:23 PM
15 points
2 comments1 min readLW link

[Question] How do you get a job as a soft­ware de­vel­oper?

lsusrAug 15, 2022, 2:45 PM
22 points
24 comments1 min readLW link

The Parable of the Boy Who Cried 5% Chance of Wolf

KatWoodsAug 15, 2022, 2:33 PM
140 points
24 comments2 min readLW link

And the Rev­enues Are So Small

ZviAug 15, 2022, 1:00 PM
19 points
5 comments11 min readLW link
(thezvi.wordpress.com)

Ex­treme Security

lcAug 15, 2022, 12:11 PM
38 points
6 comments5 min readLW link

No short­cuts to knowl­edge: Why AI needs to ease up on scal­ing and learn how to code

YldedlyAug 15, 2022, 8:42 AM
5 points
0 comments1 min readLW link
(deoxyribose.github.io)

Seek­ing In­terns/​RAs for Mechanis­tic In­ter­pretabil­ity Projects

Neel NandaAug 15, 2022, 7:11 AM
61 points
0 comments2 min readLW link

A Mechanis­tic In­ter­pretabil­ity Anal­y­sis of Grokking

Aug 15, 2022, 2:41 AM
373 points
48 comments36 min readLW link1 review
(colab.research.google.com)

[Question] If a nuke is com­ing to­wards SF Bay can peo­ple bunker in BART tun­nels?

Pee DoomAug 15, 2022, 1:56 AM
15 points
2 comments1 min readLW link