RSS

Steven Byrnes

Karma: 22,103

I’m an AGI safety /​ AI alignment researcher in Boston with a particular focus on brain algorithms. Research Fellow at Astera. See https://​​sjbyrnes.com/​​agi.html for a summary of my research and sorted list of writing. Physicist by training. Email: steven.byrnes@gmail.com. Leave me anonymous feedback here. I’m also at: RSS feed, X/​Twitter, Bluesky, LinkedIn, and more at my website.

In­cen­tive Learn­ing vs Dead Sea Salt Experiment

Steven ByrnesJun 25, 2024, 5:49 PM
30 points
1 comment28 min readLW link

(Ap­pet­i­tive, Con­sum­ma­tory) ≈ (RL, re­flex)

Steven ByrnesJun 15, 2024, 3:57 PM
38 points
1 comment3 min readLW link

[Valence se­ries] 4. Valence & Lik­ing /​ Admiring

Steven ByrnesJun 10, 2024, 2:19 PM
48 points
12 comments14 min readLW link

Re­sponse to nos­talge­braist: proudly wav­ing my moral-an­tire­al­ist bat­tle flag

Steven ByrnesMay 29, 2024, 4:48 PM
103 points
29 comments11 min readLW link

Spa­tial at­ten­tion as a “tell” for em­pa­thetic simu­la­tion?

Steven ByrnesApr 26, 2024, 3:10 PM
55 points
12 comments8 min readLW link

A cou­ple pro­duc­tivity tips for overthinkers

Steven ByrnesApr 20, 2024, 4:05 PM
78 points
13 comments4 min readLW link

“Ar­tifi­cial Gen­eral In­tel­li­gence”: an ex­tremely brief FAQ

Steven ByrnesMar 11, 2024, 5:49 PM
74 points
6 comments2 min readLW link

Some (prob­le­matic) aes­thet­ics of what con­sti­tutes good work in academia

Steven ByrnesMar 11, 2024, 5:47 PM
148 points
12 comments12 min readLW link

Woods’ new preprint on ob­ject permanence

Steven ByrnesMar 7, 2024, 9:29 PM
58 points
1 comment6 min readLW link

So­cial sta­tus part 2/​2: ev­ery­thing else

Steven ByrnesMar 5, 2024, 4:29 PM
65 points
2 comments23 min readLW link

So­cial sta­tus part 1/​2: ne­go­ti­a­tions over ob­ject-level preferences

Steven ByrnesMar 5, 2024, 4:29 PM
118 points
15 comments21 min readLW link

Four vi­sions of Trans­for­ma­tive AI success

Steven ByrnesJan 17, 2024, 8:45 PM
112 points
22 comments15 min readLW link

De­cep­tive AI ≠ De­cep­tively-al­igned AI

Steven ByrnesJan 7, 2024, 4:55 PM
96 points
19 comments6 min readLW link

[Valence se­ries] Ap­pendix A: He­donic tone /​ (dis)plea­sure /​ (dis)liking

Steven ByrnesDec 20, 2023, 3:54 PM
18 points
0 comments13 min readLW link

[Valence se­ries] 5. “Valence Di­sor­ders” in Men­tal Health & Personality

Steven ByrnesDec 18, 2023, 3:26 PM
45 points
12 comments13 min readLW link

[Valence se­ries] 4. Valence & So­cial Sta­tus (de­p­re­cated)

Steven ByrnesDec 15, 2023, 2:24 PM
35 points
19 comments11 min readLW link

[Valence se­ries] 3. Valence & Beliefs

Steven ByrnesDec 11, 2023, 8:21 PM
77 points
12 comments21 min readLW link1 review

[Valence se­ries] 2. Valence & Normativity

Steven ByrnesDec 7, 2023, 4:43 PM
88 points
7 comments28 min readLW link1 review

[Valence se­ries] 1. Introduction

Steven ByrnesDec 4, 2023, 3:40 PM
99 points
16 comments16 min readLW link2 reviews

Thoughts on “AI is easy to con­trol” by Pope & Belrose

Steven ByrnesDec 1, 2023, 5:30 PM
197 points
63 comments14 min readLW link1 review