RSS

Steven Byrnes

Karma: 22,277

I’m an AGI safety /​ AI alignment researcher in Boston with a particular focus on brain algorithms. Research Fellow at Astera. See https://​​sjbyrnes.com/​​agi.html for a summary of my research and sorted list of writing. Physicist by training. Email: steven.byrnes@gmail.com. Leave me anonymous feedback here. I’m also at: RSS feed, X/​Twitter, Bluesky, Substack, LinkedIn, and more at my website.

Re­sponse to Dileep Ge­orge: AGI safety war­rants plan­ning ahead

Steven Byrnes8 Jul 2024 15:27 UTC
27 points
7 comments27 min readLW link

In­cen­tive Learn­ing vs Dead Sea Salt Experiment

Steven Byrnes25 Jun 2024 17:49 UTC
30 points
1 comment28 min readLW link

(Ap­pet­i­tive, Con­sum­ma­tory) ≈ (RL, re­flex)

Steven Byrnes15 Jun 2024 15:57 UTC
38 points
1 comment3 min readLW link

[Valence se­ries] 4. Valence & Lik­ing /​ Admiring

Steven Byrnes10 Jun 2024 14:19 UTC
48 points
12 comments15 min readLW link

Re­sponse to nos­talge­braist: proudly wav­ing my moral-an­tire­al­ist bat­tle flag

Steven Byrnes29 May 2024 16:48 UTC
103 points
29 comments11 min readLW link

Spa­tial at­ten­tion as a “tell” for em­pa­thetic simu­la­tion?

Steven Byrnes26 Apr 2024 15:10 UTC
55 points
12 comments8 min readLW link

A cou­ple pro­duc­tivity tips for overthinkers

Steven Byrnes20 Apr 2024 16:05 UTC
79 points
13 comments4 min readLW link

“Ar­tifi­cial Gen­eral In­tel­li­gence”: an ex­tremely brief FAQ

Steven Byrnes11 Mar 2024 17:49 UTC
74 points
6 comments2 min readLW link

Some (prob­le­matic) aes­thet­ics of what con­sti­tutes good work in academia

Steven Byrnes11 Mar 2024 17:47 UTC
148 points
12 comments12 min readLW link

Woods’ new preprint on ob­ject permanence

Steven Byrnes7 Mar 2024 21:29 UTC
58 points
1 comment6 min readLW link

So­cial sta­tus part 2/​2: ev­ery­thing else

Steven Byrnes5 Mar 2024 16:29 UTC
65 points
2 comments23 min readLW link

So­cial sta­tus part 1/​2: ne­go­ti­a­tions over ob­ject-level preferences

Steven Byrnes5 Mar 2024 16:29 UTC
118 points
15 comments21 min readLW link

Four vi­sions of Trans­for­ma­tive AI success

Steven Byrnes17 Jan 2024 20:45 UTC
112 points
22 comments15 min readLW link

De­cep­tive AI ≠ De­cep­tively-al­igned AI

Steven Byrnes7 Jan 2024 16:55 UTC
96 points
19 comments6 min readLW link

[Valence se­ries] Ap­pendix A: He­donic tone /​ (dis)plea­sure /​ (dis)liking

Steven Byrnes20 Dec 2023 15:54 UTC
18 points
0 comments13 min readLW link

[Valence se­ries] 5. “Valence Di­sor­ders” in Men­tal Health & Personality

Steven Byrnes18 Dec 2023 15:26 UTC
45 points
13 comments13 min readLW link

[Valence se­ries] 4. Valence & So­cial Sta­tus (de­p­re­cated)

Steven Byrnes15 Dec 2023 14:24 UTC
35 points
19 comments11 min readLW link

[Valence se­ries] 3. Valence & Beliefs

Steven Byrnes11 Dec 2023 20:21 UTC
77 points
12 comments21 min readLW link1 review

[Valence se­ries] 2. Valence & Normativity

Steven Byrnes7 Dec 2023 16:43 UTC
88 points
7 comments28 min readLW link1 review

[Valence se­ries] 1. Introduction

Steven Byrnes4 Dec 2023 15:40 UTC
99 points
16 comments16 min readLW link2 reviews