RSS

Steven Byrnes

Karma: 21,639

I’m an AGI safety /​ AI alignment researcher in Boston with a particular focus on brain algorithms. Research Fellow at Astera. See https://​​sjbyrnes.com/​​agi.html for a summary of my research and sorted list of writing. Physicist by training. Email: steven.byrnes@gmail.com. Leave me anonymous feedback here. I’m also at: RSS feed, X/​Twitter, Bluesky, LinkedIn, and more at my website.

Self-di­alogue: Do be­hav­iorist re­wards make schem­ing AGIs?

Steven ByrnesFeb 13, 2025, 6:39 PM
43 points
0 comments46 min readLW link

“Sharp Left Turn” dis­course: An opinionated review

Steven ByrnesJan 28, 2025, 6:47 PM
205 points
26 comments31 min readLW link

Her­i­ta­bil­ity: Five Battles

Steven ByrnesJan 14, 2025, 6:21 PM
79 points
21 comments60 min readLW link

Ap­ply­ing tra­di­tional eco­nomic think­ing to AGI: a trilemma

Steven ByrnesJan 13, 2025, 1:23 AM
144 points
32 comments3 min readLW link

My AGI safety re­search—2024 re­view, ’25 plans

Steven ByrnesDec 31, 2024, 9:05 PM
109 points
4 comments8 min readLW link

A short­com­ing of con­crete demon­stra­tions as AGI risk advocacy

Steven ByrnesDec 11, 2024, 4:48 PM
103 points
27 comments2 min readLW link

Neu­ro­science of hu­man so­cial in­stincts: a sketch

Steven ByrnesNov 22, 2024, 4:16 PM
69 points
0 comments31 min readLW link

[In­tu­itive self-mod­els] 8. Root­ing Out Free Will Intuitions

Steven ByrnesNov 4, 2024, 6:16 PM
70 points
16 comments24 min readLW link

[In­tu­itive self-mod­els] 7. Hear­ing Voices, and Other Hallucinations

Steven ByrnesOct 29, 2024, 1:36 PM
51 points
2 comments16 min readLW link

[In­tu­itive self-mod­els] 6. Awak­en­ing /​ En­light­en­ment /​ PNSE

Steven ByrnesOct 22, 2024, 1:23 PM
63 points
8 comments21 min readLW link

Against em­pa­thy-by-default

Steven ByrnesOct 16, 2024, 4:38 PM
60 points
24 comments7 min readLW link

[In­tu­itive self-mod­els] 5. Dis­so­ci­a­tive Iden­tity (Mul­ti­ple Per­son­al­ity) Disorder

Steven ByrnesOct 15, 2024, 1:31 PM
59 points
7 comments11 min readLW link

[In­tu­itive self-mod­els] 4. Trance

Steven ByrnesOct 8, 2024, 1:30 PM
82 points
7 comments24 min readLW link

[In­tu­itive self-mod­els] 3. The Homunculus

Steven ByrnesOct 2, 2024, 3:20 PM
78 points
38 comments25 min readLW link

[In­tu­itive self-mod­els] 2. Con­scious Awareness

Steven ByrnesSep 25, 2024, 1:29 PM
82 points
60 comments16 min readLW link

[In­tu­itive self-mod­els] 1. Preliminaries

Steven ByrnesSep 19, 2024, 1:45 PM
91 points
23 comments15 min readLW link

Re­sponse to Dileep Ge­orge: AGI safety war­rants plan­ning ahead

Steven ByrnesJul 8, 2024, 3:27 PM
27 points
7 comments27 min readLW link

In­cen­tive Learn­ing vs Dead Sea Salt Experiment

Steven ByrnesJun 25, 2024, 5:49 PM
30 points
1 comment28 min readLW link

(Ap­pet­i­tive, Con­sum­ma­tory) ≈ (RL, re­flex)

Steven ByrnesJun 15, 2024, 3:57 PM
38 points
1 comment3 min readLW link

[Valence se­ries] 4. Valence & Lik­ing /​ Admiring

Steven ByrnesJun 10, 2024, 2:19 PM
48 points
12 comments14 min readLW link