Steven Byrnes

Karma: 21,639

I’m an AGI safety / AI alignment researcher in Boston with a particular focus on brain algorithms. Research Fellow at Astera. See https://sjbyrnes.com/agi.html for a summary of my research and sorted list of writing. Physicist by training. Email: steven.byrnes@gmail.com. Leave me anonymous feedback here. I’m also at: RSS feed, X/Twitter, Bluesky, LinkedIn, and more at my website.

Self-dialogue: Do behaviorist rewards make scheming AGIs?

Steven Byrnes · Feb 13, 2025, 6:39 PM

43 points

0 comments · 46 min read · LW link

“Sharp Left Turn” discourse: An opinionated review

Steven Byrnes · Jan 28, 2025, 6:47 PM

205 points

26 comments · 31 min read · LW link

Heritability: Five Battles

Steven Byrnes · Jan 14, 2025, 6:21 PM

79 points

21 comments · 60 min read · LW link

Applying traditional economic thinking to AGI: a trilemma

Steven Byrnes · Jan 13, 2025, 1:23 AM

144 points

32 comments · 3 min read · LW link

My AGI safety research—2024 review, ’25 plans

Steven Byrnes · Dec 31, 2024, 9:05 PM

109 points

4 comments · 8 min read · LW link

A shortcoming of concrete demonstrations as AGI risk advocacy

Steven Byrnes · Dec 11, 2024, 4:48 PM

103 points

27 comments · 2 min read · LW link

Neuroscience of human social instincts: a sketch

Steven Byrnes · Nov 22, 2024, 4:16 PM

69 points

0 comments · 31 min read · LW link

[Intuitive self-models] 8. Rooting Out Free Will Intuitions

Steven Byrnes · Nov 4, 2024, 6:16 PM

70 points

16 comments · 24 min read · LW link

[Intuitive self-models] 7. Hearing Voices, and Other Hallucinations

Steven Byrnes · Oct 29, 2024, 1:36 PM

51 points

2 comments · 16 min read · LW link

[Intuitive self-models] 6. Awakening / Enlightenment / PNSE

Steven Byrnes · Oct 22, 2024, 1:23 PM

63 points

8 comments · 21 min read · LW link

Against empathy-by-default

Steven Byrnes · Oct 16, 2024, 4:38 PM

60 points

24 comments · 7 min read · LW link

[Intuitive self-models] 5. Dissociative Identity (Multiple Personality) Disorder

Steven Byrnes · Oct 15, 2024, 1:31 PM

59 points

7 comments · 11 min read · LW link

[Intuitive self-models] 4. Trance

Steven Byrnes · Oct 8, 2024, 1:30 PM

82 points

7 comments · 24 min read · LW link

[Intuitive self-models] 3. The Homunculus

Steven Byrnes · Oct 2, 2024, 3:20 PM

78 points

38 comments · 25 min read · LW link

[Intuitive self-models] 2. Conscious Awareness

Steven Byrnes · Sep 25, 2024, 1:29 PM

82 points

60 comments · 16 min read · LW link

[Intuitive self-models] 1. Preliminaries

Steven Byrnes · Sep 19, 2024, 1:45 PM

91 points

23 comments · 15 min read · LW link

Response to Dileep George: AGI safety warrants planning ahead

Steven Byrnes · Jul 8, 2024, 3:27 PM

27 points

7 comments · 27 min read · LW link

Incentive Learning vs Dead Sea Salt Experiment

Steven Byrnes · Jun 25, 2024, 5:49 PM

30 points

1 comment · 28 min read · LW link

(Appetitive, Consummatory) ≈ (RL, reflex)

Steven Byrnes · Jun 15, 2024, 3:57 PM

38 points

1 comment · 3 min read · LW link

[Valence series] 4. Valence & Liking / Admiring

Steven Byrnes · Jun 10, 2024, 2:19 PM

48 points

12 comments · 14 min read · LW link