Subagents

Tag

Why Subagents?

johnswentworthAug 1, 2019, 10:17 PM

175 points

48 comments7 min readLW link 1 review

Multi-agent predictive minds and AI alignment

Jan_KulveitDec 12, 2018, 11:48 PM

63 points

18 comments10 min readLW link

Building up to an Internal Family Systems model

Kaj_SotalaJan 26, 2019, 12:25 PM

289 points

86 comments28 min readLW link 2 reviews

A non-mystical explanation of insight meditation and the three characteristics of existence: introduction and preamble

Kaj_SotalaMay 5, 2020, 7:09 PM

134 points

40 comments12 min readLW link

Mental Mountains

Scott AlexanderNov 27, 2019, 5:30 AM

159 points

14 comments15 min readLW link 1 review

(slatestarcodex.com)

Book Summary: Consciousness and the Brain

Kaj_SotalaJan 16, 2019, 2:43 PM

179 points

20 comments26 min readLW link 1 review

Forcing yourself to keep your identity small is self-harm

Gordon Seidoh WorleyApr 3, 2021, 2:03 PM

40 points

10 comments2 min readLW link

Resolving internal conflicts requires listening to what parts want

Richard_NgoMay 19, 2023, 12:04 AM

71 points

0 comments4 min readLW link

My current take on Internal Family Systems “parts”

Kaj_SotalaJun 26, 2022, 5:40 PM

97 points

11 comments3 min readLW link

(kajsotala.fi)

The hostile telepaths problem

ValentineOct 27, 2024, 3:26 PM

383 points

89 comments15 min readLW link

Quick thoughts on the implications of multi-agent views of mind on AI takeover

Kaj_SotalaDec 11, 2023, 6:34 AM

47 points

14 comments4 min readLW link

Simulate and Defer To More Rational Selves

LoganStrohlSep 17, 2014, 6:11 PM

217 points

114 comments5 min readLW link

[Question] How effective are tulpas?

EvenflairMar 9, 2020, 5:35 PM

40 points

60 comments2 min readLW link

Subagents, trauma and rationality

Kaj_SotalaAug 14, 2019, 1:14 PM

113 points

4 comments19 min readLW link

Subagents, akrasia, and coherence in humans

Kaj_SotalaMar 25, 2019, 2:24 PM

141 points

31 comments16 min readLW link

Subagents, neural Turing machines, thought selection, and blindspots

Kaj_SotalaAug 6, 2019, 9:15 PM

87 points

3 comments12 min readLW link

Integrating disagreeing subagents

Kaj_SotalaMay 14, 2019, 2:06 PM

147 points

15 comments21 min readLW link

Subagents, introspective awareness, and blending

Kaj_SotalaMar 2, 2019, 12:53 PM

110 points

19 comments9 min readLW link

[Question] How to select a long-term goal and align my mind towards it?

AlexanderDec 24, 2021, 11:40 AM

19 points

8 comments2 min readLW link

Book summary: Unlocking the Emotional Brain

Kaj_SotalaOct 8, 2019, 7:11 PM

334 points

48 comments21 min readLW link 3 reviews

Complex Behavior from Simple (Sub)Agents

moridinamaelMay 10, 2019, 9:44 PM

113 points

14 comments9 min readLW link 1 review

Shoulder Advisors 101

Duncan Sabien (Inactive)Oct 9, 2021, 5:30 AM

203 points

124 comments14 min readLW link 2 reviews

Consistently Inconsistent

Kaj_SotalaAug 4, 2011, 10:33 PM

81 points

25 comments5 min readLW link

City of Lights

AlicornMar 31, 2010, 11:30 PM

55 points

43 comments4 min readLW link

On the construction of the self

Kaj_SotalaMay 29, 2020, 1:04 PM

78 points

18 comments17 min readLW link

Two Explorations

alkjashDec 16, 2020, 9:27 PM

63 points

8 comments9 min readLW link

(radimentary.wordpress.com)

Remarks 1–18 on GPT (compressed)

Cleo NardoMar 20, 2023, 10:27 PM

145 points

35 comments31 min readLW link

Internalizing Internal Double Crux

TurnTroutApr 30, 2018, 6:23 PM

36 points

12 comments4 min readLW link

Announcing the Alignment of Complex Systems Research Group

Jan_Kulveit and technicalities

Jun 4, 2022, 4:10 AM

91 points

20 comments5 min readLW link

Neural Basis for Global Workspace Theory

HazardJun 22, 2020, 4:19 AM

31 points

9 comments8 min readLW link

A mechanistic model of meditation

Kaj_SotalaNov 6, 2019, 9:37 PM

136 points

12 comments21 min readLW link

Intrapersonal negotiation

datadataeverywhereJan 23, 2011, 11:02 PM

34 points

42 comments4 min readLW link

Reward Is Not Enough

Steven ByrnesJun 16, 2021, 1:52 PM

124 points

19 comments10 min readLW link 1 review

Wildfire of strategicness

TsviBTJun 5, 2023, 1:59 PM

38 points

19 comments1 min readLW link

Embedded Agency via Abstraction

johnswentworthAug 26, 2019, 11:03 PM

42 points

20 comments11 min readLW link

The Game of Masks

SlimepriestessApr 27, 2022, 6:03 PM

50 points

18 comments11 min readLW link

(hivewired.wordpress.com)

What Value Subagents?

Gordon Seidoh WorleyJul 20, 2017, 7:19 PM

7 points

1 comment4 min readLW link

(mapandterritory.org)

Mental subagent implications for AI Safety

moridinamaelJan 3, 2021, 6:59 PM

11 points

0 comments3 min readLW link

A Master-Slave Model of Human Preferences

Wei DaiDec 29, 2009, 1:02 AM

101 points

94 comments3 min readLW link

Sequence introduction: non-agent and multiagent models of mind

Kaj_SotalaJan 7, 2019, 2:12 PM

125 points

16 comments7 min readLW link 1 review

Game Theory without Argmax [Part 2]

Cleo NardoNov 11, 2023, 4:02 PM

31 points

14 comments13 min readLW link

Goodhart’s Law inside the human mind

Kaj_SotalaApr 17, 2023, 1:48 PM

125 points

13 comments16 min readLW link

Indecision and internalized authority figures

Kaj_SotalaJul 6, 2024, 10:10 AM

69 points

1 comment2 min readLW link

(kajsotala.fi)

Game Theory without Argmax [Part 1]

Cleo NardoNov 11, 2023, 3:59 PM

70 points

18 comments19 min readLW link

Self-empathy as a source of “willpower”

AcademianOct 26, 2010, 2:20 PM

83 points

32 comments2 min readLW link

Many therapy schools work with inner multiplicity (not just IFS)

David Althaus and Ewelina Tur

Sep 17, 2022, 10:27 AM

52 points

16 comments18 min readLW link

The horror of what must, yet cannot, be true

Kaj_SotalaJun 2, 2022, 10:20 AM

52 points

18 comments2 min readLW link

(kajsotala.fi)

A non-mystical explanation of “no-self” (three characteristics series)

Kaj_SotalaMay 8, 2020, 10:37 AM

120 points

65 comments20 min readLW link 1 review

Conditions under which misaligned subagents can (not) arise in classifiers

anon1Jul 11, 2018, 1:52 AM

12 points

2 comments2 min readLW link

System 2 as working-memory augmented System 1 reasoning

Kaj_SotalaSep 25, 2019, 8:39 AM

110 points

23 comments16 min readLW link

Seven Shiny Stories

AlicornJun 1, 2010, 12:43 AM

145 points

34 comments7 min readLW link

Eight Definitions of Observability

Scott GarrabrantNov 10, 2020, 11:37 PM

34 points

26 comments12 min readLW link

Tentatively considering emotional stories (IFS and “getting into Self”)

Kaj_SotalaNov 30, 2018, 7:40 AM

40 points

31 comments4 min readLW link

(kajsotala.fi)

Slack matters more than any outcome

ValentineDec 31, 2022, 8:11 PM

164 points

56 comments19 min readLW link 1 review

One: a story

Richard_NgoOct 10, 2023, 12:18 AM

30 points

0 comments4 min readLW link

(www.narrativeark.xyz)

Robust Agency for People and Organizations

RaemonJul 19, 2019, 1:18 AM

65 points

10 comments12 min readLW link

Conflicts Between Mental Subagents: Expanding Wei Dai’s Master-Slave Model

Scott AlexanderAug 4, 2010, 9:16 AM

71 points

81 comments10 min readLW link

Hierarchical Agency: A Missing Piece in AI Alignment

Jan_KulveitNov 27, 2024, 5:49 AM

112 points

21 comments11 min readLW link

Why Productivity Systems Don’t Stick

Matt GoldenbergJan 16, 2021, 5:45 PM

62 points

22 comments3 min readLW link

Embedded Agency (full-text version)

Scott Garrabrant and abramdemski

Nov 15, 2018, 7:49 PM

209 points

17 comments54 min readLW link

Shard Theory: An Overview

David UdellAug 11, 2022, 5:44 AM

167 points

34 comments10 min readLW link

Three characteristics: impermanence

Kaj_SotalaJun 5, 2020, 7:48 AM

73 points

4 comments18 min readLW link

Craving, suffering, and predictive processing (three characteristics series)

Kaj_SotalaMay 15, 2020, 1:21 PM

96 points

56 comments19 min readLW link

Internal communication framework

rosehadshar and Nora_Ammann

Nov 15, 2022, 12:41 PM

38 points

13 comments12 min readLW link

Resolving von Neumann-Morgenstern Inconsistent Preferences

niplavOct 22, 2024, 11:45 AM

38 points

5 comments58 min readLW link

Strategic ignorance and plausible deniability

Kaj_SotalaAug 10, 2011, 9:30 AM

62 points

59 comments4 min readLW link

Ayn Rand’s model of “living money”; and an upside of burnout

AnnaSalamonNov 16, 2024, 2:59 AM

231 points

59 comments5 min readLW link

[Question] Anyone been through IFS or coherence therapy?

warrenjordanMar 15, 2021, 6:35 PM

5 points

3 comments1 min readLW link

Non-Coercive Perfectionism

Matt GoldenbergJan 26, 2021, 4:53 PM

25 points

25 comments3 min readLW link

Synthesis of subagents: exercise

Julija KobrinovichSep 20, 2019, 5:24 PM

10 points

2 comments14 min readLW link

On Internal Family Systems and multi-agent minds: a reply to PJ Eby

Kaj_SotalaOct 29, 2019, 2:56 PM

41 points

31 comments25 min readLW link

Actually updating

SaraHaxAug 23, 2019, 5:46 PM

56 points

10 comments4 min readLW link

The self-unalignment problem

Jan_Kulveit and rosehadshar

Apr 14, 2023, 12:10 PM

155 points

24 comments10 min readLW link

Two Coordination Styles

abramdemskiFeb 7, 2018, 9:00 AM

41 points

14 comments7 min readLW link

From self to craving (three characteristics series)

Kaj_SotalaMay 22, 2020, 12:16 PM

63 points

21 comments11 min readLW link

A Framework for Internal Debugging

Matt GoldenbergJan 16, 2019, 4:04 PM

44 points

3 comments5 min readLW link

Should rationalists be spiritual / Spirituality as overcoming delusion

Kaj_Sotala and romeostevensit

Mar 25, 2024, 4:48 PM

49 points

57 comments29 min readLW link

Committing, Assuming, Externalizing, and Internalizing

Scott GarrabrantNov 9, 2020, 4:59 PM

31 points

25 comments10 min readLW link

Subagents of Cartesian Frames

Scott GarrabrantNov 2, 2020, 10:02 PM

53 points

6 comments8 min readLW link

Additive and Multiplicative Subagents

Scott GarrabrantNov 6, 2020, 2:26 PM

20 points

7 comments12 min readLW link

Silence

alkjashMar 18, 2018, 4:10 AM

61 points

17 comments4 min readLW link

(radimentary.wordpress.com)

A Clearer Thinking tool that teaches you to use Internal Family Systems concepts

spencergApr 28, 2023, 1:42 PM

31 points

1 comment1 min readLW link

(programs.clearerthinking.org)

Integrating Three Models of (Human) Cognition

jbkjrNov 23, 2021, 1:06 AM

40 points

4 comments32 min readLW link

The Solitaire Principle: Game Theory for One

alkjashJan 17, 2018, 12:14 AM

25 points

8 comments9 min readLW link

(radimentary.wordpress.com)

Beware Social Coping Strategies

LulieFeb 5, 2018, 4:48 AM

57 points

24 comments7 min readLW link

Make an appointment with your saner self

MalcolmOceanFeb 8, 2019, 5:05 AM

28 points

0 comments4 min readLW link

Which Parts Are “Me”?

Eliezer YudkowskyOct 22, 2008, 6:15 PM

69 points

117 comments5 min readLW link

Prosaic misalignment from the Solomonoff Predictor

Cleo NardoDec 9, 2022, 5:53 PM

42 points

3 comments5 min readLW link

TDT for Humans

alkjashFeb 28, 2018, 5:40 AM

26 points

7 comments5 min readLW link

(radimentary.wordpress.com)

Reflection of Hierarchical Relationship via Nuanced Conditioning of Game Theory Approach for AI Development and Utilization

Kyoung-cheol KimJun 4, 2021, 7:20 AM

2 points

2 comments7 min readLW link

Selection processes for subagents

Ryan KiddJun 30, 2022, 11:57 PM

36 points

2 comments9 min readLW link

A Cautionary Note on Unlocking the Emotional Brain

eapacheFeb 8, 2020, 5:21 PM

55 points

20 comments2 min readLW link

Restricted Antinatalism on Subagents

JosephineMay 13, 2021, 1:48 AM

3 points

1 comment2 min readLW link

Alien parasite technical guy

PhilGoetzJul 27, 2010, 4:51 PM

69 points

55 comments3 min readLW link

Self and No-Self

VaniverDec 29, 2019, 6:15 AM

48 points

3 comments2 min readLW link

Prune

alkjashJan 12, 2018, 10:50 PM

76 points

11 comments4 min readLW link

(radimentary.wordpress.com)

Species as Canonical Referents of Super-Organisms

Yudhister KumarOct 18, 2024, 7:49 AM

15 points

8 comments2 min readLW link

(www.yudhister.me)

No comments.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer