RSS

Boundaries /​ Mem­branes [tech­ni­cal]

TagLast edit: Jul 31, 2024, 5:03 AM by Chipmonk

Explanation

Andrew Critch, March 2023:

By boundaries, I just mean the approximate causal separation of regions in some kind of physical space (e.g., spacetime) or abstract space (e.g., cyberspace). Here are some examples from my «Boundaries» Sequence:

  • a cell membrane (separates the inside of a cell from the outside);

  • a person’s skin (separates the inside of their body from the outside);

  • a fence around a family’s yard (separates the family’s place of living-together from neighbors and others);

  • a digital firewall around a local area network (separates the LAN and its users from the rest of the internet);

  • a sustained disassociation of social groups (separates the two groups from each other)

  • a national border (separates a state from neighboring states or international waters).

Applications

Compilation

«Boundaries»/​membranes and AI safety compilation.

Beware a common misunderstanding

When I say boundary, I don’t just mean an arbitrary constraint or social norm. («Boundaries» Pt. 1)

It is for this reason that I (@Chipmonk) often use the term “membranes” instead of “boundaries”. It aids understanding.

Credits

Tag created and maintained by Chipmonk, membranes/​«boundaries» enthusiast. 2023 April.

«Boundaries», Part 1: a key miss­ing con­cept from util­ity theory

Andrew_CritchJul 26, 2022, 11:03 PM
158 points
33 comments7 min readLW link

«Boundaries», Part 3b: Align­ment prob­lems in terms of bound­aries

Andrew_CritchDec 14, 2022, 10:34 PM
72 points
7 comments13 min readLW link

Acausal normalcy

Andrew_CritchMar 3, 2023, 11:34 PM
195 points
36 comments8 min readLW link1 review

Agent Boundaries Aren’t Markov Blan­kets. [Un­less they’re non-causal; see com­ments.]

abramdemskiNov 20, 2023, 6:23 PM
82 points
11 comments2 min readLW link

«Boundaries/​Mem­branes» and AI safety compilation

ChipmonkMay 3, 2023, 9:41 PM
57 points
17 comments8 min readLW link

Agent mem­branes/​bound­aries and for­mal­iz­ing “safety”

ChipmonkJan 3, 2024, 5:55 PM
26 points
46 comments3 min readLW link

“Mem­branes” is bet­ter ter­minol­ogy than “bound­aries” alone

May 28, 2023, 10:16 PM
30 points
12 comments3 min readLW link

What does davi­dad want from «bound­aries»?

Feb 6, 2024, 5:45 PM
47 points
1 comment5 min readLW link

[Question] What tech­ni­cal top­ics could help with bound­aries/​mem­branes?

ChipmonkJan 5, 2024, 6:14 PM
15 points
25 comments1 min readLW link

Ret­ro­spec­tive on Math­e­mat­i­cal Boundaries Workshop

May 12, 2024, 9:58 PM
22 points
0 comments4 min readLW link
(formalizingboundaries.substack.com)

Ap­ply to the Con­cep­tual Boundaries Work­shop for AI Safety

ChipmonkNov 27, 2023, 9:04 PM
50 points
0 comments3 min readLW link

«Boundaries» for for­mal­iz­ing an MVP morality

ChipmonkMay 13, 2023, 7:10 PM
20 points
7 comments4 min readLW link

«Boundaries», Part 3a: Defin­ing bound­aries as di­rected Markov blankets

Andrew_CritchOct 30, 2022, 6:31 AM
90 points
20 comments15 min readLW link

«Boundaries», Part 2: trends in EA’s han­dling of boundaries

Andrew_CritchAug 6, 2022, 12:42 AM
81 points
15 comments7 min readLW link

«Boundaries» Se­quence (In­dex Post)

Andrew_CritchJul 26, 2022, 7:12 PM
25 points
1 comment1 min readLW link

Agent mem­branes and causal distance

ChipmonkJan 2, 2024, 10:43 PM
20 points
3 comments3 min readLW link

Boundaries vs Frames

Scott GarrabrantOct 31, 2022, 3:14 PM
58 points
10 comments7 min readLW link

Hier­ar­chi­cal Agency: A Miss­ing Piece in AI Alignment

Jan_KulveitNov 27, 2024, 5:49 AM
112 points
20 comments11 min readLW link

What is au­ton­omy? Why bound­aries are nec­es­sary.

ChipmonkOct 21, 2024, 5:56 PM
8 points
1 comment1 min readLW link
(chrislakin.blog)

An Open Agency Ar­chi­tec­ture for Safe Trans­for­ma­tive AI

davidadDec 20, 2022, 1:04 PM
80 points
22 comments4 min readLW link

The Hu­man-AI Reflec­tive Equilibrium

Allison DuettmannJan 24, 2023, 1:32 AM
22 points
1 comment24 min readLW link

Boundaries en­able pos­i­tive ma­te­rial-in­for­ma­tional feed­back loops

jessicataDec 22, 2018, 2:46 AM
36 points
26 comments5 min readLW link

Boundaries-based se­cu­rity and AI safety approaches

Allison DuettmannApr 12, 2023, 12:36 PM
43 points
2 comments6 min readLW link

Con­se­quen­tial­ists: One-Way Pat­tern Traps

David UdellJan 16, 2023, 8:48 PM
59 points
3 comments14 min readLW link

[AN #127]: Re­think­ing agency: Carte­sian frames as a for­mal­iza­tion of ways to carve up the world into an agent and its environment

Rohin ShahDec 2, 2020, 6:20 PM
53 points
0 comments13 min readLW link
(mailchi.mp)

Em­pow­er­ment is (al­most) All We Need

jacob_cannellOct 23, 2022, 9:48 PM
61 points
44 comments17 min readLW link

Embed­ded Agency (full-text ver­sion)

Nov 15, 2018, 7:49 PM
201 points
17 comments54 min readLW link

LOVE in a sim­box is all you need

jacob_cannellSep 28, 2022, 6:25 PM
66 points
73 comments44 min readLW link1 review

Agents Over Carte­sian World Models

Apr 27, 2021, 2:06 AM
67 points
4 comments27 min readLW link

An­nounc­ing the Align­ment of Com­plex Sys­tems Re­search Group

Jun 4, 2022, 4:10 AM
91 points
20 comments5 min readLW link

Re­spect for Boundaries as non-ar­bir­trary co­or­di­na­tion norms

Jonas HallgrenMay 9, 2023, 7:42 PM
9 points
3 comments7 min readLW link

Roadmap for a col­lab­o­ra­tive pro­to­type of an Open Agency Architecture

Deger TuranMay 10, 2023, 5:41 PM
31 points
0 comments12 min readLW link

Car­tog­ra­phy, blow­ing one’s mind, the illu­sion of sep­a­ra­tion and other gen­eral musings

Neil Jun 16, 2023, 7:19 PM
0 points
4 comments2 min readLW link

Are eth­i­cal asym­me­tries from prop­erty rights?

KatjaGraceJul 2, 2018, 3:00 AM
108 points
37 comments3 min readLW link
(meteuphoric.com)

Agency from a causal perspective

Jun 30, 2023, 5:37 PM
40 points
5 comments6 min readLW link

Desider­ata for an AI

Nathan Helm-BurgerJul 19, 2023, 4:18 PM
9 points
0 comments4 min readLW link

For­mal­iz­ing «Boundaries» with Markov blankets

ChipmonkSep 19, 2023, 9:01 PM
21 points
20 comments3 min readLW link

A list of core AI safety prob­lems and how I hope to solve them

davidadAug 26, 2023, 3:12 PM
165 points
29 comments5 min readLW link

Boundary Vio­la­tions vs Boundary Dissolution

ChipmonkFeb 26, 2024, 6:59 PM
8 points
4 comments1 min readLW link

Ideal­ized Agents Are Ap­prox­i­mate Causal Mir­rors (+ Rad­i­cal Op­ti­mism on Agent Foun­da­tions)

Thane RuthenisDec 22, 2023, 8:19 PM
74 points
14 comments6 min readLW link

In­cor­po­rat­ing Jus­tice The­ory into De­ci­sion Theory

StrivingForLegibilityJan 21, 2024, 7:17 PM
13 points
20 comments5 min readLW link

Pro­tect­ing agent boundaries

ChipmonkJan 25, 2024, 4:13 AM
11 points
6 comments2 min readLW link

How I turned do­ing ther­apy into ob­ject-level AI safety research

ChipmonkMar 14, 2024, 1:54 AM
15 points
5 comments4 min readLW link

Davi­dad’s Bold Plan for Align­ment: An In-Depth Explanation

Apr 19, 2023, 4:09 PM
168 points
40 comments21 min readLW link2 reviews

[Question] Define “Agent” (Embed­ded)

ApolloniaMar 24, 2024, 8:14 PM
10 points
1 comment1 min readLW link

Be­ing nicer than Clippy

Joe CarlsmithJan 16, 2024, 7:44 PM
109 points
32 comments27 min readLW link

On green

Joe CarlsmithMar 21, 2024, 5:38 PM
268 points
35 comments31 min readLW link

[Question] Plau­si­bil­ity of cy­bor­gism for pro­tect­ing bound­aries?

ChipmonkMar 27, 2024, 6:53 PM
10 points
6 comments1 min readLW link

Boundaries Up­date #1

ChipmonkApr 11, 2024, 4:07 PM
3 points
2 comments1 min readLW link
(formalizingboundaries.substack.com)

A nec­es­sary Mem­brane for­mal­ism feature

ThomasCederborgSep 10, 2024, 9:33 PM
20 points
6 comments11 min readLW link

En­cul­tured AI Pre-plan­ning, Part 1: En­abling New Benchmarks

Aug 8, 2022, 10:44 PM
63 points
2 comments6 min readLW link

Con­tent and Take­aways from SERI MATS Train­ing Pro­gram with John Wentworth

RohanSDec 24, 2022, 4:17 AM
28 points
3 comments12 min readLW link
No comments.