I believe that all or most of the claims here are true, but I haven’t written all the proofs in detail, so take it with a grain of salt.
Ambidistributions are a mathematical object that simultaneously generalizes infradistributions and ultradistributions. It is useful to represent how much power an agent has over a particular system: which degrees of freedom it can control, which degrees of freedom obey a known probability distribution and which are completely unpredictable.
Definition 1: Let X be a compact Polish space. A (crisp) ambidistribution on X is a function Q:C(X)→R s.t.
(Monotonocity) For any f,g∈C(X), if f≤g then Q(f)≤Q(g).
(Homogeneity) For any f∈C(X) and λ≥0, Q(λf)=λQ(f).
(Constant-additivity) For any f∈C(X) and c∈R, Q(f+c)=Q(f)+c.
Conditions 1+3 imply that Q is 1-Lipschitz. We could introduce non-crisp ambidistributions by dropping conditions 2 and/or 3 (and e.g. requiring 1-Lipschitz instead), but we will stick to crisp ambidistributions in this post.
The space of all ambidistributions on X will be denoted ♡X.[1] Obviously, □X⊆♡X (where □X stands for (crisp) infradistributions), and likewise for ultradistributions.
Examples
Example 1: Consider compact Polish spaces X,Y,Z and a continuous mapping F:X×Y→Z. We can then define F♡∈♡Z by
F♡(u):=maxθ∈ΔXminη∈ΔYEθ×η[u∘F]
That is, F♡(u) is the value of the zero-sum two-player game with strategy spaces X and Y and utility function u∘F.
Notice that F in Example 1 can be regarded as a Cartesian frame: this seems like a natural connection to explore further.
Example 2: Let A and O be finite sets representing actions and observations respectively, and Λ:{O∗→A}→□(A×O)∗ be an infra-Bayesian law. Then, we can define Λ♡∈♡(A×O)∗ by
Λ♡(u):=maxπ:O∗→AEΛ(π)[u]
In fact, this is a faithful representation: Λ can be recovered from Λ♡.
Example 3: Consider an infra-MDP with finite state set S, initial state s0∈S and transition infrakernel T:S×A→□S. We can then define the “ambikernel” T♡:S→♡S by
T♡(s;u):=maxa∈AET(s,a)[u]
Thus, every infra-MDP induces an “ambichain”. Moreover:
Claim 1:♡ is a monad. In particular, ambikernels can be composed.
This allows us defining
ϕ(γ):=(1−γ)∞∑n=0γn(T♡)n(s0)
This object is the infra-Bayesian analogue of the convex polytope of accessible state occupancy measures in an MDP.
Claim 2: The following limit always exists:
ϕ∗:=limγ→1ϕ(γ)
Legendre-Fenchel Duality
Definition 3: Let D be a convex space and A1,A2…An,B⊆D. We say that Boccludes(A1…An) when for any (a1…an)∈A1×…×An, we have
CH(a1…an)∩B≠∅
Here, CH stands for convex hull.
We denote this relation A1…An⊢B. The reason we call this “occlusion” is apparent for the n=2 case.
Here are some properties of occlusion:
For any 1≤i≤n, A1…An⊢Ai.
More generally, if c∈Δ{1…n} then A1…An⊢∑iciAi.
If Φ⊢A and Φ⊆Ψ then Ψ⊢A.
If Φ⊢A and A⊆B then Φ⊢B.
If A1…An⊢B and A′i⊆Ai for all 1≤i≤n, then A′1…A′n⊢B.
If Φ⊢Ai for all 1≤i≤n, and also A1…An⊢B, then Φ⊢B.
Notice that occlusion has similar algebraic properties to logical entailment, if we think of A⊆B as ”B is a weaker proposition than A”.
Definition 4: Let X be a compact Polish space. A cramble set[2] over X is Φ⊆□X s.t.
Φ is non-empty.
Φ is topologically closed.
For any finite Φ0⊆Φ and Θ∈□X, if Φ0⊢Θ then Θ∈Φ. (Here, we interpret elements of □X as credal sets.)
Question: If instead of condition 3, we only consider binary occlusion (i.e. require |Φ0|≤2), do we get the same concept?
Given a cramble set Φ, its Legendre-Fenchel dual ambidistribution is
^Φ(f):=maxΘ∈ΦEΘ[f]
Claim 3: Legendre-Fenchel duality is a bijection between cramble sets and ambidistributions.
Lattice Structure
Functionals
The space ♡X is equipped with the obvious partial order: Q≤P when for all f∈C(X),Q(f)≤P(f). This makes ♡X into a distributive lattice, with
(P∧Q)(f)=min(P(f),Q(f))(P∨Q)(f)=max(P(f),Q(f))
This is in contrast to □X which is a non-distributive lattice.
The bottom and top elements are given by
⊥(f)=minx∈Xf(x)⊤(f)=maxx∈Xf(x)
Ambidistributions are closed under pointwise suprema and infima, and hence ♡X is complete and satisfies both infinite distributive laws, making it a complete Heyting and co-Heyting algebra.
♡X is also a De Morgan algebra with the involution
¯Q(f):=−Q(−f)
For X≠∅, ♡X is not a Boolean algebra: ΔX⊆♡X and for any θ∈ΔX we have ¯θ=θ.
One application of this partial order is formalizing the “no traps” condition for infra-MDP:
Definition 2: A finite infra-MDP is quasicommunicating when for any s∈S
Ambidistributions
I believe that all or most of the claims here are true, but I haven’t written all the proofs in detail, so take it with a grain of salt.
Ambidistributions are a mathematical object that simultaneously generalizes infradistributions and ultradistributions. It is useful to represent how much power an agent has over a particular system: which degrees of freedom it can control, which degrees of freedom obey a known probability distribution and which are completely unpredictable.
Definition 1: Let X be a compact Polish space. A (crisp) ambidistribution on X is a function Q:C(X)→R s.t.
(Monotonocity) For any f,g∈C(X), if f≤g then Q(f)≤Q(g).
(Homogeneity) For any f∈C(X) and λ≥0, Q(λf)=λQ(f).
(Constant-additivity) For any f∈C(X) and c∈R, Q(f+c)=Q(f)+c.
Conditions 1+3 imply that Q is 1-Lipschitz. We could introduce non-crisp ambidistributions by dropping conditions 2 and/or 3 (and e.g. requiring 1-Lipschitz instead), but we will stick to crisp ambidistributions in this post.
The space of all ambidistributions on X will be denoted ♡X.[1] Obviously, □X⊆♡X (where □X stands for (crisp) infradistributions), and likewise for ultradistributions.
Examples
Example 1: Consider compact Polish spaces X,Y,Z and a continuous mapping F:X×Y→Z. We can then define F♡∈♡Z by
F♡(u):=maxθ∈ΔXminη∈ΔYEθ×η[u∘F]That is, F♡(u) is the value of the zero-sum two-player game with strategy spaces X and Y and utility function u∘F.
Notice that F in Example 1 can be regarded as a Cartesian frame: this seems like a natural connection to explore further.
Example 2: Let A and O be finite sets representing actions and observations respectively, and Λ:{O∗→A}→□(A×O)∗ be an infra-Bayesian law. Then, we can define Λ♡∈♡(A×O)∗ by
Λ♡(u):=maxπ:O∗→AEΛ(π)[u]In fact, this is a faithful representation: Λ can be recovered from Λ♡.
Example 3: Consider an infra-MDP with finite state set S, initial state s0∈S and transition infrakernel T:S×A→□S. We can then define the “ambikernel” T♡:S→♡S by
T♡(s;u):=maxa∈AET(s,a)[u]Thus, every infra-MDP induces an “ambichain”. Moreover:
Claim 1: ♡ is a monad. In particular, ambikernels can be composed.
This allows us defining
ϕ(γ):=(1−γ)∞∑n=0γn(T♡)n(s0)This object is the infra-Bayesian analogue of the convex polytope of accessible state occupancy measures in an MDP.
Claim 2: The following limit always exists:
ϕ∗:=limγ→1ϕ(γ)Legendre-Fenchel Duality
Definition 3: Let D be a convex space and A1,A2…An,B⊆D. We say that B occludes (A1…An) when for any (a1…an)∈A1×…×An, we have
CH(a1…an)∩B≠∅Here, CH stands for convex hull.
We denote this relation A1…An⊢B. The reason we call this “occlusion” is apparent for the n=2 case.
Here are some properties of occlusion:
For any 1≤i≤n, A1…An⊢Ai.
More generally, if c∈Δ{1…n} then A1…An⊢∑iciAi.
If Φ⊢A and Φ⊆Ψ then Ψ⊢A.
If Φ⊢A and A⊆B then Φ⊢B.
If A1…An⊢B and A′i⊆Ai for all 1≤i≤n, then A′1…A′n⊢B.
If Φ⊢Ai for all 1≤i≤n, and also A1…An⊢B, then Φ⊢B.
Notice that occlusion has similar algebraic properties to logical entailment, if we think of A⊆B as ”B is a weaker proposition than A”.
Definition 4: Let X be a compact Polish space. A cramble set[2] over X is Φ⊆□X s.t.
Φ is non-empty.
Φ is topologically closed.
For any finite Φ0⊆Φ and Θ∈□X, if Φ0⊢Θ then Θ∈Φ. (Here, we interpret elements of □X as credal sets.)
Question: If instead of condition 3, we only consider binary occlusion (i.e. require |Φ0|≤2), do we get the same concept?
Given a cramble set Φ, its Legendre-Fenchel dual ambidistribution is
^Φ(f):=maxΘ∈ΦEΘ[f]Claim 3: Legendre-Fenchel duality is a bijection between cramble sets and ambidistributions.
Lattice Structure
Functionals
The space ♡X is equipped with the obvious partial order: Q≤P when for all f∈C(X), Q(f)≤P(f). This makes ♡X into a distributive lattice, with
(P∧Q)(f)=min(P(f),Q(f))(P∨Q)(f)=max(P(f),Q(f))This is in contrast to □X which is a non-distributive lattice.
The bottom and top elements are given by
⊥(f)=minx∈Xf(x)⊤(f)=maxx∈Xf(x)Ambidistributions are closed under pointwise suprema and infima, and hence ♡X is complete and satisfies both infinite distributive laws, making it a complete Heyting and co-Heyting algebra.
♡X is also a De Morgan algebra with the involution
¯Q(f):=−Q(−f)For X≠∅, ♡X is not a Boolean algebra: ΔX⊆♡X and for any θ∈ΔX we have ¯θ=θ.
One application of this partial order is formalizing the “no traps” condition for infra-MDP:
Definition 2: A finite infra-MDP is quasicommunicating when for any s∈S
limγ→1(1−γ)∞∑n=0γn(T♡)n(s0)≤limγ→1(1−γ)∞∑n=0γn(T♡)n(s)Claim 4: The set of quasicommunicating finite infra-MDP (or even infra-RDP) is learnable.
Cramble Sets
Going to the cramble set representation, ^Φ≤^Ψ iff Φ⊆Ψ.
Φ∧Ψ is just Φ∩Ψ, whereas Φ∨Ψ is the “occlusion hall” of Φ and Ψ.
The bottom and the top cramble sets are
⊥={⊤□}⊤=□XHere, ⊤□ is the top element of □X (corresponding to the credal set ΔX).
The De Morgan involution is
¯Φ={Θ∈□X∣∀Ξ∈Φ:Θ∩Ξ≠∅}Operations
Definition 5: Given X,Y compact Polish spaces and a continuous mapping h:X→Y, we define the pushforward h∗:♡X→♡Y by
h∗(Q;f):=Q(f∘h)When h is surjective, there are both a left adjoint and a right adjoint to h∗, yielding two pullback operators h∗min,h∗max:♡Y→♡X:
h∗min(Q;f):=ming∈C(Y):g∘h≥fQ(g)h∗max(Q;f):=maxg∈C(Y):g∘h≤fQ(g)Given Q∈♡X and P∈♡Y we can define the semidirect product Q⋉P∈♡(X×Y) by
(Q⋉P)(f):=Q(λx.P(λy.f(x,y)))There are probably more natural products, but I’ll stop here for now.
Polytopic Ambidistributions
Definition 6: The polytopic ambidistributions ♡polX are the (incomplete) sublattice of ♡X generated by ΔX.
Some conjectures about this:
For finite X, an ambidistributions Q is polytopic iff there is a finite polytope complex C on RX s.t. for any cell A of C, Q|C is affine.
For finite X, a cramble set Φ is polytopic iff it is the occlusion hall of a finite set of polytopes in ΔX.
ϕ(γ) and ϕ∗ from Example 3 are polytopic.
The non-convex shape ♡ reminds us that ambidistributions need not be convex or concave.
The expression “cramble set” is meant to suggest a combination of “credal set” with “ambi”.