We give a sufficient condition for a logical conditional expectation value defined using an optimal predictor scheme to be stable on counterfactual conditions.

Definition

Given an error space $Δ$ of rank $r$, the stabilizer of $Δ$, denoted $\operatorname{stab} Δ$, is the set of functions $γ : \mathbb{N}^{r} \to \mathbb{R}^{> 0}$ s.t. for any $δ \in Δ$ we have $γ δ \in Δ$.
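
As a quick illustration of the definition, the following sketch shows that bounded functions always stabilize $Δ$. It assumes the standard error space axioms from the linked posts, which are not restated here: $Δ$ is closed under addition, and $δ' \leq δ \in Δ$ implies $δ' \in Δ$. The bound $c$ is a hypothetical symbol introduced only for this example. Suppose $γ : \mathbb{N}^{r} \to \mathbb{R}^{> 0}$ satisfies $γ \leq c$ for some positive integer $c$. Then for any $δ \in Δ$

$γ δ \leq c δ = \underbrace{δ + \ldots + δ}_{c \text{ times}} \in Δ$

so $γ δ \in Δ$ and hence $γ \in \operatorname{stab} Δ$. In particular, under these assumptions any function bounded below by a positive constant has its reciprocal in $\operatorname{stab} Δ$, so the stabilizer condition is only restrictive for unbounded functions.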

Theorem

Consider $Δ$ an error space of rank 2, $D \subseteq \{0, 1\}^{*}$, $μ$ a word ensemble and $\hat{P}_{D}$ a $Δ(poly, log)$-optimal predictor scheme for $(χ_{D}, μ)$. Assume $ϵ : \mathbb{N}^{2} \to \mathbb{R}^{> 0}$ is s.t.

(i) $ϵ^{-1} \in \operatorname{stab} Δ$

(ii) $\hat{P}_{D}^{k j} \geq ϵ (k, j)$

Consider $f : D \cap \operatorname{supp} μ \to [0, 1]$ and $\hat{P}_{1}$, $\hat{P}_{2}$ $Δ(poly, log)$-optimal predictor schemes for $(f, μ ∣ D)$. Then $\hat{P}_{1} \overset{μ}{\underset{Δ}{\simeq}} \hat{P}_{2}$.

Note

This result can be interpreted as stability on counterfactual conditions since the similarity is relative to $μ$ rather than only relative to $μ ∣ D$. That is, $\hat{P}_{1}$ and $\hat{P}_{2}$ are similar outside of $D$ as well.

Proof of Theorem

We will refer to the previously established results about $Δ (p o l y, l o g)$ -optimal predictor schemes by L.N where N is the number in the linked post. Thus Theorem 1 there becomes Theorem L.1 here and so on.

By Theorem L.A.7,

$E_{(μ^{k} ∣ D) \times U^{r_{1} (k, j) + r_{2} (k, j)}} [(\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}] \in Δ$

Unfolding the conditional distribution $μ^{k} ∣ D$, this is the statement that

$\frac{E_{μ^{k} \times U^{r_{1} (k, j) + r_{2} (k, j)}} [χ_{D} (x) (\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}]}{μ^{k} (D)} \in Δ$

Since $μ^{k} (D) \leq 1$, multiplying by it gives

$E_{μ^{k} \times U^{r_{1} (k, j) + r_{2} (k, j)}} [χ_{D} (x) (\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}] \in Δ$

On the other hand, by Lemma L.B.3

$E_{μ^{k} \times U^{r (k, j) + r_{1} (k, j) + r_{2} (k, j)}} [(\hat{P}_{D}^{k j} (x) - χ_{D} (x)) (\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}] \in Δ$

Combining the last two statements we conclude that

$E_{μ^{k} \times U^{r (k, j) + r_{1} (k, j) + r_{2} (k, j)}} [\hat{P}_{D}^{k j} (x) (\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}] \in Δ$
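
The combination step can be spelled out as follows. This is a sketch that assumes $Δ$ is closed under addition and that the lemma's bound also controls the absolute value of the cross term, neither of which is restated in this post. Write $d (x) := \hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x)$, a shorthand introduced only here. Pointwise,

$\hat{P}_{D}^{k j} (x) d (x)^{2} = χ_{D} (x) d (x)^{2} + (\hat{P}_{D}^{k j} (x) - χ_{D} (x)) d (x)^{2}$

The first term on the right does not depend on the extra $r (k, j)$ random bits, so its expectation over $μ^{k} \times U^{r (k, j) + r_{1} (k, j) + r_{2} (k, j)}$ equals its expectation over $μ^{k} \times U^{r_{1} (k, j) + r_{2} (k, j)}$. Taking expectations of both sides therefore expresses the left-hand side as the sum of the two quantities already shown to lie in $Δ$.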

It follows that

$E_{μ^{k} \times U^{r_{1} (k, j) + r_{2} (k, j)}} [(\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}] = ϵ (k, j)^{-1} E_{μ^{k} \times U^{r_{1} (k, j) + r_{2} (k, j)}} [ϵ (k, j) (\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}]$

By assumption (ii), $ϵ (k, j) \leq \hat{P}_{D}^{k j}$, so

$E_{μ^{k} \times U^{r_{1} (k, j) + r_{2} (k, j)}} [(\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}] \leq ϵ (k, j)^{-1} E_{μ^{k} \times U^{r (k, j) + r_{1} (k, j) + r_{2} (k, j)}} [\hat{P}_{D}^{k j} (x) (\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}]$

The expectation on the right-hand side lies in $Δ$ and, by assumption (i), $ϵ^{-1} \in \operatorname{stab} Δ$. Hence

$E_{μ^{k} \times U^{r_{1} (k, j) + r_{2} (k, j)}} [(\hat{P}_{1}^{k j} (x) - \hat{P}_{2}^{k j} (x))^{2}] \in Δ$

Logical counterfactuals using optimal predictor schemes
