Addendum: One lesson to take away is that quantilization’s safety doesn’t depend only on the base distribution being safe to sample from unconditionally. As the theorems hint, quantilization’s viability also depends on base(plan | plan does anything interesting) being safe with high probability, because we could (and probably would) resample the agent until we get something interesting. In this post’s terminology, A := {safe interesting things}, B := {power-seeking interesting things}, and C := A ∪ B ∪ {uninteresting things}.
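To make the conditioning point concrete, here is a minimal Python sketch of a toy base distribution. The three category names and their probabilities are hypothetical numbers chosen only for illustration; they do not come from the post or its theorems. The sketch compares how often an unconditional sample is safe with how often a sample is safe once we resample until the plan is interesting.

```python
import random

# Hypothetical base distribution over plan categories.
# These probabilities are illustrative assumptions, not values from the post.
BASE = {
    "uninteresting": 0.98,
    "safe_interesting": 0.005,           # plays the role of A
    "power_seeking_interesting": 0.015,  # plays the role of B
}


def sample_base(rng):
    """Draw one plan category from the base distribution."""
    categories = list(BASE)
    weights = [BASE[c] for c in categories]
    return rng.choices(categories, weights=weights, k=1)[0]


def sample_until_interesting(rng):
    """Resample from the base distribution until the plan is interesting."""
    while True:
        plan = sample_base(rng)
        if plan != "uninteresting":
            return plan


def p_safe_unconditional():
    """Probability that a single unconditional draw is not power-seeking."""
    return 1.0 - BASE["power_seeking_interesting"]


def p_safe_given_interesting():
    """Probability that a draw is safe, conditional on being interesting."""
    a = BASE["safe_interesting"]
    b = BASE["power_seeking_interesting"]
    return a / (a + b)


def estimate_safe_after_resampling(n, seed=0):
    """Monte Carlo estimate of the safe fraction under resampling."""
    rng = random.Random(seed)
    safe = sum(
        sample_until_interesting(rng) == "safe_interesting" for _ in range(n)
    )
    return safe / n


if __name__ == "__main__":
    print("P(safe), unconditional:", p_safe_unconditional())
    print("P(safe | interesting): ", p_safe_given_interesting())
    print("Estimate by resampling:", estimate_safe_after_resampling(5000))
```

Under these assumed numbers, a single unconditional draw is almost always safe, yet most interesting draws are power-seeking. Resampling until something interesting appears effectively samples from the conditional distribution, so the unconditional safety figure says little about the plan we would actually end up with.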