I expect there to be a massive and important distinction between “passive transparency” and “active transparency”, with the latter being much more shaky and potentially concealing of fatality, and the former being cruder as tech at the present rate which is unfortunate because it has so many fewer ways to go wrong. I hope any terminology chosen continues to make the distinction clear.
Possibly relevant here is my transparency trichotomy between inspection transparency, training transparency, and architectural transparency. My guess is that inspection transparency and training transparency would mostly go in your “active transparency” bucket and architectural transparency would mostly go in your “passive transparency” bucket. I think there is a position here that makes sense to me, which is perhaps what you’re advocating, that architectural transparency isn’t relying on any sort of path-continuity arguments in terms of how your training process is going to search through the space, since you’re just trying to guarantee that the whole space is transparent, which I do think is pretty good desideratum if it’s achievable. Imo, I mostly bite the bullet on path-continuity arguments being necessary, but it definitely would be nice if they weren’t.
I expect there to be a massive and important distinction between “passive transparency” and “active transparency”, with the latter being much more shaky and potentially concealing of fatality, and the former being cruder as tech at the present rate which is unfortunate because it has so many fewer ways to go wrong. I hope any terminology chosen continues to make the distinction clear.
Possibly relevant here is my transparency trichotomy between inspection transparency, training transparency, and architectural transparency. My guess is that inspection transparency and training transparency would mostly go in your “active transparency” bucket and architectural transparency would mostly go in your “passive transparency” bucket. I think there is a position here that makes sense to me, which is perhaps what you’re advocating, that architectural transparency isn’t relying on any sort of path-continuity arguments in terms of how your training process is going to search through the space, since you’re just trying to guarantee that the whole space is transparent, which I do think is pretty good desideratum if it’s achievable. Imo, I mostly bite the bullet on path-continuity arguments being necessary, but it definitely would be nice if they weren’t.