abramdemski comments on My Current Take on Counterfactuals

abramdemski 19 Apr 2021 17:18 UTC
LW: 2 AF: 2
0
AF
I’m not convinced this is the right desideratum for that purpose. Why should we care about exploitability by traders if making such trades is not actually possible given the environment and the utility function? IMO epistemic rationality is subservient to instrumental rationality, so our desiderata should be derived from the later.
So, one point is that the InfraBayes picture still gives epistemics an important role: the kind of guarantee arrived at is a guarantee that you won’t do too much worse than the most useful partial model expects. So, we can think about generalized partial models which update by thinking longer in addition to taking in sense-data.
I suppose TRL can model this by observing what those computations would say, in a given situation, and using partial models which only “trust computation X” rather than having any content of their own. Is this “complete” in an appropriate sense? Can we always model a would-be radical-infrabayesian as a TRL agent observing what that radical-infrabayesian would think?
Even if true, there may be a significant computational complexity gap between just doing the thing vs modeling it in this way.
- Vanessa Kosoy 3 May 2021 17:58 UTC
  LW: 4 AF: 3
  0
  AF Parent
  Yes, I’m pretty sure we have that kind of completeness. Obviously representing all hypotheses in this opaque form would give you poor sample and computational complexity, but you can do something midway: use black-box programs as components in your hypothesis but also have some explicit/transparent structure.
  - abramdemski 2 Jun 2021 20:46 UTC
    LW: 2 AF: 2
    0
    AF Parent
    OK, so, here is a question.
    The abstract theory of InfraBayes (like the abstract theory of Bayes) elides computational concerns.
    In reality, all of ML can more or less be thought of as using a big search for good models, where “good” means something approximately like MAP, although we can also consider more sophisticated variational targets. This introduces two different types of approximation:
    The optimization target is approximate.
    The optimization itself gives only approximate maxima.
    What we want out of InfraBayes is a bounded regret guarantee (in settings where we previously didn’t know how to get one). What we have is a picture of how to get that if we can actually do the generalized Bayesian update. What we might want is a picture of how to do that more generally, when we can’t actually compute the full update.
    Can we get such a thing with InfraBayes?
    In other words, search is a very basic type of logical uncertainty. Currently, we don’t have much of a model of that, except “Bayesian Search” (which does not provide any nice regret bounds that I know of, although I may be ignorant). We might need such a thing in order to get nice guarantees for systems which employ search internally. Can we get it?
    Obviously, we can do the bayesian-search thing with InfraBayes substituted in, which already probably provides some kind of guarantee which couldn’t be gotten otherwise. However, the challenge is to get the guarantee to carry all the way through to the end result.
    - Vanessa Kosoy 4 Jun 2021 0:06 UTC
      LW: 11 AF: 7
      0
      AF Parent
      My hope is that we will eventually have computationally feasible algorithms that satisfy provable (or at least conjectured) infra-Bayesian regret bounds for some sufficiently rich hypothesis space. Currently, even in the Bayesian case, we only have such algorithms for poor hypothesis spaces, such as MDPs with a small number of states. We can also rule out such algorithms for some large hypothesis spaces, such as short programs with a fixed polynomial-time bound. In between, there should be some hypothesis space which is small enough to be feasible and rich enough to be useful. Indeed, it seems to me that the existence of such a space is the simplest explanation for the success of deep learning (that is, for the ability to solve a diverse array of problems with relatively simple and domain-agnostic algorithms). But, at present I only have speculations about what this space looks like.
- abramdemski 20 Apr 2021 16:20 UTC
  LW: 2 AF: 2
  0
  AF Parent
  To further elaborate, this post discusses ways a Bayesian might pragmatically prefer non-Bayesian updates. Some of them don’t carry over, for sure, but I expect the general idea to translate: InfraBayesians need some unrealistic assumptions to reflectively justify the InfraBayesian update in contrast to other updates. (But I am not sure which assumptions to point out, atm.)
  - Vanessa Kosoy 3 May 2021 18:14 UTC
    LW: 2 AF: 1
    0
    AF Parent
    
    In particular, it’s easy to believe that some computation knows more than you.
    
    Yes, I think TRL captures this notion. You have some Knightian uncertainty about the world, and some Knightian uncertainty about the result of a computation, and the two are entangled.