justinpombrio comments on You can tell a drawing from a painting

justinpombrio 8 Mar 2022 4:37 UTC
7 points
You draw an element at random from distribution A.

Or you draw an element at random from distribution B.

The range of the distributions is the same, so anything you draw from B could have been drawn from A. And yet...
- TLW 8 Mar 2022 7:16 UTC
  3 points
  Parent
  The range of the distributions is the same, so anything you draw from B could have been drawn from A
  This does not hold in pathological cases, I don’t think. As one example:
  
  $\mathrm{pdf}_{A}(x) = \begin{cases} \vdots \\ 0, & -1 + 1/2^{4} \leq x < -1 + 1/2^{3} \\ 1, & -1 + 1/2^{3} \leq x < -1 + 1/2^{2} \\ 0, & -1 + 1/2^{2} \leq x < -1 + 1/2^{1} \\ 1, & -1 + 1/2^{1} \leq x < 1 - 1/2^{1} \\ 0, & 1 - 1/2^{1} \leq x < 1 - 1/2^{2} \\ 1, & 1 - 1/2^{2} \leq x < 1 - 1/2^{3} \\ 0, & 1 - 1/2^{3} \leq x < 1 - 1/2^{4} \\ \vdots \end{cases}$
  $\mathrm{pdf}_{B}(x) = \begin{cases} \vdots \\ 1, & -1 + 1/2^{4} \leq x < -1 + 1/2^{3} \\ 0, & -1 + 1/2^{3} \leq x < -1 + 1/2^{2} \\ 1, & -1 + 1/2^{2} \leq x < -1 + 1/2^{1} \\ 0, & -1 + 1/2^{1} \leq x < 1 - 1/2^{1} \\ 1, & 1 - 1/2^{1} \leq x < 1 - 1/2^{2} \\ 0, & 1 - 1/2^{2} \leq x < 1 - 1/2^{3} \\ 1, & 1 - 1/2^{3} \leq x < 1 - 1/2^{4} \\ \vdots \end{cases}$
  (I can’t be bothered to figure out the correct constants to normalize these. Should be fairly straightforward to calculate from the standard infinite-geometric-series-sum formula.)
  Both of these functions have the same range, $-1 < x < 1$, but there is no overlap between the two functions. Their supports are disjoint.
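A minimal sketch of the example above, assuming the elided rows ("...") continue the same alternating pattern all the way out to -1 and 1. The function names (`band`, `pdf_a_unnormalized`, `partial_mass`, and so on) are illustrative and not from the comment. Under that assumption the geometric-series sums give a total mass of 4/3 for A and 2/3 for B, so the normalizing constants would be 3/4 and 3/2.

```python
from fractions import Fraction


def band(x):
    """Index k of the dyadic band containing x, for -1 < x < 1.

    Band 0 is the central interval [-1/2, 1/2). Band k >= 1 is the pair of
    intervals [1 - 2**-k, 1 - 2**-(k+1)) and [-1 + 2**-(k+1), -1 + 2**-k).
    """
    x = Fraction(x)
    if not -1 < x < 1:
        raise ValueError("x must lie strictly between -1 and 1")
    d = 1 - abs(x)  # distance to the nearer endpoint, in (0, 1]
    k = 0
    while True:
        edge = Fraction(1, 2 ** (k + 1))
        # Intervals are closed on the left, so the boundary point belongs
        # to band k on the negative side and to band k + 1 on the positive side.
        if d > edge or (d == edge and x < 0):
            return k
        k += 1


def pdf_a_unnormalized(x):
    # Assumption: A is 1 on even-numbered bands, 0 on odd-numbered ones.
    return 1 if band(x) % 2 == 0 else 0


def pdf_b_unnormalized(x):
    # B is the complement of A on (-1, 1).
    return 1 - pdf_a_unnormalized(x)


def band_length(k):
    # Band 0 has length 1; band k >= 1 is two intervals of length 2**-(k+1).
    return Fraction(1) if k == 0 else Fraction(1, 2 ** k)


def partial_mass(parity, terms):
    """Total length of the first `terms` bands whose index has this parity."""
    return sum(band_length(k) for k in range(terms) if k % 2 == parity)


# Closed forms from the geometric series:
#   A: 1 + (1/4 + 1/16 + ...) = 1 + 1/3 = 4/3
#   B:     (1/2 + 1/8  + ...) = 2/3
MASS_A = Fraction(4, 3)
MASS_B = Fraction(2, 3)
```

The two masses sum to 2, the length of the interval from -1 to 1, which is a quick sanity check that every point is covered by exactly one of the two densities.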
  - Yair Halberstadt 8 Mar 2022 7:49 UTC
    2 points
    Parent
    That’s a function; he was referring to a distribution.
    - TLW 9 Mar 2022 2:42 UTC
      1 point
      Parent
      They are both (un-normalized) probability density functions, as per the names $\mathrm{pdf}_{A}$ and $\mathrm{pdf}_{B}$. My apologies if that was unclear.
      To be somewhat clearer: I was referring to the probability distributions described by these two probability density functions. They have the same range, but disjoint support, and so anything you drew from B could not have been drawn from A.
      In other words, a (pathological) counterexample to “The range of the distributions is the same, so anything you draw from B could have been drawn from A”.
    - justinpombrio 8 Mar 2022 15:21 UTC
      1 point
      Parent
      Wikipedia says:
      
      In mathematics, the range of a function may refer to either of two closely related concepts: The codomain of the function; The image of the function.
      
      I meant the image. At least that’s what you call it for a function; I don’t know the terminology for distributions. Honestly I wasn’t thinking much about the word “range”, and should have simply said:
      
      Anything you draw from B could have been drawn from A. And yet...
      
      Before anyone starts on about how this statement isn’t well defined because the probability that you select any particular value from a continuous distribution is zero, I’ll point out that I’ve never seen anyone draw a real number uniformly at random between 0 and 1 from a hat. Even if you are actually selecting from a continuous distribution, the observations we can make about it are finite, so the relevant probabilities are all finite.
      - TLW 11 Mar 2022 7:50 UTC
        1 point
        Parent
        I was assuming you meant range as in the statistical term (for a distribution, roughly, the maximum $x$ for which $\mathrm{pdf}(x) \neq 0$, minus the minimum $x$ for which $\mathrm{pdf}(x) \neq 0$).
        Annoyingly, this is closer to the domain than it is the range, in function terminology.
        I meant the image.
        Are you sure? The range is a description of the possible outputs of the pdf, which means… almost nothing. Trivial counterexample if you do mean image:
        Uniform distribution A between 0 and 0.5 (that is, 2 for 0..0.5, and 0 otherwise).
        Uniform distribution B between 1.0 and 1.5 (that is, 2 for 1.0..1.5, and 0 otherwise).
        Both of these distributions have the same image {0, 2}. And yet they are disjoint.
        Honestly I wasn’t thinking much about the word “range”, and should have simply said:
        > Anything you draw from B could have been drawn from A.
        There are many probability distributions where this is not the case. (Like the two uniform distributions A and B I give in this post.)
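A small sketch of the two uniform distributions above, to make the "same image, disjoint support" point concrete. The grid of sample points and the helper names are illustrative choices, not part of the comment.

```python
def pdf_a(x):
    # Uniform on [0, 0.5]: density 2 there, 0 elsewhere.
    return 2.0 if 0.0 <= x <= 0.5 else 0.0


def pdf_b(x):
    # Uniform on [1.0, 1.5]: density 2 there, 0 elsewhere.
    return 2.0 if 1.0 <= x <= 1.5 else 0.0


# Sample points from -0.5 to 2.0 in steps of 0.01.
grid = [i / 100 for i in range(-50, 201)]

# The image is the set of output values the density takes.
image_a = {pdf_a(x) for x in grid}
image_b = {pdf_b(x) for x in grid}

# Points where both densities are positive, i.e. values that
# could have been drawn from either distribution.
overlap = [x for x in grid if pdf_a(x) > 0 and pdf_b(x) > 0]
```

On this grid `image_a` and `image_b` are the same set while `overlap` is empty: the outputs of the density say nothing about which values can actually be drawn.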
        *****
        Oh. You said you don’t know the terminology for distributions. Is it possible you’re under a misunderstanding of what a distribution is? It’s an “input” of a possible result, and an “output” of how probable that result is [1]. The output is not a result. The input is.
        [1] ...to way oversimplify, especially for continuous distributions.
        - justinpombrio 11 Mar 2022 9:06 UTC
          2 points
          Parent
          
          Oh. You said you don’t know the terminology for distributions. Is it possible you’re under a misunderstanding of what a distribution is? It’s an “input” of a possible result, and an “output” of how probable that result is.
          
          Yup, it was that. I thought “possible values of the distribution”, and my brain output “range, like in functions”. I shall endeavor not to use a technical term when I don’t mean it or need it, because wow was this a tangent.
  - Throwaway2367 9 Mar 2022 12:17 UTC
    1 point
    Parent
    If I may ask, why didn’t you use the following (simpler imo) example: pmf_A(0) = 1, pmf_A(1) = 0, pmf_B(0) = 0, pmf_B(1) = 1?
    
    Or even the “Bit sequences” part of the post?
    - TLW 11 Mar 2022 7:52 UTC
      2 points
      Parent
      If I may ask, why didn’t you use the following (simpler imo) example: pmf_A(0) = 1, pmf_A(1) = 0, pmf_B(0) = 0, pmf_B(1) = 1?
      With that approach one can argue that the two PMFs have different ranges [1], and get rabbit-holed into a discussion of e.g. “is a uniform distribution from 0 to 1 with a range of −10 to 10 the same or different than a uniform distribution from 0 to 1 with a range of 0 to 1”.
      This approach is more complex, but sidesteps that.
      [1] Also see https://www.lesswrong.com/posts/mERNQwDNTtqsXbSng/you-can-tell-a-drawing-from-a-painting?commentId=ySCpKgJ8WmN7BFJjN
      - Throwaway2367 14 Mar 2022 13:00 UTC
        2 points
        Parent
        What about
        
        $\mathrm{pmf}_{A}(x) = \begin{cases} \frac{1}{2} & \text{if } x = 0 \\ \frac{1}{2} & \text{if } x = 2 \\ 0 & \text{otherwise} \end{cases}$
        
        $\mathrm{pmf}_{B}(x) = \begin{cases} \frac{1}{3} & \text{if } x = 0 \\ \frac{1}{3} & \text{if } x = 1 \\ \frac{1}{3} & \text{if } x = 2 \\ 0 & \text{otherwise} \end{cases}$ ?
        
        Both functions’ support has the same minimum (0) and maximum (2).
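As a quick check of this last example, here is a sketch that stores each pmf as a dictionary and compares supports. The helper names `support` and `statistical_range` are illustrative, and "statistical range" is taken to mean the maximum minus the minimum of the support, as described earlier in the thread.

```python
from fractions import Fraction

# The two pmfs from the comment; values not listed have probability 0.
pmf_a = {0: Fraction(1, 2), 2: Fraction(1, 2)}
pmf_b = {0: Fraction(1, 3), 1: Fraction(1, 3), 2: Fraction(1, 3)}


def support(pmf):
    """Set of values with non-zero probability."""
    return {x for x, p in pmf.items() if p > 0}


def statistical_range(pmf):
    """Maximum of the support minus its minimum."""
    s = support(pmf)
    return max(s) - min(s)


# Values that can be drawn from B but never from A.
only_in_b = support(pmf_b) - support(pmf_a)
```

Both pmfs have the same statistical range, yet `only_in_b` is non-empty: the value 1 can be drawn from B but not from A. In the other direction, every value A can produce is also in B's support.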