ike comments on Anthropic Reasoning and Perspective-Based Arguments

ike 8 Sep 2020 17:19 UTC
1 point
0
>Does every single observation we make confirms the MWI?
>Does my simple existence confirm the existence of the multiverse?
These are equivalent to presumptuous philosopher, and my answer is the same—if my existence is more likely given MWI or multiverse, then it provides Bayesian evidence. This may or may not be enough to be confident in either, depending on the priors, which depends on the simplicity of the theory compared to the simplicity of competing theories.
>Wouldn’t my existence alone confirm there are numerous ancestor simulations?
No, there’s a lot of implausible assumptions involved there. I don’t think the measure of ancestor simulations across the multiverse is significant. If we had strong evidence that the measure was high, then that probability would go up, all else being equal.
The great filter itself relies on assumptions about base rates of life arising and thriving which are very uncertain. The post you link to says:
>Assume the various places we think the filter could be are equally likely.
Also, we should be conditioning on all the facts we know, which includes the nature of our Earth, the fact we don’t see any signs of life on other planets, etc. It’s not at all clear that a future filter is more likely once all that is taken into account.
>In Dr. Evil and Dub, what conclusion would SIA make?
The conclusion would be that they’re equally likely to be the clone vs the original. Whether they should act on this conclusion depends on blackmail game theory.
> Is the brain arm race correct?
I don’t know which paradox you’re referring to.
I don’t think any of these paradoxes are resolved by ad hoc thinking. One needs to carefully consider the actual evidence, the Bayes factor of conditioning on one’s existence, the prior probabilities, and use SIA to put it all together. The fact that this sometimes results in unintuitive results shouldn’t be held against the theory, all anthropic theories will be unintuitive somewhere.
- cubefox 7 Oct 2020 0:14 UTC
  2 points
  0
  Parent
  Sorry if this is somewhat unrelated to the discussion here, but I don’t think the SIA Doomsday can be dismissed so easily.
  
  The great filter itself relies on assumptions about base rates of life arising and thriving which are very uncertain.
  
  If we don’t have overwhelming reason to think that the filter is in the past, or to think that there is no filter at all, SIA suggests that the filter is very, very likely in the future. SIA itself would, so to speak, be overwhelming evidence for a future filter; you would need overwhelming counter evidence to cancel this out. Or you do a Moorean Shift and doubt SIA, precisely because there apparently is no independent overwhelming evidence that such a filter is in the future. (Especially when we consider the fact that SIA pretty much rules out an AI as the future filter, since we do not only see no Aliens, we also see no rouge alien AIs. There is a separate post on this topic on her blog.) Or you doubt other details of the SIA Doomsday argument, but aside from SIA there aren’t many it seems.
  - ike 7 Oct 2020 0:46 UTC
    1 point
    0
    Parent
    
    If we don’t have overwhelming reason to think that the filter is in the past, or to think that there is no filter at all, SIA suggests that the filter is very, very likely in the future.
    
    My current understanding is that the parameters don’t imply the need for a future great filter to explain the Fermi paradox.
    
    I don’t think you need overwhelming evidence for this. SIA is only overwhelming evidence for a future filter if you already have overwhelming evidence that a filter exists beyond what we know of, which we don’t.
    
    Toy model: if a priori a filter exists 10% of the time, and this filter would prevent 99% of civilizations from evolving into humans OR prevent humans from becoming space civilizations (50% each), then there’s 1800 worlds with no filter for every 200 worlds with a filter; 101 of those filter worlds contain humans and only two of those become space civilizations. So our probability of getting to space is 1802/1901.
    
    If the probability a filter exists is 90%, then there’s 200 worlds with no filter for every 1800 filter worlds. Out of the filter worlds, 909 contain humans. Out of those, 18 go to space. So the probability of us going to space is 218/1109.
    
    You really do need overwhelming evidence that a filter exists / the filter is very strong before it creates overwhelming odds of a future filter.
    - cubefox 7 Oct 2020 23:38 UTC
      2 points
      0
      Parent
      I think you are right that when we are not very certain about the existence / strength of a great filter, SIA Doomsday loses lots of its force. But I think the standard argument for the “great filter hypothesis” was always that a strong filter is extremely likely because even if just a single civilization decides to colonize/sterilize the galaxy (e.g. via von Neumann probes) it could do so comparatively quickly. If it spreads at 1% of the speed of light, it takes 1 million years to colonize the whole Milky Way, which is a very short amount of time compared to the age of the galaxy or even our solar system. Yet the fermi paradox suggests the Milky Way is not colonized to a significant extent. So the expected number civilizations in the galaxy is so low that we are likely one of the first, if not the first.
      
      A different point about your toy model: Why do you assume 50% each for the filter being in the past/future? That seems to ignore SIA. The point of the SIA Doomsday argument is precisely that the filter, assuming it exists, is much, much more likely to be found in the future than in the past. Because SIA strongly favors possible worlds with more observers who could be us, and in a possible world with a past filter (i.e. “early” or “middle” filter in Katja’s post) there are of course very few such observers (the filter prevents them from coming into existence), but in a world with a late filter there are much more of them. (Indeed, SIA’s preference for more observers who could be us seems to be unbounded, to the point that it makes it certain that there are infinitely many observers in the universe.)
      
      Here is the link to the argument again: https://meteuphoric.com/2010/03/23/sia-doomsday-the-filter-is-ahead/
      - ike 8 Oct 2020 2:16 UTC
        1 point
        0
        Parent
        The standard argument for the great filter depends on a number of assumptions, and as I said, my current understanding is this standard argument doesn’t work numerically once you set up ranges for all the variables.
        The point of the SIA Doomsday argument is precisely that the filter, assuming it exists, is much, much more likely to be found in the future than in the past.
        Yes, this is true in my model—conditioning on a filter in the first case yields 100 future filters vs 1 past filter, and in the second case yields 900 future filters vs 9 past filters. There’s a difference between a prior before you know if humans exist and a posterior conditioning on humans existing.
        Indeed, SIA’s preference for more observers who could be us seems to be unbounded, to the point that it makes it certain that there are infinitely many observers in the universe.
        This depends on your measure over the set of possible worlds, but one can plausibly reject infinities in any possible world or reject the coherency of such. As I’ve written elsewhere, I’m a verificationist and don’t think statements about what is per se are verifiable or meaningful—my anthropic statements are meaningful insofar as they predict future experiences with various probabilities.
        cubefox 8 Oct 2020 22:31 UTC
        2 points
        0
        Parent
        
        The standard argument for the great filter depends on a number of assumptions, and as I said, my current understanding is this standard argument doesn’t work numerically once you set up ranges for all the variables.
        
        You are talking about the calculations by Sandberg, Drexler, and Ord, right? In a post where these results were discussed there was an interesting comment by avturchin:
        
        https://www.lesswrong.com/posts/MdGp3vs2butANB7zX/the-fermi-paradox-what-did-sandberg-drexler-and-ord-really?commentId=4FpJaopxXsf7FRLCt
        
        It seems that SIA says that the parameters of the drake equation should be expected to be optimized for observers-which-could-be-us to appear, but exactly this consideration was not factored into the calculations of Sandberg, Drexler, and Ord. Which would mean their estimations for the expected number of civilizations per galaxy are way too low.
        
        Yes, this is true in my model—conditioning on a filter in the first case yields 100 future filters vs 1 past filter, and in the second case yields 900 future filters vs 9 past filters. There’s a difference between a prior before you know if humans exist and a posterior conditioning on humans existing.
        
        Then what I don’t quite understand is why the calculation of your toy model seems so different from the calculation in Katja’s post. In her calculation there is a precise point where SIA is applied, while I don’t see such a point I’m your calculation. Also, the original Bostom SIA (“SSA+SIA”) does, as Dadadarren pointed out, involve a reference class whose effect then “cancels out” while you are, as you pointed out, trying to avoid reference classes to begin with.
        
        Maybe your version of SIA is closer to something like FNC than to the original SIA. Perhaps you should try to give your version a precise definition. The core idea, as far as my limited understanding goes, is this: If hypothesis H makes my existence M more likely, then my existence M also makes H more likely. Because P(M|H) > P(M) implies P(H|M) > P(H). This of course doesn’t work if P(M) is 1 to begin with, as you would expect if M means something like the degree of belief I have in my own existence, or in “I exist”. So we seem to be forced to consider a “non-centered” version without indexical, i.e. “cubefox exists”, which plausibly has a much lower probability than 1 from the gods eye perspective. If we call my indexical proposition M_i and the non-indexical proposition M_c, it becomes clear that the meaning of ‘M’ in “P(M|H) > P(M) implies P(H|M) > P(H)” is ambiguous. If it means: “P(M_c|H) > P(M_c) implies P(H|M_i) > P(H)” then this of course is not a theorem of probability theory anymore. So how is it justified? If we take M_i to simply imply M_c, then P(M_c) would also be 1 and the first inequality (P(M_c|H) > P(M_c)) would again be false.
        
        Maybe I’m off the track here, but Dadadarren seems to be at least right in that the relation between indexical and non-indexical propositions is both important and not straightforward.
        
        This depends on your measure over the set of possible worlds, but one can plausibly reject infinities in any possible world or reject the coherency of such.
        
        Now that seems to me a surprising statement. As far as I’m aware, the most popular guess among cosmologist about the size of the universe is that it is infinitely large. That it has the topology of a plane rather than of a sphere or a torus. Why and how would we plausibly reject this widely held possibility? On the contrary, it seems that SIA presumptuously requires us to categorically reject the sphere and the torus possibility on pure a priori grounds because they imply a universe finite in size and thus with way too few observers.
        
        The only a priori reason against a “big” universe I can think of is one with infinite complexity. By Ockham’s razor, it would be infinitely unlikely. If simplicity is low complexity, and complexity is information content, then the complexity C is related to probability P with C(x) = -log_{2}P(x), or P(x) = 2^-C(x). If C(x) is infinite, P(x) is 0.
        
        But an infinitely large universe doesn’t mean infinite complexity, at least not in the information content sense of “incompressibility”. An infinite universe may arise from quite simple laws and initial conditions, which would make its information content low, and its probability relatively high.
        
        As I’ve written elsewhere, I’m a verificationist and don’t think statements about what is per se are verifiable or meaningful—my anthropic statements are meaningful insofar as they predict future experiences with various probabilities.
        
        Well, SIA seems to predict that we will encounter future evidence which would imply a finite size of the universe with probability 0. Which is just what you required. While the silly cosmologists have not ruled out a finite universe, we philosophers just did so on pure a priori grounds. :)
        ike 9 Oct 2020 0:05 UTC
        1 point
        0
        Parent
        It seems that SIA says that the parameters of the drake equation should be expected to be optimized for observers-which-could-be-us to appear, but exactly this consideration was not factored into the calculations of Sandberg, Drexler, and Ord. Which would mean their estimations for the expected number of civilizations per galaxy are way too low.
        I don’t think this is correct. Look at page 6 of https://arxiv.org/pdf/1806.02404.pdf
        SIA is a reason to expect very low values of N to be unlikely, since we would be unlikely to exist if N was that low. But the lowest values of N aren’t that likely—probability of N<1 is around 33%, but probability of N<10^-5 is around 15. It seems there’s at least a 10% chance that N is fairly close to 1, such that we wouldn’t expect much of a filter. This should carry through to our posterior such that there’s a 10% chance that there’s no future filter.
        ike 8 Oct 2020 23:23 UTC
        1 point
        0
        Parent
        You are talking about the calculations by Sandberg, Drexler, and Ord, right?
        Yes. Will read that post and get back to you.
        reference class whose effect then “cancels out” while you are, as you pointed out, trying to avoid reference classes to begin with.
        I don’t know that this is a meaningful distinction, being as both produce the same probabilities. All we need is a reference class large enough to contain anything that I might be / don’t currently know that I am not.
        Perhaps you should try to give your version a precise definition.
        SIA is a prior over observers, once you have a prior over universes. It says that for any two observers that are equally likely to exist, you are equally likely to “be” either one (and corresponding weighting for observers not equally likely to exist). We take this prior and condition on our observations to get posterior probabilities for being in any particular universe as any particular observer.
        I’m not conditioning on “ike exists”, and I’m not conditioning on “I exist”. I’m conditioning on “My observations so far are ike-ish” or something like that. This rules out existing as anyone other than me, but leaves me agnostic as to who “I” am among the group of observers that also have had the same set of observations. And the SIA prior means that I’m equally likely to be any member of that set, if those members had an equal chance of existing.
        Why and how would we plausibly reject this widely held possibility?
        If it’s incoherent, it doesn’t matter how many people believe it.
        On the contrary, it seems that SIA presumptuously requires us to categorically reject the sphere and the torus possibility on pure a priori grounds because they imply a universe finite in size and thus with way too few observers.
        You’re smuggling in a particular measure over universes here. You absolutely need to do the math along with priors and justification for said priors, you can’t just assert things like this.
        An infinite universe may arise from quite simple laws and initial conditions, which would make its information content low, and its probability relatively high.
        It’s not clear to me this counts as an infinite universe. It should repeat after a finite amount of time or space or both, which makes it equivalent to a finite universe being run on a loop, which doesn’t seem to count as infinite. That’s assuming all of this talk is coherent, which it might not be—our bandwidth is finite and we could never verify an infinite statement.
        Well, SIA seems to predict that we will encounter future evidence which would imply a finite size of the universe with probability 0.
        You need to specify the measure, as above. I disagree that this is an implication of SIA.
        cubefox 11 Oct 2020 0:00 UTC
        2 points
        0
        Parent
        
        SIA is a reason to expect very low values of N to be unlikely, since we would be unlikely to exist if N was that low. But the lowest values of N aren’t that likely—probability of N<1 is around 33%, but probability of N<10^-5 is around 15. It seems there’s at least a 10% chance that N is fairly close to 1, such that we wouldn’t expect much of a filter. This should carry through to our posterior such that there’s a 10% chance that there’s no future filter.
        
        I’m not quite sure I understand you here… Let me unpack this a little.
        
        SIA is a reason to expect very low values of N to be unlikely, since we would be unlikely to exist if N was that low.
        
        Yes, but not only that, according to SIA our existence is also a reason to expect high values of N to be likely, since we are more likely to exist if N is higher. But Sandberg, Drexler, and Ord (SDO) do not include this consideration. Instead, they identify the probability P(N<1) with the probability of us being alone in the galaxy (repeatedly, e.g. on page 5). But that’s simply a mistake. P(N<1) is just the probability that a galaxy like ours is empty. (Or rather it is close to that probability, which is actually about e^-N as they say in footnote 3). But the probability of us being alone in the galaxy, i.e. that no other civilizations besides us exist in the galaxy, is rather the probability that at most one civilization exists in the galaxy, given that at least one civilization (us) exists in the galaxy. To calculate this would amount to apply SIA. Which they didn’t do. This mistake arguably breaks the whole claim of the paper.
        
        It seems there’s at least a 10% chance that N is fairly close to 1, such that we wouldn’t expect much of a filter. This should carry through to our posterior such that there’s a 10% chance that there’s no future filter.
        
        What do you mean with “fairly close to one” here? SDO calculate densities, so we would need a range here. Maybe 0.9<N<1.1? 0.99<N<1.01? 0.5<N<1.5? I don’t even know how to interpret such fraction intervals, given that we can’t have a non-integer number of civilizations per galaxy.
        
        The whole probability distribution for N should have been updated on the fact that N is at least 1. (They actually consider an update later on in the paper, but not on our existence, but on the Fermi observation, i.e. that we don’t see signs of ETI.)
        
        I’m not conditioning on “ike exists”, and I’m not conditioning on “I exist”. I’m conditioning on “My observations so far are ike-ish” or something like that. This rules out existing as anyone other than me, but leaves me agnostic as to who “I” am among the group of observers that also have had the same set of observations. And the SIA prior means that I’m equally likely to be any member of that set, if those members had an equal chance of existing.
        
        This sounds interesting. The “or something like that” is crucial of course… Last time I thought your version of SIA might actually be close to FNC (Full Non-indexical Conditioning) by Radford Neal, which is mostly equivalent in results to SIa. But your “My observations so far are ike-ish” does have an indexical (“my”) in it, while FNC ignores all indexical evidence. (This is initially a big advantage, since it is an open question how beliefs with indexicals, so-called self-locating credences, should be modelled systematically in Bayesian reasoning, which leads to the need for additional ad-hoc principles like SSA or SIA.) As far as I understand it, FNC conditions rather on something like “Someone has exactly this state of mind: [list of ike’s total evidence, including memories and current experience]”. Note that this is not a self-locating probability. But FNC (in contrast to SIA) leads to strange results when there are so many observers in the universe that it becomes virtually certain that there is someone (not necessarily you) with the same mind as you, or even certain that there exist an observer for any possible state of mind.
        
        Maybe you know this already, but if not and if you are interested: in Neal’s original paper there is a rather compact introduction to FNC from page 5 to 9, i.e. sections 2.1 to 2.3. The rest of the paper is not overly important. The paper is here: https://arxiv.org/abs/math/0608592 I’m saying this because you seem to have some promising intuitions which Neal also shares, e.g. he also wants to do away with the artificial “canceling out” of reference classes in SIA, and because FNC is, despite its problem with large universes, in some way an objective improvement over SIA, because it basically falls out of standard Bayesian updating if you ignore indexical information, in contrast to principles like SSA or SIA.
        
        But if your approach really needs indexicals it still sounds plausible. Though there are some open questions related to indexicals. How should the unconditional probability of “My observations so far are ike-ish” be interpreted? For you, this probability is one, presumably. For me it is zero, presumably. But what is it from a gods-eye perspective? Is it undefined, because then “my” has no referent, as dadadarren seems to suggest? Or can the “my” be replaced? Maybe with “The observations of a random observer, who, according to ike’s evidence, might be ike, are ike-ish”?
        
        This rules out existing as anyone other than me, but leaves me agnostic as to who “I” am among the group of observers that also have had the same set of observations.
        
        Actually this is a detail which doesn’t seem to me quite right. It seems you are rather agnostic about who are you among the group of observers that, from your limited knowledge, might have had the same set of observations as you.
        
        You’re smuggling in a particular measure over universes here. You absolutely need to do the math along with priors and justification for said priors, you can’t just assert things like this.
        
        The priors are almost irrelevant. As long as an infinite universe with infinite observers has a prior probability larger than 0 being in such an universe is infinitely more likely than being in a universe with finitely many observers. But given that cosmologists apparently find an infinite universe the most plausible possibility, the probability should arguably be estimated much higher than 0%, apparently many of them find it higher than 50% if they believe in an infinite universe. Let’s assume an infinite and an finite universe mit infinitely many observers are equally likely. Then the odds for being in on of those universes are, according to SIA, n:infinite, where n is the number of observers in the finite universe. We could wheigh these odds by almost any prior probabilities other than 50%/50% and the result wouldn’t change: Infinite weighted by any non-zero probability is still infinite, and n stays a finite number regardless. It will always be infinitely more likely to be in the universe with infinitely many observers. So there are only two possibilities: either the prior probability of a universe with infinitely many observers is not 0, then SIA says we live in such an infinite universe with probability 1. Or the prior probability of and infinite universe is 0, then SIA leaves it at 0.
        
        It’s not clear to me this counts as an infinite universe. It should repeat after a finite amount of time or space or both, which makes it equivalent to a finite universe being run on a loop, which doesn’t seem to count as infinite.
        
        Why not? You might then have exact doppelgängers but you are not them. They are different persons. If you have a headache, your doppelgänger also has a headache, but you feel only your headache and your doppelgänger feels only his. If there are infinitely many of those doppelgängers, we have infinitely many persons. Also, a universe with infinite complexity would also have doppelgängers. Apart from that, simple laws and initial conditions can lead to chaotic outcomes, which are indistinguishable from random ones, i.e. from ones with infinite information content. Consider the decimal expansion of pi. It is not periodic like a rational number, it looks like a random number. Yet it can be generated with a very short algorithm. It is highly compressible, a random number is not, but this is the only qualitative difference. Another example are cellular automata like Conway’s Game of Life, or fractals like the Mandelbrot set. Both show chaotic, random-looking behavior from short rules/definitions. Such a infinite universe with pseudo randomness might be nearly indistinguishable from one with infinite information content.
        
        That’s assuming all of this talk is coherent, which it might not be—our bandwidth is finite and we could never verify an infinite statement.
        
        It depends on what you mean with “verify”. If you mean “assign probability 1 to it” then almost nothing can be verified, not even that you have a hand. (You might be deceived by a Cartesian demon into thinking there is an external world.) If you mean with “verify”, as you suggested in your last comment, to assign some probability after gaining evidence, then this is just updating.
        ike 11 Oct 2020 2:35 UTC
        1 point
        0
        Parent
        I don’t even know how to interpret such fraction intervals, given that we can’t have a non-integer number of civilizations per galaxy.
        N is the average number of civilizations per galaxy.
        But the probability of us being alone in the galaxy, i.e. that no other civilizations besides us exist in the galaxy, is rather the probability that at most one civilization exists in the galaxy, given that at least one civilization (us) exists in the galaxy. To calculate this would amount to apply SIA.
        I was going to agree with this, but I realize I need to retract my earlier agreement with this statement to account for the difference between galaxies and the observable universe. We don’t, in fact, have evidence for the “fact that N is at least 1.” We have evidence that the number of civilizations in the universe is at least one. But this is likely to be true even if the probability of a civilization arising on any given galaxy is very low.
        I think I agree with you that SIA means higher values of N are higher a priori. But I’m not sure this leads to the overwhelming evidence of a future filter that you need, or much evidence for a future filter at all.
        I’ll also note that some of the parameters are already adjusted for such effects:
        As noted by Carter and McCrea [10] the evidential power of the early emergence of life on Earth is weakened by observer selection effects, allowing for deep uncertainty about what the natural timescale of life formation is
        You’ve succeeded in confusing me, though, so I’ll have to revisit this question at a later point.
        But what is it from a gods-eye perspective?
        It doesn’t seem meaningful to ask this.
        It seems you are rather agnostic about who are you among the group of observers that, from your limited knowledge, might have had the same set of observations as you.
        If some observer only has some probability of having had the same set of observations, then they get a corresponding weight in the distribution.
        As long as an infinite universe with infinite observers has a prior probability larger than 0 being in such an universe is infinitely more likely than being in a universe with finitely many observers.
        This breaks all Bayesian updates as probabilities become impossible to calculate. Which is a great reason to exclude infinite universes a priori.
        You might then have exact doppelgängers but you are not them.
        I don’t see any meaningful sense in which this is true.
        Such a infinite universe with pseudo randomness might be nearly indistinguishable from one with infinite information content.
        I don’t know how this is relevant.
        It depends on what you mean with “verify”.
        I wrote two posts on this: https://www.lesswrong.com/posts/PSichw8wqmbood6fj/this-territory-does-not-exist and https://www.lesswrong.com/posts/zm3Wgqfyf6E4tTkcG/the-short-case-for-verificationism. I don’t think ontological claims are meaningful except insofar as they mean a set of predictions, and infinite ontological claims are meaningless under this framework.
        cubefox 11 Oct 2020 16:39 UTC
        1 point
        0
        Parent
        
        N is the average number of civilizations per galaxy.
        
        I was going to agree with this, but I realize I need to retract my earlier agreement with this statement to account for the difference between galaxies and the observable universe. We don’t, in fact, have evidence for the “fact that N is at least 1.” We have evidence that the number of civilizations in the universe is at least one. But this is likely to be true even if the probability of a civilization arising on any given galaxy is very low.
        
        SDO treat N as the expected number of civilizations in the Milky Way, i.e. in our galaxy (page 2):
        
        The Drake equation was intended as a rough way to estimate of the number of detectable/contactable civilizations in the Milky Way
        
        If they interpret N in this way, then N is at least 1. They didn’t account for this fact in a systematic way, even if some parameter estimations already should include some such considerations. (From your quote I don’t find it clear whether this is really the case. Also SIA is a fairly new theory and as such unlikely to play a significant role in the historical estimates they looked at).
        
        But what is it from a gods-eye perspective?
        
        It doesn’t seem meaningful to ask this.
        
        It just occurred to me that you still need some prior probability for your sentence which is smaller than 1. If you condition on “My observations so far a ike-ish” and this statement for you has unconditional probability 1, then conditioning on it has not effect. Conditioning on a probability 1 statement is like not conditioning at all. But what is this prior probability and how could it be smaller than 1 for you? It seems to be necessarily true for you. I guess we are forced to consider some non-indexical (gods-eye) version of that statement, e.g. like the one I suggested in my last comment. Also your characterization of (your version of) SIA was quite informal, so there is room for improvement. My personal goal would be to make SIA (or a similar principle) nothing more than a corollary of Bayesian updating, possibly together with a general theory of indexical beliefs.
        
        If some observer only has some probability of having had the same set of observations, then they get a corresponding weight in the distribution.
        
        Good idea. Maybe it is not just the probability that the hypothetical observer had the same observations, it’s the probability that the hypothetical observer exists and had the same observations. Not just what observations observers made is often a guess but also how many of them exist. Also, I don’t think “had the same observations” is quite right to characterize the “total evidence”. Because there could be observers like a Swamp Man (or Boltzmann brain etc) which have the same state of mind as you, and thus arguably the same total evidence, but whose memories formed just by accident and not because they actually made the experiences/observations they think they remember. So I think “has the same state of mind” is better to not exclude those freak observers to begin with, because we might be such a freak observer.
        
        This breaks all Bayesian updates as probabilities become impossible to calculate.
        
        I think you are referring to what is known as the measure problem in cosmology: What is the probability that a cow is two-headed if there are infinitely many one-headed and two-headed cows in the universe? Surely it is still much more probable that a cow is one-headed. There are apparently several solutions proposed in cosmology. For a universe with is spatially infinite I would estimate the probability of a cow to be one-headed by the ratio of the expected number of one-headed cows to the expected number of cows—in a growing imaginary sphere around us. The sphere is of finite size and we take the probability of a cow being one-headed as the limit of the ratio as the size of the sphere goes towards infinity. Then surely the sphere at any finite size contains much more one-headed cows than two-headed cows (the latter are estimated at a much larger number because two-headedness is not evolutionary advantageous for cows). There are other proposed solutions. I think one can be optimistic here that probabilities are not impossible to calculate.
        
        Which is a great reason to exclude infinite universes a priori.
        
        I think the measure problem is merely a practical problem for us. Which would be an instrumental reason not to consider infinite universes if we don’t like to work on the measure problem (if only considering universes with finite size has higher utility for us). But we would need an epistemic reason, in contrast to an instrumental reason, to a priori exclude a possibility by assigning it probability 0. I think there are three types of epistemic reasons to do this:
        
        if we think that the idea of an infinite universe is logically contradictory (that seems not to be the case)
        
        if we think that an infinite universe is infinitely unlikely (That seems only the case for infinite universes with infinite information content. But infinite universes can plausibly have finite and even low finite information content.)
        
        If something is not the case to which we have direct epistemic access. I currently do not have a headache. Since we are perfectly competent in judging the contents of our mind, and a headache is in the mind, my probability of “I have a headache” is 0. (Unlike headaches and other observational evidence, infinite universes are not mental objects, so that option is also not viable here.)
        
        To highlight the difference between practical/instrumental reasons/rationality and epistemic reasons/rationality: Consider Pascal’s Wager. Pascal argued that believing in God has higher expected utility than not believing or being agnostic. Whether that argument goes through is debatable, but in any case it doesn’t show that God exists (that his existence is likely). If subjectivly assigning high probability to a hypothesis has high utility, that doesn’t mean that this hypothesis actually has high probability. And the other way round.
        
        Such a infinite universe with pseudo randomness might be nearly indistinguishable from one with infinite information content.
        
        I don’t know how this is relevant.
        
        You seemed to specifically object to universes with finite information content on grounds that they are just (presumably periodic) “loops”. But they need not be any more loopy than universes with infinite information content.
        
        I wrote two posts on this: https://www.lesswrong.com/posts/PSichw8wqmbood6fj/this-territory-does-not-exist and https://www.lesswrong.com/posts/zm3Wgqfyf6E4tTkcG/the-short-case-for-verificationism. I don’t think ontological claims are meaningful except insofar as they mean a set of predictions, and infinite ontological claims are meaningless under this framework.
        
        But you seem to be fine with anything on which you could possibly update. E.g. there could be evidence for or against the plane topology of the universe. The plane topology means the universe is infinitely large. And as I said, SIA seems to make the significant prediction that evidence which implies a finite universe has probability 0.
        
        I know this opens a huge can of worms, but I also wanted to comment on this one:
        
        By talking about the unseen causes of visible events, it is often possible for me to compress the description of visible events. By talking about atoms, I can compress the description of the chemical reactions I’ve observed. Sure, but a simpler map implies nothing about the territory.
        
        If hypotheses (e.g. about the existence of hands and chairs and rocks and electrons and forces and laws) which assume the existence of things external to our mind greatly reduce the information content of our mental evidence, then those hypotheses are more likely to be true than a pure phenomenological description of the evidence itself. Because lower information content means higher a priori probability. If you entertained the hypothesis that solipsism is true, this would not compress your evidence at all, which means the information content of that hypothesis would be very high, which means it is very improbable. The map/territory analogy is not overly helpful here I think. If you mean with map “hypotheses”, then simpler hypotheses do in fact (probabilistically) “imply” something about the world, because simpler hypotheses are more likely to be true.
        
        Another point: There many people who say that the main task of science is not to make useful technology, or to predict the future, but to explain our world. If you have some evidence E and a hypothesis H, and that hypothesis is supposed to explain your evidence, then that explanation is correct if and only if the following is true:
        
        E because H.
        
        But the truth of any statement of the form of “y because x” arguably implies the truth of x and y. So H must be true in order to correctly explain your evidence. If H is true, and H asserts the existence of things external to your mind (hands, chairs, laws etc.) then those things exist. Almost any hypothesis talks about objects external to your mind. In fact, we wouldn’t even call beliefs about objects internal to our mind (“I have a headache”, “I have the visual impression of a chair in front of me”, “I have a memory of eating pizza yesterday”) hypotheses at all, we would just call them “evidence”. If no external things exist, then all “y because x” statements would be false.
        
        I’m not sure about your argument involving the “level IV multiverse”. I think it is equivalent to modal realism (everything which possibly exists, exists). I’m not sure whether the information content of that hypothesis is high or low. (It is infinite if we think of it as a long description of every possible world. If the information content is very high, then the hypothesis is likely to be false, which would justify our belief that it is false. If it is in fact false, we have a justified true belief in the falsity of modal realism. Since this is not a Gettier case, we then would know that modal realism is false.)
        ike 11 Oct 2020 17:29 UTC
        1 point
        0
        Parent
        If they interpret N in this way, then N is at least 1.
        No, N is a prior. You can’t draw conclusions about what a prior is like that. N could be tiny and there could be a bunch of civilizations anyway, that’s just unlikely.
        It just occurred to me that you still need some prior probability for your sentence which is smaller than 1.
        Sure, prior in the sense of an estimate before you learn any of your experiences. Which clearly you’re not actually computing prior to having those experiences, but we’re talking in theory.
        My personal goal would be to make SIA (or a similar principle) nothing more than a corollary of Bayesian updating, possibly together with a general theory of indexical beliefs.
        SIA is just a prior over what observer one expects to end up with.
        Maybe it is not just the probability that the hypothetical observer had the same observations, it’s the probability that the hypothetical observer exists and had the same observations. Not just what observations observers made is often a guess but also how many of them exist.
        I’m not sure what distinction you’re drawing here. Can you give a toy problem where your description differs from mine?
        So I think “has the same state of mind” is better to not exclude those freak observers to begin with, because we might be such a freak observer.
        My usual definition is “subjectively indistinguishable from me”, you can substitute that above.
        The sphere is of finite size and we take the probability of a cow being one-headed as the limit of the ratio as the size of the sphere goes towards infinity.
        This is basically just downweighting things infinitely far away infinitely low. It’s accepting unboundedness but not infinity. Unboundedness has its own problems, but it’s more plausible than infinity.
        But we would need an epistemic reason, in contrast to an instrumental reason, to a priori exclude a possibility by assigning it probability 0.
        I’m not assigning it probability 0 so much as I’m denying that it’s meaningful. It doesn’t satisfy my criterion for meaning.
        You seemed to specifically object to universes with finite information content on grounds that they are just (presumably periodic) “loops”.
        That’s one objection among several, but the periodicity isn’t the real issue—even without that it still must repeat at some point, even if not regularly. All you really have is an irrational set of ratios between various “states of the world”, calling that “infinity” seems like a stretch.
        those hypotheses are more likely to be true
        What do you mean by true here?
        Because lower information content means higher a priori probability.
        Probability is just a means to predict the future. Probabilities attached to statements that aren’t predictive in nature are incoherent.
        If you entertained the hypothesis that solipsism is true, this would not compress your evidence at all, which means the information content of that hypothesis would be very high, which means it is very improbable.
        The same thing is true of the “hypothesis” that solipsism is false. It has no information content. It’s not even meaningful to say that there’s a probability that it’s true or false. Neither is a valid hypothesis.
        If no external things exist, then all “y because x” statements would be false.
        The problem with this line of reasoning is that we commonly use models we know are false to “explain” the world. “All models are wrong, some models are useful”.
        Also re causality, Hume already pointed out we can’t know any causality claims.
        Also, it’s unclear how an incoherent hypothesis can serve to “explain” anything.
        I think explanations are just fine without assuming a particular metaphysics. When we say “E because H”, we just mean that our model H predicts E, which is a reason to apply H to other predictions in the future. We don’t need to assert any metaphysical statements to do that.
        Expand this thread
        cubefox 12 Oct 2020 2:09 UTC
        1 point
        0
        Parent
        
        No, N is a prior. You can’t draw conclusions about what a prior is like that. N could be tiny and there could be a bunch of civilizations anyway, that’s just unlikely.
        
        I just quoted the paper. It stated that N is the expected number of civilizations in the Milky Way. If that is the case, we have to account for the fact that at least one civilization exists. Which wasn’t done by the authors. Otherwise N is just the expected number of civilizations in the Milky Way under the assumption we didn’t knew that we existed.
        
        Sure, prior in the sense of an estimate before you learn any of your experiences. Which clearly you’re not actually computing prior to having those experiences, but we’re talking in theory.
        
        “before you learn any experience”? I.e. before you know you exist? Before you exist? Before the “my” refers to anything? You seem to require exactly what I suspected: a non-indexical version of your statement.
        
        SIA is just a prior over what observer one expects to end up with.
        
        There are infinitely many possible priors. One would need a justification that the SIA prior is more rational than the alternatives. FNC made much progress in this direction by only using Bayesian updating and no special prior like SIA. Unfortunately there are problems with this approach. But I think those can be fixed without needing to “assume” some prior.
        
        This is basically just downweighting things infinitely far away infinitely low.
        
        All things in the universe get weighted and all get weighted equally. Things just get weighted in a particular order, nearer things get weighted “earlier” so to speak (not in a temporal sense), but not with more weight.
        
        It’s accepting unboundedness but not infinity. Unboundedness has its own problems, but it’s more plausible than infinity.
        
        “Unboundednes” is means usually something else. A universe with a sphere or torus topology is unbounded but finite in size. I’m talking about a plane topology universe here which is both unbounded and infinitely large.
        
        But you seem to have something like hyperreal numbers in mind when you talk about infinity. Hyperreal numbers include “infinite numbers” (the first is called omega) which are larger than any real number. But if cosmologists talk about a universe which is spatially infinite, they only say that for any positive real number n, there is a place in the universe which is at least n+1 light-years away. They do not say “there is something which is omega light-years away”. They do not treat infinite as a (kind of) number. That’s more of a game played by some mathematicians who sometimes like to invent new numbers.
        
        I’m not sure what distinction you’re drawing here. Can you give a toy problem where your description differs from mine?
        
        You might be certain that 100 observers exist in the universe. You are not sure who might be you, but one of the observers you regard as twice as likely to be you as each of the other ones, so you weigh it twice a strong.
        
        But you may also be uncertain of how many observers exist. Say you are equally uncertain about the existence of each of 99 and twice as certain about the existence of a hundredth one. Then you weigh it twice as strong. (I’m not quite sure whether this is right.)
        
        That’s one objection among several, but the periodicity isn’t the real issue—even without that it still must repeat at some point, even if not regularly.
        
        Even in a finite universe there might be repetition. Possibly our universe is finite and contains not only Earth but also a planet we might call Twin-Earth very far away from Earth. Twin-Earth is a perfect duplicate of Earth. It’s even called “Earth” by twin-earthlings. If a person X on Earth moves only his left arm, Twin-X on Twin-Earth also moves only his left arm. But this is merely (perfect) correlation, there is no stronger form of dependence, like counterfactual dependence. If X had moved his right arm instead, Twin-X still had moved only his left arm. This could not be the case if X and Twin-X were identical. Also, if X hurts his foot, Twin-X will also hurt his foot, but X will only feel the pain caused by X’s foot and not the pain caused by the foot of Twin-X. They don’t share a single mind.
        
        All you really have is an irrational set of ratios between various “states of the world”, calling that “infinity” seems like a stretch.
        
        I would rather say that it’s a stretch to regard infinity as a ordinary number, as you are apparently doing. The limit view of infinity doesn’t do this. “Infinity” then just means that for any real number there is another real number which is larger (or smaller).
        
        those hypotheses are more likely to be true
        
        What do you mean by true here?
        
        What we usually mean. But you can remove “to be true” here and the meaning of the sentence stays the same.
        
        Probability is just a means to predict the future.
        
        We can perfectly well (and do all the time) make probabilistic statements about the present or the past. I suggest to regard probability not so much as a “means” but as a measure of uncertainty, where P(A)=1/2 means I am (or perhaps: I should be) perfectly uncertain whether A or not A. This has nothing to do with predictions. (But as I said, the hypothesis of an infinite universe makes predictions anyway.)
        
        Probabilities attached to statements that aren’t predictive in nature are incoherent.
        
        Where is the supposed “incoherence” here?
        
        The best characterization of incoherence I know treats it as a generalization of logical contradiction: A and B are (to some degree) incoherent if P(A and B) < P(A)*P(B). Negative statistical dependence. I.e. each one is evidence against the other. But you seem to mean something else.
        
        The same thing is true of the “hypothesis” that solipsism is false. It has no information content.
        
        It is verified by just a single non-mental object. It has information content, just a very low one. Not as low as “something exists” (because this is also verified by mental objects) but still quite low. Only tautologies have no (i.e. zero) information content.
        
        The problem with this line of reasoning is that we commonly use models we know are false to “explain” the world. “All models are wrong, some models are useful”.
        
        The common answer to that is that Newton’s theory of gravity isn’t so much wrong as it is somewhat inaccurate. A special case of Einstein’s more accurate theory. A measure of (in)accuracy is generalization error in statistics. Low generalization error seems to be for many theories what truth is for ordinary statements. And if we would say of an ordinary statement A that it is “more likely” than an other statement B we would say that a theory X has a “lower expected” generalization error than a theory Y.
        
        Also re causality, Hume already pointed out we can’t know any causality claims.
        
        Well, not only that! Hume also said that no sort of inductive inference is justified, probabilistic or not, so all predictions would be out of the window, not just ones about causal relationships. Because the evidence is almost always consistent with lots of possible but incompatible predictions. I would say that an objective a priori probability distribution over hypotheses (i.e. all possible statements) based on information content solves the problem. For indexical hypotheses I’m not quite certain yet, maybe there is something similar objective for an improved version of SIA. If there is no objective first prior then Hume is right and verificationism is wrong. What you predict would rely on an arbitrary choice of prior probabilities.
        
        I think explanations are just fine without assuming a particular metaphysics. When we say “E because H”, we just mean that our model H predicts E, which is a reason to apply H to other predictions in the future. We don’t need to assert any metaphysical statements to do that.
        
        That doesn’t work for many reasons. Some barometer reading predicts a storm, but it doesn’t explain it. Rather there is a common explanation for both the barometer reading and the storm: air pressure.
        
        Also, explanation (because statements) are asymmetric. If B because A then not A because B. But prediction is symmetric: If A is evidence for B, then B is evidence for A. Because one is evidence for the other if both are positively probabilistically dependent (“correlated”). P(A|B) > P(A) implies P(B|A) > P(B). The rain predicts the wet street, so the wet street predicts the rain. The rain explains the wet street, so the wet street doesn’t explain the rain.
        
        There are even some cases where H explains E but H and E don’t predict each other, i.e. they are not positively statistically dependent. These cases are known as Simpson’s paradox.
        ike 12 Oct 2020 3:01 UTC
        1 point
        0
        Parent
        I just quoted the paper. It stated that N is the expected number of civilizations in the Milky Way. If that is the case, we have to account for the fact that at least one civilization exists. Which wasn’t done by the authors. Otherwise N is just the expected number of civilizations in the Milky Way under the assumption we didn’t knew that we existed.
        The update we need to do is not equivalent to assuming N is at least one, because as I said, N being less than one is consistent with our experiences.
        “before you learn any experience”? I.e. before you know you exist? Before you exist? Before the “my” refers to anything?
        Yes, it gets awkward if you try to interpret the prior literally. Don’t do that, just apply the updating rules.
        There are infinitely many possible priors. One would need a justification that the SIA prior is more rational than the alternatives.
        SIA as a prior just says it’s equally likely for you to be one of two observers that are themselves equally likely to exist. Any alternative will necessarily say that in at least one such case, you’re more likely to be one observer than the other, which violates the indifference principle.
        You might be certain that 100 observers exist in the universe. You are not sure who might be you, but one of the observers you regard as twice as likely to be you as each of the other ones, so you weigh it twice a strong.
        But you may also be uncertain of how many observers exist. Say you are equally uncertain about the existence of each of 99 and twice as certain about the existence of a hundredth one. Then you weigh it twice as strong.
        I’m not sure where my formulation is supposed to diverge here.
        “Infinity” then just means that for any real number there is another real number which is larger (or smaller).
        Well, this is possible without even letting the reals be unbounded. For any real number under 2, there’s another real number under 2 that’s greater than it.
        We can perfectly well (and do all the time) make probabilistic statements about the present or the past.
        And those statements are meaningless except insofar as they imply predictions about the future.
        Where is the supposed “incoherence” here?
        The statement lacks informational content.
        It is verified by just a single non-mental object.
        I don’t know what this is supposed to mean. What experience does the statement imply?
        Low generalization error seems to be for many theories what truth is for ordinary statements.
        Sure, I have no problem with calling your theory true once it’s shown strong predictive ability. But don’t confuse that with there being some territory out there that the theory somehow corresponds to.
        objective a priori probability distribution over hypotheses (i.e. all possible statements) based on information content
        Yes, this is SIA + Solomonoff universal prior, as far as I’m concerned. And this prior doesn’t require calling any of the hypotheses “true”, the prior is only used for prediction. Solomonoff aggregates a large number of hypotheses, none of which are “true”.
        Some barometer reading predicts a storm, but it doesn’t explain it.
        The reading isn’t a model. You can turn it into a model, and then it would indeed explain the storm, while air pressure would explain it better, by virtue of explaining other things as well and being part of a larger model that explains many things simply (such as how barometers are constructed.)
        prediction is symmetric:
        A model isn’t an experience, and can’t get conditioned on. There is no symmetry between models and experiences in my ontology.
        The experience of rain doesn’t explain the experience of the wet street—rather, a model of rain explains / predicts both experiences.