
The Noonday argument: fine-graining, indexicals, and the nature of Copernican reasoning

Published online by Cambridge University Press:  22 March 2023

Brian C. Lacki*
Affiliation:
Breakthrough Listen, Astronomy Department, University of California, Berkeley, CA, USA
*
Author for correspondence: Brian C. Lacki, E-mail: astrobrianlacki@gmail.com

Abstract

Typicality arguments attempt to use the Copernican Principle to draw conclusions about the cosmos and presently unknown conscious beings within it, including extraterrestrial intelligences (ETI). The most notorious is the Doomsday Argument, which purports to constrain humanity's future from its current lifespan alone. These arguments rest on a likelihood calculation that penalizes models in proportion to the number of distinguishable observers. I argue that such reasoning leads to solipsism, the belief that one is the only being in the world, and is therefore unacceptable. Using variants of the ‘Sleeping Beauty’ thought experiment as a guide, I present a framework for evaluating observations in a large cosmos: Weighted Fine Graining (WFG). WFG requires the construction of specific models of physical outcomes and observations. Valid typicality arguments then emerge from the combinatorial properties of third-person physical microhypotheses. Indexical (observer-relative) facts do not directly constrain physical theories, but instead weight different provisional evaluations of credence. As indexical knowledge changes, the weights shift. I show that the self-applied Doomsday Argument fails in WFG, even though it can work for an external observer. I argue that the Copernican Principle does not let us apply self-observations to constrain ETIs.

Type
Research Article
Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2023. Published by Cambridge University Press

Background

What can we learn from the simple fact that we exist where and when we do? The answer may bear on many profound questions, including the nature and size of the cosmos, the existence and types of extraterrestrial intelligence, and the future of humanity.

Attempts to reason about the cosmos from the fact of our existence have been called anthropic reasoning (e.g. Leslie, 1996; Bostrom, 2013). The anthropic principle argues that our existence is expected even if the events that led up to it are rare (Carter, 1974). In its weakest form, it simply asserts that humanity's existence is probabilistically likely in a large enough universe (Carter, 1983). The strongest versions of the anthropic principle have an opposite premise at heart (Bostrom, 2013): our existence is not a fluke, but somehow necessary in a logical sense (Barrow, 1983). Anthropic reasoning has been extended to include Copernican principles, which emphasize the typicality of our environment. Weak forms point out that our evolution was not a special rupture in the laws of physics, but one possible outcome that can be repeated given enough ‘trials’ in sufficiently many cosmic environments like ours. Strong forms state that conscious beings (‘observers’) like ourselves are common.

Anthropic reasoning frequently tries to synthesize the anthropic and Copernican principles: we should regard our circumstances as typical of observers like ourselves. Bostrom (2013) has formalized this notion as the Self-Sampling Assumption (SSA). The group of observers considered similar enough to us for Copernican reasoning to be valid is our ‘reference class’. It may be as wide as all possible sentient beings or as narrow as people exactly identical to your current self. There is no consensus on a single reference class, or indeed whether we might use a multitude (e.g. Neal, 2006; Garriga and Vilenkin, 2008; Bostrom, 2013), although the more extreme Copernican formulations apply the universal reference class of all observers. Typicality arguments are often justified in normal experiments to derive conclusions when unusual outcomes are expected given enough ‘trials’. In fact, some kind of typicality assumption seems necessary to reason about large cosmologies, where thanks to the anthropic principle there will exist observers like ourselves with certainty even in Universes distinct from ours – otherwise observations have no power to constrain the nature of the world (Bostrom, 2002; Bousso et al., 2008). Typicality is commonly invoked in discussions of cosmology as the ‘principle of mediocrity’ (Vilenkin, 1995).

The seemingly reasonable Copernican statement of the SSA has led to controversy, as it can be applied to constrain the cosmological contexts of as-of-yet unobservable intelligences in the Universe. Few applications of the Copernican principle are more contentious than the Doomsday Argument. In its most popular form, as presented by Gott (1993), we are most likely ‘typical’ humans and therefore are unlikely to be near humanity's beginning or end (see also Nielsen, 1989). The Bayesian Doomsday Argument, most strongly defended by Leslie (1996), has a more robust basis: our birthrank (the number of humans before us) is treated as a uniform random variable drawn from the set of all birthranks of the final human population. A larger human population has more ‘outcomes’, resulting in a smaller likelihood of ‘drawing’ your specific birthrank and a Bayesian shift favouring a short future for humanity (see also Knobe et al., 2006; Bostrom, 2013). A generalized non-ranked variant has been brought to bear to evaluate the existence of beings unlike ourselves in some way, as in the Cosmic Doomsday argument (Olum, 2004).

The Doomsday Argument and similar typicality arguments would have profound implications for many fields. For example, in the Search for Extraterrestrial Intelligence (SETI; Tarter, 2001), the prevalence of technological societies is critically dependent on the lifespan of societies like our own (e.g. Bracewell, 1960; Sagan, 1973; Forgan and Nichol, 2011). Doomsday would be an extraordinarily powerful argument against prevalent interstellar travel, to say nothing of more exotic possibilities like astronomical-scale ‘megastructures’ (Dyson, 1960; Kardashev, 1964). The instantaneous population of a galaxy-spanning society could be $\sim 10^{20}$ (Gott, 1993; Olum, 2004), while predictions of intergalactic travel and astro-engineering suggest populations greater than $10^{50}$ (Bostrom, 2003; Ćirković, 2004). Yet the Doomsday Argument applies huge Bayes factors against the viability of these possibilities or indeed any long future, essentially closing off the entire field of SETI as difficult to futile (Gott, 1993). Indeed, this is a straightforward implication of the Cosmic Doomsday argument, which is more well known in cosmology (Olum, 2004). Similar Bayesian shifts might drastically cut across theories in other fields.

This sheer power cannot be stressed enough. These likelihood ratios for broad classes of theories ($> 10^{40}$ in some cases!) imply more than mere improbability; they are far more powerful than those resulting from normal scientific observation. If we have any realistic uncertainty in such futures, even the slightest possibility of data being hoaxed or mistaken (say $10^{-9}$) results in epistemic closure. If we discovered a galaxy-spanning society, or if we made calculations implying that the majority of observers in the standard cosmology live in realms where the physical constants are different from ours, then the evidence would force us to conclude that scientists are engaged in a diabolical worldwide conspiracy to fake these data. The SSA can even lead to paradoxes where we gain eerie ‘retrocausal’ influence over the probabilities of past events by prolonging humanity's lifespan (Bostrom, 2001).

Given the unrealistic confidence of the Doomsday Argument's assertions, it is not surprising that there have been many attempts to cut down its power and either tame or refute typicality (e.g. Dieks, 1992; Kopf et al., 1994; Korb and Oliver, 1998; Monton and Roush, 2001; Olum, 2002; Neal, 2006; Bostrom, 2013; Benétreau-Dupin, 2015; Garisto, 2020). These include disputing that our self-observation can be compared to a uniformly drawn random sampling (Dieks, 1992; Korb and Oliver, 1998; Garisto, 2020), arguing for the use of much narrower reference classes (Neal, 2006; Bostrom, 2013; Friederich, 2017; Ćirković and Balbi, 2020), or rejecting the use of a single Bayesian credence distribution (Srednicki and Hartle, 2010; Benétreau-Dupin, 2015). The most common attack is the Self-Indication Assumption (SIA): if we really are drawn randomly from the set of possible observers, then any given individual is more likely to exist in a world with a larger population. The SIA demands that we adopt a prior in which the credence placed on a hypothesis is directly proportional to the number of observers in it, which is then cut down by the SSA using our self-observation (Dieks, 1992; Kopf et al., 1994; Olum, 2002; see also Neal, 2006 for further discussion).

The pitfall of these typicality arguments is the Presumptuous Philosopher problem, in which philosophical arguments give us ridiculous levels of confidence about otherwise plausible beings in the absence of observations. The problem was first stated for the SIA: the prior posits absurd levels of confidence (say $10^{100}$ to 1) in models with a very large number of observers (for example, favouring large universe cosmologies over small universe cosmologies; Bostrom and Ćirković, 2003; Ćirković, 2004). This confidence is ‘corrected’ in the Doomsday Argument, but a prior favouring a galactic future $\gtrsim 10^{20}$ to 1 is unlike how we actually reason, and there is no correction when comparing different cosmologies, for example (Bostrom and Ćirković, 2003; Ćirković, 2004). But the Doomsday Argument is a Presumptuous argument too, just in the opposite direction – one develops extreme certainty about far-off locations without ever observing them.

In this paper, I critically examine and deconstruct the Copernican typicality assumptions used in the Doomsday Argument and present a framework for understanding them, Fine Graining with Auxiliary Indexicals (FGAI).

Statement of the central problem

It is important to specify the issue at stake – what is being selected and what we hope to learn. The Doomsday Argument is sometimes presented as a way to prognosticate about our future specifically (Gott, 1993; Leslie, 1996; Bostrom, 2013). This question can become muddled if there are multiple ‘copies’ of humanity because the universe is large. Now, if we already know the fraction of societies that are long-lived versus short-lived, the fraction of observers at the ‘start’ of any society's history follows from a simple counting argument, with no Doomsday-like argument applying (Knobe et al., 2006; Garisto, 2020).

The real problem is that we have nearly no idea what this probability distribution is, aside from it allowing our existence. What typicality arguments actually do is attempt to constrain the probability distribution of observers throughout the universe. This is implicit even in Gott (1993), which argues that the negative results of SETI are expected, because the Doomsday Argument implies long-lived societies are rare and vast megastructure-building ETIs are nearly non-existent. Knobe et al. (2006) make this argument explicit with the ‘universal Doomsday argument’, suggesting a large fraction of observers are planetbound like us and not in galaxy-spanning societies. More generally, typicality arguments have been purported to constrain the distribution of other properties of ETIs, like whether they are concentrated around G dwarfs like the Sun instead of the more numerous red dwarfs (Haqq-Misra et al., 2018) or if ETIs in elliptical galaxies can greatly outnumber those in spiral galaxies (Zackrisson et al., 2016; Whitmire, 2020). These all are questions about the distribution of observers.

Even questions about our future can be recast into a question about lifespan distributions. Specifically, what is the lifespan distribution of societies that are exactly like our own at this particular point in our history? There presumably is only one such distribution, even if there are many instantiations of humanity in the Universe. If our future is deterministic, then this distribution is monovalued, with a single lifespan for all humanities out there including ours. More generally, we can imagine there is some mechanism (e.g. nuclear war or environmental collapse) that could cut short the life of societies like ours, and we try to evaluate the probability that it strikes a random society in our reference class.

Thus, what we are comparing are different theories about this distribution, with two important qualities. First, the theories are mutually exclusive in the Garisto (2020) sense – only one distribution can exist. Even if we suppose there is a multiverse with a panoply of domains, each with their own distribution (e.g. because large-scale travel and expansion is easier under certain physical laws than others), there is presumably only one distribution of these pockets, which in turn gives only one cosmic distribution for the observers themselves. Second, however, there is no ‘draw’ by an external observer when we are doing self-observation. All the observers are realized, none more real than the others. This makes it unlike other cases of exclusive selection.

The central problem, then, is this: given mutually exclusive theories about the distribution of observers, should we automatically prefer theories with narrower distributions, under which we are more typical, closer to the bulk? Should we do this even if the broader distributions have more total observers, so that both predict similar numbers of observers like us? Throughout this paper, I use the terms ‘Small’ (S) and ‘Large’ (L) to broadly group theories about unknown observers (as in Leslie, 1996). In Small theories, the majority of actually existing observers are similar to us and make similar observations of their environment, while Large theories propose additional, numerically dominant populations very dissimilar to us. The version of the Doomsday Argument that I focus on applies this to the distribution of the total final population (or lifespan). In this context, Small theories are ‘Short’, while Large theories are considered ‘Long’ (e.g. $N_{\rm total} \gg 10^{11}$ for human history).

I illustrate the Argument with some simple toy models. In the prototypical model, I consider only two distributions, both degenerate:

  • In the Small theory, all ‘worlds’ have the same small $N_{\rm total}$ value of 1.

  • In the Large theory, all ‘worlds’ have the same large $N_{\rm total}$ value, taken to be 2.

Each ‘world’ consists of an actually existing sequence of observers, ordered by birthrank, the time of their creation (note that world does not mean theory; there can be many ‘worlds’ in this sense but only one correct theory). The observers may be undifferentiated, or may have a distinct label that individuates them. The data each observer can acquire is their own birthrank, and their label if they have one. Numerous variants will be considered, however, including ones where additional data provide additional distinctions in the theory, and ones where $N_{\rm total}$ is fixed but not the distribution. I also consider models where there may be a mixture of actually existing worlds of different sizes, with or without a distribution known to the observer. To distinguish this variance from Small and Large theories, I denote small and large worlds with lowercase letters (s and ℓ), reserving uppercase S and L for mutually exclusive theories.

The Doomsday argument and its terrible conclusion

The Doomsday Argument is arguably the most far-reaching and contentious of the arguments from typicality. It generates enormous Bayes factors against its Large models, despite relatively plausible (though still very uncertain) routes to Large futures. By comparison, we have no specific theory or forecast implying that non-artificial inorganic lifeforms (c.f., Neal, 2006) or observers living under very different physical constants elsewhere in a landscape dominate the observer population by a factor of $\gg 10^{10}$. Cases where we might test typicality of astrophysical environments, like the habitability of planets around the numerically dominant red dwarfs (Haqq-Misra et al., 2018), generate relatively tame Bayes factors of ~10–100 that seem plausible (a relatively extreme value being $\sim 10^{4}$ for habitable planets in elliptical galaxies from Dayal et al., 2015; Whitmire, 2020). For this reason, it is worth considering Doomsday as a stringent test of the ‘Copernican Principle’.

Noonday, a parable

A student asked their teacher, how many are yet to be born?

The teacher contemplated, and said: ‘If the number yet to be born equalled the number already born, then we would be in the centre of history. Now, remember the principle of Copernicus: as we are not in the centre of the Universe, we must not be in such a special time. We must evaluate the p-value of a possible final population as the fraction of people who would be closer to the centre of history.

‘Given that 109 billion humans have been born so far,’ continued the teacher, citing Kaneda and Haub (2020), ‘the number of humans yet to be born is not between 98 and 119 billion with 95% confidence. There may be countless trillions yet to be born, or none at all, but if you truly believe in Copernicus's wisdom, you must be sure that the number remaining is not 109 billion.’

And so all who spoke of the future from then on minded the teacher's Noonday Argument.

Noonday, x-day, and the frequentist Doomsday argument

In its popular frequentist form, the Doomsday Argument asks: wouldn't it be strange if we happened to live at the very beginning or end of history? Given some measure of history z, like humanity's lifespan or population, let F ∈ [0,  1] be the fraction $z_{\rm present}/z_{\rm total}$ that has passed so far. If we regard F as a uniform random variable, we construct confidence intervals: F is then between [(1 − p)/2,  (1 + p)/2] with probability p. Thus for a current measure $z_{\rm present}$, our confidence limit on the total measure $z_{\rm total}$ is $[ 2 z_{\rm present}/( 1 + p) ,\; 2 z_{\rm present}/( 1 - p) ]$, allowing us to estimate the likely future lifespan of humanity (Gott, 1993). This form does not specifically invoke the SSA: F can parametrize measures like lifespan that have nothing to do with ‘observers’ and require no reference class. It is motivated by analogy to similar reasoning applied to external phenomena. Unlike the Bayesian Doomsday Argument, it penalizes models where F = 1. A more proper Doomsday Argument would constrain the $z_{\rm total}$ distribution, from which the expected F distribution can be derived, although the basic idea of finding the [(1 − p)/2,  (1 + p)/2] quantile still applies.

The parable of the Noonday Argument demonstrates the weakness of the frequentist Doomsday Argument: it is not the only confidence interval we can draw. The Noonday Argument gives another, one arguably even more motivated by the Copernican notion of us not being in the ‘centre’ (c.f., Monton and Roush, 2001). An infinite or zero future is maximally compatible with the Noonday Argument. But there is no reason to stop with Noonday either. Wouldn't it be strange if our F happened to be a simple fraction like 1/3, or some other mathematically significant quantity like 1/e, or indeed any random number we pick?

Doomsday and Noonday are just two members of a broad class of x-day Arguments. Given any x in the range [0,  1], the x-day Argument is the observation that it is extremely unlikely that a randomly drawn F will just happen to lie very near x. A confidence interval with probability p is constructed by excluding values of F in between $[ x - ( 1-p) /2]$ mod 1 and $[ x + ( 1-p) /2]$ mod 1, wrapping x as in the frequentist Doomsday Argument. The Doomsday Argument is simply the 0-day Argument and the 1-day Argument, whereas the Noonday Argument is the 1/2-day Argument.
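To make the construction concrete, here is a minimal numerical sketch (in Python) of these intervals, assuming only that $F = z_{\rm present}/z_{\rm total}$ is uniform on [0, 1]; the function names and the example numbers (109 billion births, 95% confidence) are illustrative rather than prescribed by the argument.

```python
# A minimal sketch of the frequentist x-day intervals, assuming F is uniform on [0, 1].
def xday_excluded_F(x, p):
    """F values (mod 1) excluded by the x-day Argument at confidence p."""
    return (x - (1 - p) / 2) % 1.0, (x + (1 - p) / 2) % 1.0

def gott_interval(z_present, p):
    """Gott-style Doomsday limits on z_total, from F in [(1-p)/2, (1+p)/2]."""
    return 2 * z_present / (1 + p), 2 * z_present / (1 - p)

# Doomsday (0/1-day): with 109 billion births so far, the 95% interval on the total
print(gott_interval(109e9, 0.95))     # ≈ (1.1e11, 4.4e12)
# Noonday (1/2-day): the excluded central band of F
print(xday_excluded_F(0.5, 0.95))     # (0.475, 0.525)
```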

By construction, all frequentist x-day Arguments are equally valid frequentist statements if F is truly a uniformly distributed random variable. Any notion that being near the beginning, the end, or the centre is ‘strange’ is just a subjective impression. The confidence intervals are disjoint but their union covers the entire range of possibilities, with an equal density for any p covering any single $z_{\rm total}$ value (Monton and Roush, 2001).

The Noonday Argument demonstrates that not every plausible-sounding Copernican argument is useful, even when technically correct. Confidence intervals merely summarize the effects of likelihood – which is small for Large models, as reflected by the relative compactness of the 0/1-day intervals. A substantive Doomsday-style Argument therefore requires something more than the probability of being near a ‘special’ time.

Bayesian Doomsday and Bayesian Noonday

Bayesian statistics models our levels of belief, or credences, as probabilities conditioned on observations. We have a set of some models we wish to constrain, and we start with some prior credence distribution over them. The choice of prior is subjective, but a useful prior when considering a single positive parameter λ of unknown scale is the flat log prior: $P^{\rm prior}(\lambda) \propto 1/\lambda$. An updated posterior credence distribution is calculated by multiplying prior credences by the likelihood of the observed data D in each model ${\boldsymbol \lambda}_i$, which is simply the probability ${\scr P}( D\vert {\boldsymbol \lambda _i})$ in that model that we observe D:

(1)$$P^{{\rm pos}} ( {\boldsymbol \lambda_j} \vert D) = \displaystyle {P^{{\rm prior}} ( {\boldsymbol \lambda_j}) {\scr P}( D \vert {\boldsymbol \lambda_j}) \over \sum_{{\boldsymbol \lambda_x} \in \Lambda} P^{{\rm prior}} ( {\boldsymbol \lambda_x}) {\scr P}( D \vert {\boldsymbol \lambda_x}) },\; $$

where Λ is the set of possible λ.
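As a concrete illustration of equation (1), here is a minimal sketch of the update for a discrete set of models under a flat log prior; the parameter values and likelihoods are placeholders invented for this example.

```python
# A minimal sketch of the update in equation (1) over a discrete model set.
def posterior(prior, likelihood):
    """Multiply prior credences by likelihoods and renormalize, per equation (1)."""
    weights = [p * l for p, l in zip(prior, likelihood)]
    norm = sum(weights)
    return [w / norm for w in weights]

lam = [1.0, 10.0, 100.0]               # candidate values of a positive scale parameter
prior = [1.0 / x for x in lam]         # flat log prior, unnormalized
likelihood = [0.5, 0.05, 0.005]        # assumed P(D | lambda) for some observed data D
print(posterior(prior, likelihood))    # ≈ [0.990, 0.0099, 0.000099]
```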

Bayesian probability provides a more robust basis for the Doomsday Argument, and an understanding of how it supposedly works. In the Bayesian Doomsday argument, we seek to constrain the distribution of $N_{\rm total}$, the final total population of humanity and its inheritors, using birthranks as the observable. The key assumption is to apply the SSA with all of humanity and its inheritors as our reference class. If we view ourselves as randomly selected from a society with a final total population of $N_{\rm total}$, our birthrank $N_{\rm past}$ is drawn from a uniform distribution over $[ 0,\; N_{\rm total} - 1]$, with a likelihood of

(2)$${\scr P} ( N_{\rm past} \vert N_{\rm total} = n) = \left\{\matrix{ 1/n \hfill & n \ge N_{\rm past}\hfill \cr 0 \hfill & n < N_{\rm past} . \hfill }\right.$$

Since we are deciding between different distributions of $N_{\rm total}$, we must average this over the probability that a randomly selected observer is from a society with $N_{\rm total}$,

(3)$${\scr P}_o ( N_{\rm total} = n) = {n f( N_{\rm total} = n) \over \sum_{n^{\prime} = 1}^{\infty} n^{\prime} f( N_{\rm total} = n^{\prime}) },\; $$

where $f( N_{\rm total} = n)$ is the probability that a randomly selected society has a final population $N_{\rm total}$. This gives a likelihood for the distribution f of

(4)$$\eqalign{P ( N_{\rm past} \vert f) & = \sum_{n = N_{\rm past}}^{\infty} {\scr P} ( N_{\rm past} \vert N_{\rm total} = n) {\scr P}_o ( N_{\rm total} = n) \cr & = {F( N_{\rm total} \ge N_{\rm past}) \over {\langle N_{\rm total} \rangle}} .}$$

Here, F is the complementary cumulative mass function. Starting from the uninformative flat log prior $P^{{\rm prior}}( {\langle N_{\rm total} \rangle }) \propto 1/ {\langle N_{\rm total} \rangle }$, applying Bayes’ theorem results in the posterior:

(5)$$P^{{\rm pos}} ( {\langle N_{\rm total} \rangle}) \propto \displaystyle {F( N_{\rm total} \ge N_{\rm past}) \over {\langle N_{\rm total} \rangle}^2} .$$

The posterior in equation (5) is strongly biased against Large models, with $P^{\rm pos}( N_{\rm total} \ge N) \approx N_{\rm past}/N$.
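A small numerical check of this scaling, assuming a family of degenerate models in which every society shares the same fixed $N_{\rm total}$; the geometric grid of candidate values, which gives the flat log prior equal weight per grid point, is an illustrative choice rather than part of the argument.

```python
import numpy as np

N_past = 1.09e11                                  # ~109 billion births so far
N_total = N_past * 2.0 ** np.arange(60)           # candidate degenerate N_total values

prior = np.ones_like(N_total)                     # flat log prior: equal weight per log-spaced point
likelihood = 1.0 / N_total                        # equation (2): P(N_past | N_total) = 1/N_total
posterior = prior * likelihood
posterior /= posterior.sum()                      # normalize as in equation (1)

for j in (10, 20, 40):
    N = N_total[j]
    tail = posterior[N_total >= N].sum()
    print(f"P(N_total >= {N:.2e}) = {tail:.2e}  vs  N_past/N = {N_past / N:.2e}")
```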

The parable's trick of excluding $N_{\rm total}$ values near $2 N_{\rm past}$ is irrelevant here. Bayesian statistics does define credible intervals containing a fraction p of posterior credence and we could draw Noonday-like intervals that include Large models but exclude a narrow range of Small models. But this is simply sleight-of-hand, using well-chosen integration bounds to hide the fundamental issue that $P^{\rm pos}( N_{\rm total} \gg N_{\rm past}) \ll 1$.

But could there be a deeper x-day Argument beyond simply choosing different credible intervals? Perhaps not in our world, but we can construct a thought experiment where there is one. Define an x-ranking as $N_x = N_{\rm past} - x N_{\rm total}$; allowed values are in the range of $[ -x N_{\rm total},\; ( 1 - x) N_{\rm total}]$. If through some quirk of physics we only knew our $N_x$, we would treat $N_x$ as a uniform random variable, calculate likelihoods proportional to $1/N_{\rm total}$ and derive a posterior of $1/N_{\rm total}^2$ for allowed values of $N_{\rm total}$ ($N_{\rm total} \ge -N_x/x$ if $N_x < 0$, or $N_{\rm total} \ge N_x/( 1 - x)$ if $N_x > 0$). Thus all Bayesian x-day Arguments rule out Large worlds, including the Bayesian Noonday Argument, even though the parable's Noonday Argument implies we should be perfectly fine with an infinite history. This is true even if x is not ‘special’ at all: most of the x-day Arguments are perfectly consistent with a location at the beginning, the middle, or the end of history – anything is better than having F ≈ x, even if the maximum likelihood is for a ‘special’ location. For example, the Bayesian Doomsday Argument, which uses our 0-ranking, predicts a maximum likelihood for us being at the very end of history. The oddity is also clear if we knew a cyclic Noonday rank $N_{1/2}^{\prime }$ that wraps from the last to the first human: the most likely $N_{\rm total}$ value would be $N_{1/2}^{\prime } + 1$, a seemingly anti-Copernican conclusion favouring us being adjacent to the centre of history.

The Bayesian x-day Arguments provide insight into the heart of the Bayesian Doomsday Argument, which has nothing to do with any particular point in history being ‘special’. What powers all of these arguments is the SSA applied with a broad reference class: the $1/N_{\rm total}$ factor in the likelihood of equation (2). According to these SSA-based arguments, the reason Large worlds are unlikely is that measuring any particular value of $N_x$ whatsoever becomes less likely as $N_{\rm total}$ increases. This is a generic property when one actually is randomly drawing from a population; it does not even depend on numerical rankings at all. Ultimately, the result follows from the Bayesian Occam's Razor effect, in which Bayesian probability punishes hypotheses with many possible outcomes (Jefferys and Berger, 1991).

The presumptuous solipsist

The SSA is doing all of the work in the x-day Arguments, but nothing restricts its applications solely to birth rankings or $N_x$ in general. In fact, the unrestricted SSA argues against Large models of all kinds. It can be used to derive ‘constraints’ on all kinds of intelligences. One example is the Cosmic Doomsday argument against the existence of interstellar societies, solely by virtue of their large populations regardless of any birthrank (Olum, 2004). If thoroughly applied, however, it would lead to shockingly strong evidence on a variety of matters, from cosmology to astrobiology to psychology (as noted by Neal, 2006; Hartle and Srednicki, 2007).

The existence of non-human observers is widely debated, and a central question in SETI. The unrestricted SSA rules out large sentient populations that are not humanlike in some way, like having a non-carbon based biology or living in the sea. The Cosmic Doomsday argument notes how ‘unlikely’ it is to be in a planetbound society if some ETIs have founded galaxy-wide societies (Olum, 2004). In fact it more generally obliterates the case for SETI even without the Doomsday Argument, for if many aliens existed in the Universe, what would the probability be of being born a human instead of any of the panoply of extraterrestrial species that are out there? This is despite the many astronomers of past centuries who believed other planets were similar to Earth on typicality grounds (Crowe, 1999). The real problem is that the unrestricted SSA rules against intelligent life everywhere else, even in the nearest $10^{100}$ Hubble volumes. Animal consciousness may be a better historical example, as we have reason to suspect many animals are indeed conscious in some way (e.g. Griffin and Speck, 2004).

The consequences extend to other fields, where they become harder to accept. Several theories of physics predict a universe with infinite extent (in space, time, or otherwise), including the open and flat universes of conventional cosmology, eternal inflation, cyclic cosmology, and the many-worlds interpretation of quantum mechanics, with a nearly endless panoply of Earths at least slightly different from ours. But according to the unrestricted SSA, the probability that a randomly selected observer would find themselves in this version of Earth might as well be zero in a sufficiently big universe. It also makes short work of the question of animal consciousness (c.f., Griffin and Speck, 2004). Isn't it strange that, of all the creatures on the Earth, you happen to be human? The SSA would be a strong argument against the typical mammal or bird being a conscious observer, to say nothing of the trillions of other animals and other organisms (Neal, 2006).

Unrestrained application of the SSA leads to a far more radical and ominous conclusion, however. Why not apply the SSA to other humans? We can do this even if we restrict our reference class to humanity and remain agnostic about aliens, multiverses and animal consciousness. Solipsism, the idea that one is the only conscious being in existence and everything else is an illusion, is an age-old speculation. Obviously, most people do not favour solipsism a priori, but it cannot actually be disproved, only ignored as untenable. Solipsism would imply that scientific investigation is pointless, however. Any method that leads to near certainty in solipsism is self-defeating, indicating a flaw in its use.

What happens when we apply the SSA to our sliver of solipsistic doubts? Let $\epsilon$ be the credence you assign to all solipsistic ideas that are constrained by the SSA. Although surely small, perhaps $10^{-9}$ being reasonable, it should not have to be zero – if only because cosmology predicts scenarios like solipsism could happen (like being a Boltzmann brain in a tiny collapsing cosmos). What are the odds that you are you, according to the SSA? The principle assigns a likelihood of $\le 1/10^{11}$ for a realist worldview; only the normalization factor in Bayes’ theorem preserves its viability. But the SSA indicates that, according to solipsism, the likelihood that an observer has your data instead of the ‘data’ of one of the hallucinatory ‘observers’ you are imagining is 1. According to Bayes’ theorem, your posterior credence in solipsism is $\epsilon /[ 1/N_{\rm past} + \epsilon ( 1 - 1/N_{\rm past}) ]$, or approximately $1 - 1/( N_{\rm past} \epsilon )$. If your prior credence in solipsism was above ~$10^{-11}$, the SSA magnifies it into a virtual certainty. In fact, the problem is arguably much worse than that: Bostrom (2013) further proposes a Strong Self-Sampling Assumption (SSSA), wherein individual conscious experiences are the fundamental unit of observations. What are the odds that you happen to make this observation at this point of your life instead of any other? Your credence in extreme solipsism, where only your current observer-moment exists, should be amplified by a factor $\gtrsim 10^{17}$, and more if one believes there is good evidence for an interstellar future, animal consciousness, alien intelligences, or the existence of a multiverse. Unless one is unduly prejudiced against it – Presumptuously invoking the absurdly small prior probabilities that are the problem in the SIA – the principle behind the Doomsday Argument impels one into believing nobody else in existence is conscious, not even your past and future selves.
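For a sense of scale, here is the arithmetic with the illustrative values quoted above ($\epsilon = 10^{-9}$, $N_{\rm past} \approx 1.09 \times 10^{11}$); the numbers are examples, not claims about anyone's actual credences.

```python
# Worked example of the solipsism posterior quoted above (illustrative values only).
eps = 1e-9            # prior credence in solipsistic ideas
N_past = 1.09e11      # ~109 billion humans born so far

post = eps / (1 / N_past + eps * (1 - 1 / N_past))
print(post)                        # ≈ 0.991
print(1 - 1 / (N_past * eps))      # ≈ 0.991, the quoted approximation
```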

One might object that ‘me, now’ is a superficial class, that everyone in the real world is just as unique and unrepresentative, so there is no surprise that you are you instead of anyone else. This is invalid according to unrestricted Copernican reasoning. You are forced to grapple with the fundamental problem that your solipsistic model has only one possible ‘outcome’ but realist models have many, because there really do exist many different people – in the same way that drawing a royal flush from a deck of cards on the first try is very strong evidence the deck is rigged, even if every possible draw is equally unlikely with a randomly shuffled deck. Indeed, we can use trivial identifying details to validly make inferences about external phenomena, as in the urn problems used as analogies for the Doomsday Argument. The x-day Arguments prevent us from saving the Doomsday Argument by appealing to the unlikelihood of being born near a ‘special time’ in the future. There exists an $N_x$ that is small for each individual in any population.

A later section will provide a way out of solipsism even if one accepts the SSA. This is to fine-grain the solipsism hypothesis by making distinctions about what the sole observer hallucinates, or which of all humans is the ‘real’ one. The distinctions between specific observers are thus important and cannot be ignored. In the context of the Doomsday Argument, we can ask what is the likelihood that the sole solipsist observer imagines themself to have a birthrank of $10^{11}$ instead of 1 or $10^{50}$. Indeed, I will argue that fine-graining is an important part of understanding the role of typicality: it imposes constraints that limit its use in Doomsday Arguments.

The anti-Copernican conclusion of the maximal Copernican Principle

The Copernican Principle is inherently unstable when adopted uncritically. Even a small perturbation to one's initial prior, a sliver of doubt about there really being 109 billion people born so far, is magnified to the point where it can completely dominate one's views about the existence of other beings. This in turn leads to epistemic instability, as everything one has learned from the external world is thrown into doubt. Although the SSA started out as a way of formalizing the Copernican Principle, it has led to what may be considered an anti-Copernican conclusion in spirit. Weaker Copernican principles suggest that you regard yourself as one of many possible minds, none favoured, just as the Earth is one of many planets and not the pivot of the Universe. But this strong formulation suggests that you are the only kind of mind, unique in all of existence. Instead of a panoply of intelligences, we get at most an endless procession of copies of you and no one else. Like the anthropic principle, the most extreme versions of the Copernican principle presumptuously tell you that you are fundamental.

Deconstructing typicality

Why is typicality invoked at all?

Typicality, the principle behind the SSA, has been invoked to explain how we can conclude anything at all in a large Universe. Since the Universe appears to be infinite, all possible observations with nonzero probability will almost surely be made by some observers somewhere in the cosmos by the anthropic principle. That is, the mere fact that there exists an observer who makes some observation has likelihood 1 in every cosmology where it is possible. Furthermore, a wide range of observations is possible in any given world model. Quantum theories grant small but nonzero probabilities for measurements that diverge wildly from the expected value: that a photometer will detect no photons if it is pointed at the Sun, for example, or that every uranium nucleus in the Earth will spontaneously decay within the next ten seconds. Most extreme are Boltzmann brains: any possible observer (for a set of physical laws) with any possible memory can be generated wherever there is a thermal bath. Thus we expect there exist observers who make any possible observation in infinite cosmologies that can sustain cognition. Without an additional principle to evaluate likelihoods, no evidence can ever favour one theory over another and science is impossible (Bostrom, 2002; Bousso et al., 2008; Srednicki and Hartle, 2010; Bostrom, 2013).

The common solution has been to include indexical information in our distributions. Indexicals are statements relating your first-person experience to the outside world. They are not meaningful for a third-person observer standing outside the world and perceiving its entire physical content and structure. The SSA, and the SIA in reply, attempt to harness indexicals to learn things about the world: they convert the first-person statement into a probabilistic objective statement about the world, by treating you as a ‘typical’ observer. Frequently, the physical distinctions between observers are left unspecified, as if they are intrinsically identical and only their environments are different.

When we make an observation, we do at least learn the indexical information that we are an observer with our data D. The idea of these arguments, then, is to construct a single joint distribution for physical theories about the third-person nature of the cosmos and indexical theories about which observer we are. Often this is implicit – rather than having specified theories about which observer we are, indexical information is evaluated with an indexical likelihood, the fraction of observers in our reference class with data D. While all possible observations are consistent with a theory in an infinite Universe, most observations will be clustered around a typical value that indicates the true cosmology (Bostrom, 2013). Thus the indexical likelihoods result in us favouring models that predict most observers have data similar to ours over those where our observations are a fluke.

Typicality is usually fine for most actual cosmological observations, but it yields problematic conclusions when attempting to choose between theories with different population sizes. These problems will motivate the development of the Fine Graining with Auxiliary Indexicals (FGAI) approach to typicality over the next sections.

Sleeping Beauty as three different thought experiments

The core of the Doomsday Argument, and many similar ‘Copernican’ arguments, can be modelled with a thought experiment known as the Sleeping Beauty problem. Imagine that you are participating in an experiment in which you wake up in a room either on just Monday or on both Monday and Tuesday. Each day your memory of any previous days has been wiped, so you have no sense of which day it is. Now, suppose you knew that the experiment was either Short, lasting for just Monday, or Long, lasting for Monday and Tuesday. In the original formulation of the thought experiment (SB-O), the experimenters flip a coin, running the Short version if it came up Heads and the Long version if it came up Tails (Elga, 2000). You wake up in the room, not knowing how the coin landed, ignorant of whether you are in a Short run or a Long run. What probability should you assign to the possibility that the coin landed Heads and the run is Short, 1/2 or 1/3 (Fig. 1)?

Fig. 1. The Sleeping Beauty variants SB-A, SB-B, and SB-C, illustrating how different theories of typicality handle Bayesian credence, before and after learning it is Monday (m) instead of Tuesday (t). Ruled out hypotheses are coloured in black and do not count towards the normalization. The SSA, with or without the SIA, leads to presumptuous conclusions in SB-B. In FGAI, indexical and physical distributions are not mixed. Instead, there is an overarching physical distribution, and each model has an associated indexical probability distribution (indicated by the arrows).

According to the ‘halfer’ camp, it is obvious that you have absolutely no basis to choose between Short and Long because the coin is fair, and that you should have an uninformative prior assigning weight 1/2 to each possibility.

The ‘thirder’ camp instead argues that it is obvious that you are more likely to wake up on any particular day in the Long experiment. That is, there are three possibilities – Short and Monday, Long and Monday, and Long and Tuesday – so the odds of Short and Monday are 1/3. Treating these hypothetical days with equal weight is not as facetious as it sounds: if the experiment was run a vast number of times with random lengths, 1/3 of all awakenings really would be in Short variations. One could even construct bets that favour odds of 1/3.

And what if the experimenters then tell you it is Monday? In the Short theory, there is a 100% chance that today is Monday, but in the Long theory, there is just a 50% chance. If you initially adopted 1/2 as the probability of Short, you now favour Short with a credence of 2/3. In fact, this is the basic principle of the Doomsday Argument. On the other hand, if you initially adopted 1/3 as the probability of Short, you now have even credences in Short and Long. And indeed, if many such experiments were being run, half of the Monday awakenings occur in the Long runs. Still, this leads to the odd situation where we start out fairly confident that we are in a Long experiment. It devolves into the Presumptuous Philosopher problem, forcing us to start out virtually certain that we live in a universe with many inhabitants. Just as in the Doomsday Argument, we are drawing confident conclusions about the Universe without actually looking at anything but ourselves.
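The two updates can be checked directly; a minimal sketch, with the halfer and thirder priors hard-coded as above:

```python
# Halfer vs thirder credences in "Short", before and after learning it is Monday.
def update(prior, likelihood):
    weights = {k: prior[k] * likelihood[k] for k in prior}
    norm = sum(weights.values())
    return {k: w / norm for k, w in weights.items()}

like_monday = {"Short": 1.0, "Long": 0.5}     # chance that today is Monday under each theory

halfer = {"Short": 0.5, "Long": 0.5}
thirder = {"Short": 1 / 3, "Long": 2 / 3}

print(update(halfer, like_monday))    # {'Short': 0.667, 'Long': 0.333}
print(update(thirder, like_monday))   # {'Short': 0.5, 'Long': 0.5}
```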

The conventional Sleeping Beauty thought experiment has been used to compare the SSA and competing principles (Neal, 2006), but it conceals some very different situations. This is why it is not immediately obvious whether the probability is 1/2 or 1/3. The following versions of the experiment clarify this distinction (see Fig. 1):

  • (SB-A) You know that the experiments proceed with a Short run followed by a Long run of the experiment, and you are participating in both. Today you wake up in the room with no memories of yesterday. With what probability is today the day you awaken during the Short run?

  • (SB-B) The experimenters have decided, through some unknown deterministic process, to run only either the Short or the Long version of the experiment. You have absolutely no idea which one they have decided upon. Today you wake up in the room with no memories of yesterday. What credence should you assign to the belief that the experimenters have chosen to do the Short run?

  • (SB-C) The experimenters have decided, through some unknown deterministic process, to run only either the Short or a variant of the Long version of the experiment. In the modified Long experiment, they run the experiment on both Monday and Tuesday, but only wake you on one of the two, chosen by another unknown deterministic process. Today you wake up in the room. What credence should you assign to the belief that the experimenters have chosen to do the Short run?

In the Doomsday Argument, we essentially are in SB-B: we know that we are ‘early’ in the possible history and want to know if we can conclude anything about conscious observers at ‘later’ times. Invocations of typicality then presume a similarity of either SB-A or SB-C to SB-B. Yet these analogies are deeply flawed. Both SB-A and SB-C have obvious uninformative priors yielding the same result with or without the SIA, but they point to different resolutions of the Sleeping Beauty problem: 1/3 for SB-A and 1/2 for SB-C. Thus thought experiments lead to ambiguous conclusions (for example, Leslie's (1996) ‘emerald’ thought experiment motivates typicality by noting that we should a priori consider it more likely we are in the ‘Long’ group in a situation like SB-A because they are more typical, but this could be viewed as support for the SIA).

SB-A and SB-B leave us with indexical uncertainty. In SB-A, this is the only uncertainty, with all relevant objective facts of the world known with complete certainty. Because only an indexical is at stake, there can be no Presumptuous Philosopher problem in SB-A – you are already absolutely certain of the ‘cosmology’. But although it has been reduced to triviality in SB-A, there is actually a second set of credences for the third-person physical facts of this world: our 100% credence in this cosmology, with both a Short and Long run. SB-B instead posits that you are not just trying to figure out where you are in the world, but the nature of the world in the first place.

SB-C and SB-B leave us with objective uncertainty about the physical nature of the world itself. The objective frequentist probability that you are in the Short run with SB-B is neither 1/2 nor 1/3 – it is either 0 or 1. Instead the Bayesian prior is solely an internal one, used by you to weigh the relative merits of different theories of the cosmology of the experiment. Thus, it makes no sense to start out implicitly biased against the Short run, so the probability 1/2 is more appropriate for a Bayesian distribution. The big difference between SB-B and SB-C is that there are two possible physical outcomes in SB-C's Long variant, but only one in SB-B's. If we knew whether the experiment was Short or Long, we could predict with 100% certainty what each day's observer will measure in SB-B. But ‘a participant is awakened on Monday’ is not a determined outcome in SB-C's Long theory.

Separating indexical and physical facts

According to typicality arguments, indexical and physical propositions can be mixed, but in this paper I regard them as fundamentally different. Indexicals are like statements about coordinate systems: they can be centred at any arbitrary location, but that freedom does not fundamentally change the way the Universe objectively works. It follows that purely indexical facts cannot directly constrain purely physical world models. The Presumptuous Philosopher paradoxes are a result of trying to force indexical data into working like physical data.

Since Bayesian updating does work in SB-A and SB-C, we can suppose there are in fact two types of credence distributions, physical and indexical. In FGAI there is an overarching physical distribution describing our credences in physical world models. Attached to each physical hypothesis is an indexical distribution (Fig. 1). Each indexical distribution is updated in response to indexical information (c.f., Srednicki and Hartle, 2010). For example, in the Sleeping Beauty thought experiment, when the experimenter announces it is the first day of the experiment, you learn both a physical fact (‘a participant wakes up on Monday’) and an indexical fact (‘I'm the me waking on Monday’). The physical distribution is insulated from changes in the indexical distribution, protecting it from the extremely small probabilities in both the SSA and SIA. Thus, in SB-B, learning ‘today is Monday’ only affects the indexical distribution, invalidating a Doomsday-like argument. Within the context of a particular world-model, one may apply typicality assumptions like the SSA/SIA to the indexical distribution.

A fourth variant of the Sleeping Beauty thought experiment will complicate this attempted reconciliation:

  • (SB-D) The experimenters have decided, through some unknown deterministic process, to run only either a modified Short or the standard (SB-B) Long version of the experiment. This version of the Short experiment is identical to the Long variant in SB-C: the experimenters wake you up on one of Monday or Tuesday, chosen through an unknown process. Today you wake up in the room. What credence should you assign to the belief that the experimenters have chosen to do the Short run?

Both the SSA and SIA yield the natural result of equal credences in Short and Long before and after learning it is Monday, while a simple separation of indexical and physical facts favours Long (Fig. 2). This situation is distinct from the Doomsday Argument because it is not possible for only high $N_x$ humans to exist without the low $N_x$ ones. An attempt to address this issue will be made in the section on Weighted Fine Graining (WFG).

Fig. 2. SB-D is a variant of Sleeping Beauty that is challenging for theories with separate indexical and physical distributions. More outcomes are instantiated in the Long theory than in either Short microhypothesis, thus seemingly favouring the Long theory unless the SSA is applied.

The astrobiological relevance of Sleeping Beauty

The variants of the Sleeping Beauty thought experiment are models of ‘Copernican’ arguments about questions in astrobiology. In all these cases, we are interested in the existence and nature of beings who are unlike us in some way. In these analogies, humanity might be likened to a ‘Monday’ observer, and we are considering ‘Tuesday’ beings like those in the clouds of Jupiter, around red dwarfs, or in different types of galaxies or different cosmological epochs. With that analogy in mind, consider these hypothetical scenarios in astrobiology:

  • (AB-A) We start out already knowing that the Milky Way disk, Milky Way bulge, and M33 disk are inhabited (with M33 lacking a major bulge), and that the number of inhabitants in these three regions is similar. We know we are in one of these three regions, but not which one. Are we more likely to be in the Milky Way or M33? Are we more likely to be in a spiral disk or a bulge? What if we learn we are in a disk?

  • (AB-B) We start out knowing that we live in the Milky Way's disk. We come up with two theories: in theory S, intelligence only evolves in galactic disks, but in theory L, intelligence evolves in equal quantities in galactic disks and bulges. Does the Copernican principle let us favour theory S?

  • (AB-C) We have two theories: in theory S, intelligence only evolves in galactic disks. In theory L, intelligence either evolves only in galactic disks or only in galactic bulges but not both, with equal credence in either hypothesis. We then discover we live in a galactic disk. Do we favour theory S or theory L?

  • (AB-D) We do not know our galactic environment because of our limited observations, but from observations of external galaxies we suspect that galactic disks and bulges are possible habitats for ETIs. Theory S is divided into two hypotheses: in $S_1$, intelligence only evolves in galactic disks and in $S_2$, it evolves only in galactic bulges. Theory L proposes that intelligence evolves in both disks and bulges. How should we apportion our credence in $S_1$, $S_2$, and L? Once we learn the Earth is in the galactic disk, how do our beliefs change?

Put this way, it is clear that neither AB-A nor AB-C is like our current astrobiological questions, and thus neither are SB-A nor SB-C. In AB-A, we already have an answer to the question of whether galactic bulges are inhabited; we are merely uncertain of our address! AB-C, meanwhile, is frankly bizarre: we are seemingly convinced that intelligence cannot evolve in both environments, as if the mere existence of intelligence in galactic bulges always prevents it from evolving in galactic disks. Since we start out knowing there is life on Earth and we wonder if there is life in non-Earthly environments, clearly AB-B – and SB-B – is the better model of astrobiology.

AB-D is an interesting case, though, and it too has relevance for astrobiology. In the 18th century, it was not clear whether Earth was in the centre of the Milky Way or near its edge, but the existence of ETIs was already a well-known question (Crowe, 1999). AB-D really did describe our state of knowledge about different regions of the Milky Way being inhabited back then. In other cases, the history is more like AB-B; we discovered red dwarfs and elliptical galaxies relatively late. As the distinction between AB-B and AB-D basically comes down to the order of discoveries in astronomy, it suggests AB-B and AB-D should give similar results in a theory of typicality, with the same true for SB-B and SB-D.

The frequentist limit and microhypotheses

Finally, it is worth noting that the frequentist 1/3 probability slips back in for a many-run variant:

  • (SB-Bn) You know for certain that the experiment is being run n times, where n ≫ 1. Whether a given run is Short or Long is determined through some deterministic but pseudo-random process, such that any possible sequence of Shorts and Longs is equally credible from your point of view. Each day, you wake up with an identical psychological state. Today you wake up in the room. What credence should you assign to the belief that today is happening during a Short run (c.f., the ‘Three Thousand Weeks’ thought experiment of Bostrom, 2007)?

There are now $2^n$ competing physical hypotheses, one for each possible sequence of Shorts and Longs. In most of these hypotheses, however, about half of the runs are Short and half are Long, and only about one-third of the observers are in the Short runs through simple combinatorics (Fig. 3). Thus, as n tends to infinity, the coarse-grained physical distribution converges to one like SB-A, with unbalanced Short/Long runs being a small outlier. We can then say that, in any likely scenario, the probability we are in a Short run is ~1/3.

Fig. 3. If the Sleeping Beauty experiment is run many times, with each possible sequence of Long and Short a priori equally likely, a vast number of microhypotheses about the sequence of Short and Long is generated. Shown here are the 64 microhypotheses when there are n = 6 runs. Attached to every single one of these fine models is an indexical distribution.
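As a sanity check on the combinatorics, one can enumerate the $2^n$ microhypotheses for a small n directly; the sketch below assumes each sequence is equally weighted, as in SB-Bn.

```python
from itertools import product

# Enumerate the 2^n equally weighted microhypotheses of SB-Bn and compute the
# mean fraction of awakenings that fall in Short runs.
n = 6
fractions = []
for seq in product("SL", repeat=n):                  # 64 microhypotheses for n = 6
    short_awakenings = seq.count("S")                # one awakening per Short run
    total_awakenings = short_awakenings + 2 * seq.count("L")
    fractions.append(short_awakenings / total_awakenings)

print(len(fractions))                                # 64
print(sum(fractions) / len(fractions))               # ≈ 0.36 for n = 6, tending to 1/3
```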

SB-Bn calls to mind statistical mechanics, where a vast number of physical microstates are grouped into a small number of distinguishable macrostates. By analogy, I call each possible detailed world model a microhypothesis; these are then grouped into macrotheories defined by statistical properties.

Replacing typicality

The fine-grained approach to typicality

How are we to make inferences in a large universe, then, without directly mixing indexicals into the credence distribution? I propose that most of the work performed by typicality can instead be performed by fine-graining. Fine-graining of physical theories is the first principle of FGAI.

The common practice is to treat observers in a fairly large reference class as interchangeable when discussing their observations, but this is merely a convenience. In fact, we can make fine distinctions between observers – between Earth and an inhabited planet in Hubble volume # 239,921, for example, or between me and you, or even between you now and you last Thursday. Macrotheories often cannot predict exactly which specific observer measures a particular datum. Thus, every theory is resolved into myriads of microhypotheses, each of which does make these predictions. Because the distinctions between these observers are physical, statements about a specific observer making a particular measurement are evaluated as purely physical propositions, without invoking indexicals. Some microhypotheses will be consistent with the data, others will not be. The resulting credences in the macrotheories are entirely determined by summing the posterior weight over all microhypotheses, in many cases through simple counting arguments. Typicality then follows from the likelihood values of the microhypotheses – as it indeed does in conventional probability, where specific ‘special’ events like getting a royal flush from a randomly shuffled deck are no more rare than specific mundane outcomes (Laplace, Reference Laplace1902). Thus, in most cases, there is no need to invoke any separate Copernican principle, because it is a demonstrable consequence of our theories.

The other main precept of FGAI is that purely indexical facts do not directly affect third-person propositions about the physical world; rather, they modify the indexical distributions attached to each world model. Observations must be treated as physical third-person events when constraining the physical distribution. Statements like ‘I picked ball 3 out of the urn’ must be recast into third-person statements like ‘Brian Lacki picked ball 3 out of the urn’. Each microhypothesis requires an observation model, a list of possible observations that each observer may make. Observation models necessarily impose physical constraints on which observations can be made by whom, forbidding impossible observations like ‘Hypatia of Alexandria observed that her peer was Cyborg 550-319447 of Gliese 710’ from being considered as possible outcomes.

The fine-graining is most straightforward when every microhypothesis predicts that all physically indistinguishable experiments lead to the same outcome. This follows when we expect conditionalization to entirely restrict possible observations. For example, the Milky Way could be the product of an indeterministic quantum fluctuation in the Big Bang, but copies of our Earth with its data (e.g. photographs of the Milky Way from inside) do not appear in elliptical galaxies except through inconceivably contrived series of coincidences. More difficult are purely indeterministic cases, when any specific observer can observe any outcome, which is true for most quantum experiments. I will argue that even then we can form microhypotheses by assuming the existence of an appropriate coordinate system.

A more serious difficulty is what to do when different plausible indexical hypotheses would lead to different likelihood evaluations for microhypotheses. That is, we may not know enough about where we are to determine whether a microhypothesis predicts an observation or not. In this section, I will adopt the perspective that I call Naive FGAI: we adopt the maximum possible likelihood over all observers we could be.

Naive FGAI

FGAI constructs the physical probability distributions using the Hierarchical Bayes framework, dividing theories into finer hypotheses about internal parameters, possibly with intermediate levels. Suppose we have M macrotheories $\Theta_1, \Theta_2, \ldots, \Theta_M$. Each macrotheory $\Theta_k$ has $m_k$ microhypotheses $\mu_{k,1}, \mu_{k,2}, \ldots, \mu_{k,m_k}$. Each microhypothesis inherits some portion of its parent macrotheory's credence or weight. Sometimes, when the microhypotheses correspond to exact configurations resulting from a known probabilistic process (e.g. flips of an unfair coin), the prior credence $P_{k,j}^{\rm prior}$ in each can be calculated by scaling the macrotheory's total prior probability accordingly. In other cases, we have no reason to favour one microhypothesis over another, and by the Principle of Indifference, we assign each microhypothesis in a macrotheory equal prior probability: $P_{k, j}^{{\rm prior}} = P_k^{{\rm prior}} / m_k$. Some macrotheories are instead naturally split into mesohypotheses describing intermediate-level parameters, which in turn are fine-grained further into microhypotheses. Mesohypotheses are natural when different values of these intermediate-level parameters result in differing numbers of outcomes – as when a first coin flip determines the number of further coin flips whose results are reported. Finally, in each $\mu_{k,j}$, we might be found at any of a number of locations. More properly, as observers we follow trajectories through time, making a sequence of observations at particular locations as we change in response to new data (c.f., Bostrom, Reference Bostrom2007). The set ${\cal O}_{k, j} ( D)$ is the set of possible observer-trajectories we could be following that are allowed by the microhypothesis $\mu_{k,j}$ and the data D. In Naive FGAI, the set ${\cal O}_{k, j}$ describes our reference class if $\mu_{k,j}$ is true.

In Naive FGAI, prior credences $P_{k, j}^{{\rm prior}}$ in μ k,j are updated by data D according to:

(6)$$P_{k, j}^{{\rm pos}} = {\displaystyle P_{k, j}^{{\rm prior}} \times \max\nolimits_{i \in {\cal O}_{k, j} ( D) } {\scr P}( o @ i \to D \vert \mu_{k, j}) \over \displaystyle \sum\nolimits_{x = 1}^M \sum\nolimits_{y = 1}^{m_x} P_{x, y}^{{\rm prior}} \times \max\nolimits_{z \in {\cal O}_{x, y} ( D) } {\scr P}( o @ z \to D \vert \mu_{x, y}) },\; $$

where ${\scr P}( o @ i \to D \vert \mu _{k, j})$ is the likelihood of $\mu_{k,j}$ if the observer (o) located at position i in ${\cal O}_{k, j} ( D)$ observes data D. Of course, if the likelihoods are equal for all observers in the reference class ${\cal O}_{k, j} ( D)$, equation (6) reduces to Bayes’ formula. The posterior credences in the macrotheories can be found simply as:

(7)$$P_k^{{\rm pos}} = \sum_{\,j = 1}^{m_k} P_{k, j}^{{\rm pos}} .$$

Naive FGAI is sufficient to account for many cases where typicality is invoked. Paradoxes arise when the number of observers itself is in question, as in Doomsday, requiring a more sophisticated treatment.
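For concreteness, equations (6) and (7) can be written as a small sketch in Python. The data structures and names below are illustrative only, and the example at the end anticipates the single-drawer urn experiment of the next subsection, recovering the 10/11 versus 1/11 split:

```python
def naive_fgai_update(priors, likelihoods):
    """Sketch of the Naive FGAI update, equations (6) and (7).
    priors:      {(macrotheory, micro): prior credence P^prior_{k,j}}
    likelihoods: {(macrotheory, micro): list of likelihoods of the data, one per
                  observer-trajectory we could be following in that microhypothesis}
    A microhypothesis is assigned the *maximum* likelihood over the observers we
    could be; macrotheory credences are sums over their microhypotheses."""
    unnorm = {key: priors[key] * max(likelihoods[key], default=0.0) for key in priors}
    total = sum(unnorm.values())
    post_micro = {key: value / total for key, value in unnorm.items()}
    post_theory = {}
    for (theory, _), value in post_micro.items():
        post_theory[theory] = post_theory.get(theory, 0.0) + value
    return post_micro, post_theory

# The one-ball versus ten-ball urn of the next subsection, after drawing ball 1:
priors = {('A', 1): 0.5}
priors.update({('B', j): 0.05 for j in range(1, 11)})
likelihoods = {('A', 1): [1.0]}
likelihoods.update({('B', j): [1.0 if j == 1 else 0.0] for j in range(1, 11)})
print(naive_fgai_update(priors, likelihoods)[1])   # {'A': 0.909..., 'B': 0.0909...}
```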

A simple urn experiment

In some cases, microhypotheses and observation models are nearly trivial. Consider the following urn problem: you are drawing a ball from an urn placed before you that contains a well-mixed collection of balls, numbered sequentially starting from 1. You know the urn contains either one ball (theory A) or ten (theory B), and start with equal credence in each theory. How does drawing a ball and observing its number constrain these theories? Both theory A and theory B have microhypotheses of the form ‘The urn contains N balls and ball j is drawn at the time of experiment’. In theory A, there is only one microhypothesis, which inherits the full 50% of Theory A's prior credence. Theory B has 10 microhypotheses, one for each possible draw and each of equal credence, so its microhypotheses start with 5% credence each. Each microhypothesis about drawing ball j has an observation model containing the proposition that you observe exactly ball j – this observation is a physical event, since you are a physical being.

Then the likelihood of an observed draw is either 1 (if the ball drawn is the one predicted in the microhypothesis) or 0 (if it is not). If we draw ball 3, for example, the credence in all hypotheses except Theory B's ‘Ball 3 is drawn’ microhypothesis is zero. Then the remaining microhypothesis has 100% credence, and Theory B has 100% credence as well. If instead ball 1 is drawn, Theory A's sole microhypothesis and one of Theory B's microhypotheses survive unscathed, while the other nine microhypotheses of Theory B are completely suppressed. That is, 100% of Theory A's credence survives, while only 10% of Theory B's credence remains; therefore, post-observation, the credence in Theory A is 10/11 and the credence in Theory B is 1/11. This, of course, matches the usual expectation for the thought experiment.

But what if instead you and 99 other attendees at a cosmology conference were drawing from the urn with replacement and you were prevented from telling each other your results? If we regard all one hundred participants as exactly identical observers, the only distinct microhypotheses seem to be the frequency distributions of each ball j being drawn. It would seem that there could be no significant update to Theory B's credence if you draw ball 1: all you know is that you are a participant and that there exists a participant who draws ball 1, which is nearly certain to be true. Inference would seem to require something like the SSA. Yet this is not necessary in practice, because if this experiment were carried out at an actual cosmology conference, the participants would be distinguishable. We then can fine-grain Theory B further by listing each attendee by name and specifying which ball they draw in each microhypothesis, and forming an observation model where names are matched to drawn balls. For example, we could order the attendees alphabetically, and each microhypothesis would be a 100-vector of integers from 1 to 10.

With fine-graining, there is only $1^{100} = 1$ microhypothesis in Theory A – all participants draw ball 1 – but $10^{100}$ microhypotheses in Theory B. In only $10^{99}$ of B's microhypotheses do you specifically draw ball 1 and observe ball 1. Thus only 10% of Theory B's microhypotheses survive your observation that you drew ball 1. The credence in Theory B is again 1/11, but is derived without appealing to typicality. Instead, typicality follows from the combinatorics. The original one-participant version of this thought experiment can be regarded as a coarse-graining of this 100-participant version, after marginalizing over the unknown observations of the other participants.
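A scaled-down version of this counting argument can be checked by brute force, here assuming three distinguishable attendees instead of one hundred (the variable names are illustrative):

```python
from itertools import product

n_attendees = 3     # scaled down from 100 so the microhypotheses can be enumerated
you = 0             # your index in the alphabetical ordering of attendees

# Each microhypothesis is a vector giving the ball drawn by each named attendee.
micro_A = list(product([1], repeat=n_attendees))            # 1**3 = 1 microhypothesis
micro_B = list(product(range(1, 11), repeat=n_attendees))   # 10**3 microhypotheses

# Third-person datum: 'attendee number <you> drew ball 1'.
frac_A = sum(m[you] == 1 for m in micro_A) / len(micro_A)   # 1.0
frac_B = sum(m[you] == 1 for m in micro_B) / len(micro_B)   # 0.1

# Each theory starts with credence 1/2, split equally among its microhypotheses.
credence_A, credence_B = 0.5 * frac_A, 0.5 * frac_B
total = credence_A + credence_B
print(credence_A / total, credence_B / total)               # 10/11 and 1/11
```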

Implicit microhypotheses: A thought experiment about life on Proxima b

In other cases, the microhypotheses can be treated as abstract, implicit entities in a theory. A theory may predict an outcome has some probability, but provide no further insight into which situations actually lead to the outcome. This happens frequently when we are actually trying to constrain the value of a parameter in some overarching theory that describes the workings of unknown physics. Not only do we not know their values, we have no adequate theories to explicitly predict them. Historical examples include the terms of the Drake equation and basic cosmological parameters like the Hubble constant. Yet these parameters are subject to sampling variance; some observers in a big enough Universe should deduce unusual values far from their expectation values. Naive FGAI can be adapted for such cases by positing that there are implicit microhypotheses that we cannot specify yet. The probability that we observe an outcome is then treated as indicating the fraction of microhypotheses in which that outcome occurs.

Suppose we have two models about the origin of life, L-A and L-B, both equally plausible a priori. L-A predicts that all habitable planets around red dwarfs have life. L-B predicts that the probability that a habitable zone planet around a red dwarf has life is $10^{-100}$.Footnote 8 Despite this, we will suppose that the conditions required for life on the nearest potentially habitable exoplanet Proxima b are not independent of our existence on Earth, and that any copy of us in a large Universe will observe the same result. The butterfly effect could impose this conditionalization – small perturbations induced by or correlated with (un)favourable conditions on Proxima b may have triggered some improbable event on Earth necessary for our evolution.

We wish to constrain L-A and L-B by observing the nearest habitable exoplanet, Proxima b, and we discover that Proxima b does have life on it. The measurement is known to be perfectly reliable. Copernican reasoning suggests that in L-B, only one in $10^{100}$ inhabited G dwarf planets would observe life around the nearest red dwarf, and that, as typical observers, we should assign a likelihood of $10^{-100}$ to L-B. Thus, L-B is essentially ruled out.

L-B does not directly specify which properties of a red dwarf are necessary for life on its planets; it merely implies that the life is the result of some unknown but improbable confluence of properties. Nonetheless, we can interpret L-B as grouping red dwarfs into $10^{100}$ equivalence classes, based on stellar and planetary characteristics. Proxima Centauri would be a member of only one of these. L-B then would assert that only one of the $10^{100}$ equivalence classes bears life. Thus, L-B actually implicitly represents $10^{100}$ microhypotheses, each one an implicit statement about which equivalence class is the one that hosts life (Fig. 4). In contrast, L-A has only one microhypothesis since the equivalence class contains all red dwarfs. We then proceed with the calculation as if these microhypotheses were known.

Fig. 4. Extremely simplified representation of how FGAI treats theories like L-B. Probabilities of observing a particular outcome in a macrotheory may be presumed to result from some unknown fine-scale division of parameter space into regions delineating different equivalence classes. Then microhypotheses would be constructed by considering all possible outcomes for all regions. For L-B, we observe the red dwarf nearest to Earth (Proxima $\oplus$) and see whether it is inhabited (flower symbol). The observation of an inhabited Proxima $\oplus$ reduces the number of allowed microhypotheses. Left: observers on other planets distinguishable from Earth would observe red dwarfs with different properties (Proxima X, Y, and Z) and probe different classes. Right: many identical Earths observe distinct Proxima $\oplus$ (red stars). These are treated by assuming there is some indexing that allows microhypotheses to be constructed, and their likelihoods calculated by symmetry.

If we observe life on Proxima b, then the sole microhypothesis of L-A survives unscathed, but implicitly only one microhypothesis of L-B of the $10^{100}$ would survive. Thus, after the observation, L-A has a posterior credence of $100\% /( 1 + 10^{-100})$, while L-B has a posterior credence of $10^{-100}/(1 + 10^{-100})$. As we would hope, FGAI predicts that we would be virtually certain that L-A is correct, which is the result we would expect if we assumed we observed a ‘typical’ red dwarf.
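The arithmetic can be carried out exactly by treating the implicit microhypotheses purely as counts; the following minimal sketch (with exact rational arithmetic) simply reproduces the numbers above:

```python
from fractions import Fraction

n_classes = 10**100               # equivalence classes of red dwarfs implied by L-B
prior_LA = prior_LB = Fraction(1, 2)

# Implicit microhypothesis counting: all of L-A's single microhypothesis survives the
# observation of life on Proxima b; only 1 of L-B's n_classes microhypotheses does.
unnorm_LA = prior_LA * 1
unnorm_LB = prior_LB * Fraction(1, n_classes)
post_LB = unnorm_LB / (unnorm_LA + unnorm_LB)
print(post_LB == Fraction(1, n_classes + 1))   # True: L-B is essentially ruled out
```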

What of the other inhabited G dwarf planets in an infinite Universe? The nearest red dwarfs to these will have different characteristics and most of them will belong to different equivalence classes (Fig. 4). In principle, we could construct microhypotheses that specify what each of these types of observers will observe around their nearest red dwarf, and implicitly we assume such microhypotheses exist. If we failed to develop the capability to determine whether Proxima b has life but trustworthy aliens from 18 Scorpii broadcast to us that their nearest red dwarf has life, the result would be the same.

Fine-graining and implicit coordinate systems

The strictest interpretation of the separation of physical and indexical facts is that we cannot constrain physical models if the observed outcome happens to any observer physically indistinguishable from us. This is untenable, at least in a large enough universe – quantum mechanics predicts that all non-zero probability outcomes will happen to our ‘copies’ in a large universe. But this would mean no measurement of a quantum mechanical parameter can be constraining. Surely if we do not observe any radiodecays in a gram of material over a century, we should be able to conclude that its half-life is more than a nanosecond, even though a falsely stable sample will be observed by some copy of us out there in the infinite universe.Footnote 9

Strict indeterminism is not necessary for this to be a problem, either. In the last section, we might have supposed that the existence of life on Proxima b depends on its exact physical microstate five billion years ago, and that these microstates are scattered in phase space. Yet, Proxima b is more massive than the Earth and has a vaster number of microstates – by the pigeonhole principle, in an infinite Universe, most Earths exactly identical to ours would neighbour a Proxima b that had a different microstate, opening the possibility of varying outcomes (Fig. 4).

But the separation of indexicals and physical theories need not be so strict. Indexicals are regarded in FGAI as propositions about coordinate systems. In physical theories we can and do use coordinate systems, sometimes arbitrary ones, as long as we do not ascribe undue objective significance to them. We can treat these situations by imposing an implicit indexing in the same spirit. Thus, in an infinite Universe, we label ‘our’ Earth as Earth 1. The next closest Earth is Earth 2, the third closest Earth is Earth 3, and so on. We might in fact only implicitly use a coordinate system, designating our Earth as Earth 1 without knowing details of all the rest. Our observations then are translated into third-person propositions about observations of Earth 1. The definition of the coordinate system imposes an indexical distribution where we must be on Earth 1. Of course, this particular labelling is arbitrary, but that hardly matters because we would reach the same conclusions if we permuted the labels. If we came into contact with some other Earth that told us that ‘our’ Earth was Earth 3,296 in their coordinate system, that would not change our credence in a theory.

In a Large world, then, the microhypotheses consist of an array listing the outcome observed by each of these implicitly indexed observers. Only those microhypotheses where Earth (or observer) 1 has a matching observation survive. If the observed outcome contradicts the outcome assigned to Earth 1 by the microhypothesis, we cannot then decide we might actually be on Earth 492,155 in that microhypothesis because Earth 492,155 does observe that outcome. The coordinate system's definition has already imposed the indexical distribution on us.

There are limits to the uses of implicit coordinate systems, however, if we wish to avoid the usual Doomsday argument and its descent into solipsism. In SB-B, could we not assign an implicit coordinate system with today at index 1, and the possible other day of the experiment at index 2? A simplistic interpretation would then carve the Long run theory into two microhypotheses about which day has index 1, and learning ‘today is Monday’ would lead us to favour the Short theory.

There is a very important difference between SB-B and the previous thought experiments. In those, our physical theory did not in any degree predict which observers get a particular outcome – the likelihood distribution for the outcomes is exactly identical for all observers. This is why we are able to assign likelihoods even though the mapping between our implicit coordinate system and some external coordinate system is unknown. In SB-B, though, our physical theory does predict which observer gets a particular outcome. The likelihood of observing ‘it is Monday’ is not identical for each day. Instead, ‘it is Monday’ needs to be interpreted as an indexical fact. We can indeed create an implicit coordinate system with today at index 1 and the other day at index 2. But we cannot calculate likelihoods in this coordinate system – the referents of the indices in the theory are unknown, and until we can connect the implicit coordinates with the coordinate system used by the Long theory, we cannot update credences either. All we can say is that if observer 1 is located on Monday, they observe ‘it is Monday’ with certainty; if observer 1 is located on Tuesday, they observe ‘it is Tuesday’ with certainty. In these kinds of situations, a more sophisticated theory is needed.

Reconstructing typicality

The indication paradox in Naive FGAI

Naive FGAI, supplemented by the use of implicit microhypotheses and implicit coordinate systems, is sufficient to handle many practical cases where typicality is invoked. In these cases, however, it has been possible to calculate a single likelihood for each result because of an underlying symmetry in the problem.

In other cases, however, Naive FGAI leads to a bizarre SIA-like effect that always favours Large models. This thought experiment demonstrates the unwanted effect of an irrelevant detail:

  • (SB-B′) You wake up in the room. The experimenters tell you they are running SB-B, but they will also flip a fair coin each day the experiment is run and tell you the result. But a team member gains your trust and tells you they are lying! The other experimenters have either adopted Strategy A or Strategy B. In Strategy A, they tell you the coin lands Heads on Monday, and if you awaken on Tuesday, they will tell you it landed Tails. In Strategy B, the supposed outcomes are opposite Strategy A's. Suspecting your new friend's duplicity, the other experimenters exclude them from the decision of which Strategy to go with. Now the experimenters inform you the coin landed Heads. How does that affect your credence in a Short or Long run?

It is obvious the coin flip ‘result’ should not affect our beliefs in a Short or Long experiment, because of the symmetry between the coin flip outcomes. Yet in Naive FGAI, either coin flip ‘result’ leads us to favour a Long experiment (Fig. 5)! This is because either outcome is compatible with at least one of your wakenings in Long, but with only one of the two Short microhypotheses. The indication paradox is that any datum about the false coin flip favours the Long run.
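The paradox is easy to exhibit numerically. The sketch below applies the Naive FGAI rule – take the maximum likelihood over the awakenings one might be experiencing – to the four microhypotheses of SB-B′; the labels and data structures are illustrative choices of mine:

```python
# Microhypotheses of SB-B': (run length, strategy).  Under Strategy A the experimenters
# report Heads on Monday and Tails on Tuesday; under Strategy B the reverse.
reports = {
    ('Short', 'A'): ('Heads',),            # Monday awakening only
    ('Short', 'B'): ('Tails',),
    ('Long', 'A'):  ('Heads', 'Tails'),    # Monday, Tuesday awakenings
    ('Long', 'B'):  ('Tails', 'Heads'),
}
prior = {m: 0.25 for m in reports}

def naive_posterior(datum):
    """Naive FGAI: each microhypothesis gets the maximum likelihood over the awakenings
    we might be experiencing, then the credences are renormalized."""
    unnorm = {m: prior[m] * max(1.0 if r == datum else 0.0 for r in reports[m])
              for m in reports}
    total = sum(unnorm.values())
    return {m: v / total for m, v in unnorm.items()}

for datum in ('Heads', 'Tails'):
    post = naive_posterior(datum)
    p_long = post[('Long', 'A')] + post[('Long', 'B')]
    print(f"told {datum}: P(Long) = {p_long:.3f}")   # 0.667 either way
```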

Fig. 5. The indication paradox of SB-B′, and how it is treated with different modifications of FGAI. Each box, representing the credence in a microhypothesis, lists the observed outcomes, ordered by index in the used coordinate system. An underlined, bold outcome is treated as the one the observer measures for the purposes of calculating likelihoods. In the Weighted Fine Graining treatment, the normalized weights applied to each microhypothesis are given to the left of each provisional physical credence.

So, then, how should we calculate likelihoods when we do not even know who or where we are? In cases like SB-B and SB-B′, our likelihoods depend on our indexical position. Naive FGAI simply used the maximum likelihood among all possible observers, but there are other ways to approach the problem (Fig. 5). These highlight different pitfalls to avoid in reconstructing a theory of typicality.

Implicit FGAI: can we just use observer-relative indexing?

One method to address SB-B′ is to reject its formulation, demanding the observer-relative indexing of ‘today’ and the ‘other day’. ‘Monday’ and ‘Tuesday’ do not have any meaning in this coordinate system except as purely indexical information. Instead, our microhypotheses are mixtures of Strategies A and B, with the coin reported as Heads ‘today’ and Tails ‘the other day’ in Strategy A′, and the reverse in B′. Then the experimenters announcing Heads rules out B′ in both Short and Long. As before, Implicit FGAI is only practical because of the symmetry. We cannot translate predictions between one coordinate system and the next. Many situations of interest, like evaluating the merits of theories where we are probably a Boltzmann brain against those where we are probably evolved, break this symmetry and require some way to compare different classes of observers.

SSA+FGAI: can we have Doomsday while avoiding solipsism with fine-graining?

A more radical solution is to reintroduce the SSA: instead of applying the maximum possible likelihood for an outcome among all observers we might be, we apply the mean likelihood among these possible observers. Equation (6) is modified to:

(8)$$P_{k, j}^{{\rm pos}} = {\displaystyle P_{k, j}^{{\rm prior}} \times \sum\nolimits_{i \in {\cal O}_{k, j} ( D) } {{\scr P}( o @ i \to D \vert \mu_{k, j}) \over \vert {\cal O}_{k, j} ( D) \vert }\over \displaystyle \sum\nolimits_{x = 1}^M \sum\nolimits_{y = 1}^{m_x} P_{x, y}^{{\rm prior}} \times \sum\nolimits_{z \in {\cal O}_{x, y} ( D) } {{\scr P}( o @ z \to D \vert \mu_{x, y}) \over \vert {\cal O}_{x, y} ( D) \vert }} .$$

Then, in SB-B′, the result of the ‘coin flip’ does not affect our credences in Short and Long. Both microhypotheses of Long would survive the observation, but they would be penalized by a 1/2 factor because only one of the two awakenings is given a Heads outcome. But this penalty continues to apply even when it is no longer needed: when we now apply the observation that ‘today is Monday’ (and thus Strategy A was adopted) to the resulting distribution, Short is now favoured two to one.

SSA+FGAI also gives the correct answer for the SB-D variant. The Doomsday Argument is hence possible in SSA+FGAI, but it does not lead to solipsism if we start out ignorant of our location and include hypotheses in our prior that we are in the ‘wrong’ place. That is the crucial difference between Doomsday and solipsism: it is impossible for high $N_x$ observers to exist without low $N_x$ observers, but it is possible that intelligence only exists in gas giants, or only around red dwarfs, or only in galactic bulges. Indeed, we must include these ‘wrong’ hypotheses to avoid solipsism, as though we are starting out as a prescientific society, although these possibilities usually are not included when ‘Copernican’ arguments are made. Even the Doomsday Argument may be tamed or neutralized in certain fine-grainings (KSO with a restricted reference class, from later in the paper).

Although tamer than the unrestricted SSA, SSA+FGAI is unsatisfactory both in that it allows (potentially unrealistically powerful) Doomsday Arguments at all and because it never ‘forgives’ the penalties it imposes. The 1/2 factor applied to Long's likelihood in SB-B′ arose from our uncertainty in our location. That uncertainty vanishes when one learns ‘today is Monday’, but the application of the SSA cannot be undone. Essentially, SSA+FGAI double-counts evidence against Large theories: penalizing all the microhypotheses because we do not know who we are, and then penalizing some of the microhypotheses because we do know who we are. Thus it either reaches different conclusions depending on the order in which data are learned, or it demands that we ignore data to avoid this double-penalty effect.

Rebooted SSA+FGAI: the use of shrinking reference classes

‘Rebooted SSA+FGAI’ is a modification of SSA+FGAI to eliminate the order-dependence of learning data. It is like SSA+FGAI, except that the credence distribution is reset to the original prior whenever new indexical information is learned. Then, all physical and indexical data are re-applied simultaneously. The likelihoods of microhypotheses are evaluated according to the SSA applied only among observers with all your current indexical data. Essentially, with each reboot, the reference class shrinks, undoing the now-redundant penalty of the SSA.

In summary, a theory of typicality should ideally (1) be able to evaluate data when observers at different locations have different likelihoods, (2) avoid universally favouring Large theories no matter the datum by using some kind of typicality assumption, but (3) be able to retroactively update the reference class used in the typicality calculations.

Weighted fine graining

Weighted Fine Graining's premise is that there is no single physical distribution for the microhypotheses. Instead, there is a set of provisional physical credences for each microhypothesis, each associated with a possible observer location. The credence is essentially a superposition of these provisional credences. We evaluate our credences in the various microhypotheses by averaging over the provisional physical distributions.Footnote 10

The provisional physical credence $p_{k,j;i}$ is proportional to the credence in $\mu_{k,j}$ calculated by an observer known to be along observer-trajectory $i \in {\cal O}_{k, j}$:

(9)$${\,p_{k, j; i}\over \sum_{\mu_{x, y}} p_{x, y; i}} \equiv P( \mu_{k, j} \vert o @ i) .$$

If $i \notin {\cal O}_{k, j}$, then $p_{k,j;i} = 0$. Additionally, the ratio of the provisional credences in $\mu_{k,j}$ for different observer-trajectories gives the indexical distribution for $\mu_{k,j}$:

(10)$$\xi_{k, j; i} = {\,p_{k, j; i}\over \sum_{z \in {\cal O}_{k, j}} p_{k, j; z}} .$$

The usual case is to start out with an uninformative indexical distribution for all $o \in {\cal O}_{k, j}$, resulting in $P_{k, j}^{{\rm prior}} = p_{k, j; i}^{{\rm prior}} = p_{k, j; z}^{{\rm prior}}$.

Prior provisional credences are updated with data D. The observation model of $\mu_{k,j}$ gives a single likelihood for D conditionalized on our location being i, which we use to update the provisional physical credences:

(11)$$p_{k, j; i}^{{\rm pos}} = p_{k, j; i}^{{\rm prior}} {\scr P}( o @ i \to D\vert \mu_{k, j}) .$$

Our credence in microhypothesis $\mu_{k,j}$ is then generated from the provisional physical distributions using indexical weights:

(12)$$P_{k, j} = {\displaystyle ( {1}/{{\hat{\Xi}}_{k, j}}) \sum\nolimits_{i \in {\cal O}_{k, j}} {\Xi}_i p_{k, j; i}\over \displaystyle \sum\nolimits_{\mu_{x, y}} ( {1}/{{\hat{\Xi}}_{x, y}}) \sum\nolimits_{z \in {\cal O}_{x, y}} {\Xi}_z p_{x, y; z}} .$$

where posterior provisional credences and indexical weights are substituted to find $P_{k, j}^{{\rm pos}}$, and prior provisional credences and weights to find $P_{k, j}^{{\rm prior}}$. As in Naive FGAI, posterior credence in a macrotheory is found by summing over microhypotheses. The use of the weights means that WFG is not a strictly Bayesian theory. We cannot calculate the ‘actual’ physical credence until we can assign a single, unambiguous likelihood, and we cannot do this in FGAI. Instead the posterior credence is a construction built from possible physical credences using the indexical weights.

The weight for i is proportional to the total provisional credence for that observer-trajectory:

(13)$${\Xi}_i = \displaystyle {\sum_{\mu_{x, y}} p_{x, y; i}\over \sum_{\mu_{x, y}} \sum_{z \in {\cal O}_{x, y}} p_{x, y; z}} .$$

These weights perform an averaging over all microhypotheses in all theories.Footnote 11 While not a pure indexical distribution, which in FGAI is considered to be associated with only one microhypothesis, they form an effective indexical distribution. In fact, this effective distribution is equal to the indexical distribution according to the SIA prior. The results differ from SIA because of the normalization factor applied to the weights:

(14)$${\hat{\Xi}}_{k, j} \equiv \displaystyle \sum_{i \in {\cal O}_{k, j}} {\Xi}_i .$$

The normalization factor is critical in that it allows a ‘dilution’ of credence in microhypotheses with more observers. It is a reflection of the fact that the set of observers in our reference class differs from one hypothesis to the next, and so will our conclusions for a typical observer. Thus, although the indexical prior reflects the SIA, the evaluated credences in equation (12) lack this indication effect.

In summary, the evaluated credences are calculated by (1) constructing the provisional credences using equations (9)–(11); (2) calculating the weights by summing the provisional credences for each observer-trajectory and normalizing; and (3) calculating the credence by taking the weighted average of the provisional credences for each microhypothesis using those weights, and normalizing.
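As a sketch of this recipe for a finite set of locations, the following Python fragment implements the steps above; the data structures and function names are illustrative only, and the check at the end anticipates the SB-B calculation presented below:

```python
def wfg_update(priors, O, likelihood):
    """Sketch of the Weighted Fine Graining update (equations (9)-(14)) for a
    finite set of locations.
    priors:     {micro: {location: provisional prior credence p_{k,j;i}}}
    O:          {micro: iterable of locations in that microhypothesis's set O_{k,j}}
    likelihood: function (micro, location) -> chance that an observer at that
                location records our data D, i.e. P(o@i -> D | mu_{k,j})."""
    # equation (11): update each provisional credence with a purely physical likelihood
    post = {m: {i: priors[m].get(i, 0.0) * likelihood(m, i) for i in O[m]}
            for m in priors}
    # equation (13): indexical weights from the total provisional credence per location
    locations = {i for m in O for i in O[m]}
    raw = {i: sum(post[m].get(i, 0.0) for m in post) for i in locations}
    norm = sum(raw.values())
    Xi = {i: raw[i] / norm for i in raw}
    # equations (14) and (12): normalized, weighted averages of the provisional credences
    unnorm = {}
    for m in post:
        Xi_hat = sum(Xi[i] for i in O[m])
        unnorm[m] = (sum(Xi[i] * post[m][i] for i in O[m]) / Xi_hat) if Xi_hat else 0.0
    total = sum(unnorm.values())
    return {m: unnorm[m] / total for m in unnorm}

# Check against SB-B: learning 'today is Monday' should leave Short and Long at 1/2.
priors = {'Short': {'Mon': 0.5}, 'Long': {'Mon': 0.5, 'Tue': 0.5}}
O = {'Short': ('Mon',), 'Long': ('Mon', 'Tue')}
monday = lambda m, i: 1.0 if i == 'Mon' else 0.0
print(wfg_update(priors, O, monday))     # {'Short': 0.5, 'Long': 0.5}
```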

The indexical weights probabilistically define our current reference class, which evolves as we learn new information. Yet the identity of the observer at each location (trajectory) $i \in {\cal O}$ depends on the microhypothesis, just as who is born at birthrank 1 in the Doomsday Argument varies. Thus the indexical weights describe a reference class of locations or contexts (or more properly, trajectories), not observers. The indexical weights tell us where we are, not what we are. The characteristics of the observer at that location are, if specified at all, given by the microhypothesis.

WFG has several useful limits:

  • When we already are certain of the physical nature of the world, p k,j;i is nonzero only for the known world model μ k,j. Then ${\Xi }_i = \xi _{k, j; i}$, with simple Bayesian updating, just as we expect from SB-A.

  • If ${\scr P}( o @ i \to D \vert \mu _{k, j})$ is the same for all possible locations, and this is true for all μ k,j, then equation (12) reduces to Bayes’ formula applied to

    (15)$$\widetilde{P_{k, j}} \equiv \displaystyle \sum_{i \in {\cal O}_{k, j}} {{\Xi}^{{\rm pos}}_i\over {\hat{\Xi}}_{k, j}} p_{k, j; i} .$$
  • When indexical information is entirely irrelevant, $p_{k,j;i} = P_{k,j}$, and all indexical dependence vanishes from equation (12), which reduces to simple Bayesian updating.

  • Suppose the data is consistent only with a subset ${\cal O}^{\prime }$ of locations, and that the posterior weights are uninformative (${\Xi }^{{\rm pos}}_i = 1 / \vert {\cal O}\vert$) among this subset. Then ${\cal O}^{\prime }$ is effectively our new location reference class, and equation (12) implements SSA+FGAI over it. Equation (12) can thus be viewed as WFG's rendition of the Observer Equation proposed by Bostrom (Reference Bostrom2013).

  • If we become certain of μ k,j, then provisional credences for all other microhypotheses are zero, and $P_{k, j}^{{\rm pos}} = 1$.

The provisional physical credences update solely in response to the physical fact that a specific observer measures the datum, independent of what happens at other locations. This ensures the insulation between the physical distribution and the indexical distribution, which emerges from the ratios of the provisional credences, although the weights have a more active role than in Naive FGAI. The weights shift adaptively in response to indexical data, as we can more accurately assess likelihoods over microhypotheses. The novel feature of WFG is the shifting indexical weights’ ability to ‘revive’ theories that were provisionally disfavoured by the SSA. If we had only a single joint probability distribution, evidence could only eat away at the prior probability in a theory, because likelihoods are always ≤1. The only way a theory survives with a single credence distribution is if the other theories’ prior credence is also consumed at a comparable rate. In WFG, the provisional probability invested in disfavoured observer trajectories is not necessarily lost but re-assigned to the microhypotheses consistent with likely observer trajectories.

This formulation of WFG is based on the assumption that there are only a finite number of locations or trajectories, parameterized by a finite set of indexical weights. Infinite volumes are a common feature of modern cosmologies, however, and this issue needs to be addressed. Perhaps the discrete distributions can be replaced with probability densities in a way that recovers conclusions found for a large, finite number of locations. Alternatively, the number of expected configurations of a Hubble volume is expected to be finite (e.g. Bousso, Reference Bousso2002), which may be consistent with the discrete distributions, if the observer locations are treated as equivalence classes of identical Hubble volumes at the moment we start the experiment.

An example: two urn problems

Urn problems give natural cases where we should favour a ‘Small’ theory where our observation is more typical. In Naive FGAI, this can follow from combinatorics when observers are distinguishable. The conclusion also holds up in WFG.

In the first urn problem (U-I), there is one urn. The urn may have a Small set of balls numbered 1 through $N_S$, or a Large set numbered 1 through $N_L$; we know that this number is set by some unknown deterministic process. Now, a series of N participants including us draws a ball from the urn in sequence, with replacement. We do not know how many participants there are before or after us, nor any other identifying details, nor what ball the other participants draw. We thus have a series of provisional credences for each possible participant, describing microhypotheses about the sequence of balls drawn. Now, if N is big enough, any possible draw will occur with high probability. However, in any theory about which type of urn is being used, the probability of drawing a given ball is equal for all locations. Thus, if we draw a numbered ball that is found in both a Small urn and a Large urn, we favour the Small urn, as expected – the provisional physical distribution has fewer compatible microhypotheses at each location without a compensating weight shift.

Figure 6 demonstrates a simple worked example. Here, there are two participants. They draw from the urn with replacement at distinct locations i and ii. We are completely agnostic about which of these two we are: $p^{{\rm prior}}_{k, j; {\rm i}} = p^{{\rm prior}}_{k, j; {\rm ii}}$. If the Small urn theory is true, there is only one ball in the urn, with one microhypothesis inheriting all of the Small theory's credence: $p^{{\rm prior}}_{S; {\rm i}} = p^{{\rm prior}}_{S; {\rm ii}} = 1/2$. According to the Large urn theory, there are two balls, with four microhypotheses, with $p^{{\rm prior}}_{L, 11; z} = p^{{\rm prior}}_{L, 12; z} = p^{{\rm prior}}_{L, 21; z} = p^{{\rm prior}}_{L, 22; z} = 1/8$. Because the sums of the provisional credences for position i and ii are the same, ${\Xi }_{{\rm i}} = {\Xi }_{{\rm ii}} = 1/2$. It can then be shown that $P^{{\rm prior}}_S = P^{{\rm prior}}_L = 1/2$.

Fig. 6. Treatment of the ‘urn problems’ U-I and U-II in WFG. In U-I (top), there is one urn. We and one other participant try to determine if it is Small or Large by drawing one ball with replacement. The different indices correspond to the order in which the participants draw. In U-II (bottom), there are two urns at different locations. We and one other participant are assigned randomly to the different urns and try to determine which urn location we are at based on that draw.

Now we draw ball one. The probability of this happening if we are an observer who draws ball two is zero, of course, updating some of the provisional credences: $p^{{\rm pos}}_{L, 12; {\rm ii}} = p^{{\rm pos}}_{L, 21; {\rm i}} = p^{{\rm pos}}_{L, 22; {\rm i}} = p^{{\rm pos}}_{L, 22; {\rm ii}} = 0$. The first and second positions are equally likely to draw ball one in the Large theory, so the sums of the provisional credences for these positions remain the same, 1/2 + 1/8 + 1/8 + 0 + 0 = 3/4. Thus, the weights remain the same for both positions, ${\Xi }^{{\rm pos}}_{{\rm i}} = {\Xi }^{{\rm pos}}_{{\rm ii}} = 1/2$. The evaluated credence in the Small theory is now

(16)$$P^{{\rm pos}}_S = {( 2/2) \times ( 1/2) \over ( 2/2) \times ( 1/2) + [ ( 2/2) + 2 \times ( 1/2) ] \times ( 1/8) } = 2/3 ,\; $$

as expected from the SSA.

What if instead we already know the distribution of urn sizes, and we are trying to determine which urn we are drawing from? In U-II, there are only two urns, and two participants including us, one for each urn. At location s is a small urn with a single ball numbered 1, while at location ℓ is a large urn with two balls numbered 1 and 2. We wish to know which location we are at, but we have no idea. The overall distribution of urns – one small and one big – is known, but there are still two microhypotheses regarding which ball is drawn at ℓ. The provisional credences are equal for both locations and microhypotheses, to yield uninformative physical and indexical priors: $p^{{\rm prior}}_{11; s} = p^{{\rm prior}}_{11; \ell } = p^{{\rm prior}}_{12; s} = p^{{\rm prior}}_{12; \ell } = 1/2$. Now we draw ball 1. If we are at the small urn, or if the ℓ participant draws ball 1, then this is consistent with this datum ($p^{{\rm pos}}_{11; s} = p^{{\rm pos}}_{11; \ell } = p^{{\rm pos}}_{12; s} = 1/2$), but not if we are the ℓ participant in the microhypothesis where they draw ball 2 ($p^{{\rm pos}}_{12; \ell } = 0$). By summing these provisional credences, we find ${\Xi }^{{\rm pos}}_{s} = ( 1/2 + 1/2) /[ ( 1/2 + 1/2) + 1/2] = 2/3$ and ${\Xi }^{{\rm pos}}_{\ell } = 1/3$. This indexical formulation of the urn problem gives the same result – we favour our being at the small urn by 2:1.

An interesting effect of the updating weights is shown in Figure 6. What if, in U-II, we want to know whether both participants drew ball 1? Of course, if we are certain we are at the small urn, then we definitely have no information about what the large urn participant draws and $P^{{\rm pos}}_{11} = P^{{\rm prior}}_{11} = 1/2$. If we remained completely uncertain about our location, as with a naive application of the SSA, we would average the probability that a random participant draws ball 1: $P^{{\rm pos}}_{11} = ( 1/2) \times 1 + ( 1/2) \times 1/2 = 3/4$. In WFG, we favour our being at location s but are not certain about it, resulting in an intermediate credence: $P^{{\rm pos}}_{11} = ( 2/3 \times 1 + 1/3 \times 1) / [ ( 2/3 \times 1 + 1/3 \times 1) + ( 2/3 \times 1 + 1/3 \times 0) ] = 3/5$.
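Both urn problems can be verified with exact arithmetic by following the steps above. In this sketch every microhypothesis spans both locations, so the normalization factors $\hat{\Xi}$ are all unity and are omitted; the names are illustrative:

```python
from fractions import Fraction as F

# U-I: one urn, two participants at positions i and ii drawing with replacement.
# Microhypotheses: S (one ball) and L11, L12, L21, L22 (two balls; the digits give
# the balls drawn at positions i and ii respectively).
positions = ('i', 'ii')
prior = {'S': {pos: F(1, 2) for pos in positions}}
prior.update({f'L{a}{b}': {pos: F(1, 8) for pos in positions}
              for a in (1, 2) for b in (1, 2)})
draws = {'S': {'i': 1, 'ii': 1}}
draws.update({f'L{a}{b}': {'i': a, 'ii': b} for a in (1, 2) for b in (1, 2)})

# We draw ball one: zero the provisional credence wherever that position draws ball two.
post = {m: {pos: (p if draws[m][pos] == 1 else F(0)) for pos, p in prior[m].items()}
        for m in prior}
Xi_raw = {pos: sum(post[m][pos] for m in post) for pos in positions}
total_Xi = sum(Xi_raw.values())
Xi = {pos: w / total_Xi for pos, w in Xi_raw.items()}        # 1/2 and 1/2
cred = {m: sum(Xi[pos] * post[m][pos] for pos in positions) for m in post}
print(cred['S'] / sum(cred.values()))                        # 2/3, matching equation (16)

# U-II: a small urn (ball 1) at location s, a large urn (balls 1 and 2) at location l.
# Microhypotheses specify which ball the participant at l draws.
locs = ('s', 'l')
prior2 = {'l draws 1': {loc: F(1, 2) for loc in locs},
          'l draws 2': {loc: F(1, 2) for loc in locs}}
draws2 = {'l draws 1': {'s': 1, 'l': 1}, 'l draws 2': {'s': 1, 'l': 2}}
post2 = {m: {loc: (p if draws2[m][loc] == 1 else F(0)) for loc, p in prior2[m].items()}
         for m in prior2}
Xi2_raw = {loc: sum(post2[m][loc] for m in post2) for loc in locs}
tot2 = sum(Xi2_raw.values())
Xi2 = {loc: w / tot2 for loc, w in Xi2_raw.items()}          # s: 2/3, l: 1/3
cred2 = {m: sum(Xi2[loc] * post2[m][loc] for loc in locs) for m in post2}
print(Xi2['s'], cred2['l draws 1'] / sum(cred2.values()))    # 2/3 and 3/5
```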

WFG and Sleeping Beauty

WFG's treatment of the Sleeping Beauty examples is illustrated in Figure 7. Because both the physical provisional distributions and indexical weights are updated according to Bayes’ equation, we can recover the naive Bayesian predictions of SB-A and SB-C.

Fig. 7. Treatment of the fundamental variations of the Sleeping Beauty problem in WFG. In SB-A, only the indexical weights shift, while in SB-C, the physical credence distributions entirely drive the conclusion. In SB-B, the indexical information shifts the indexical weights assigned to each provisional distribution, ensuring that neither the Short nor Long distributions are favoured.

In the SB-B variant, there are only the Short and the Long theories, with one trivial microhypothesis each, and each having 1/2 prior credence. The Short theory has a normalized indexical weighting of 1 for Monday. The Long theory has a Monday provisional credence and a Tuesday provisional credence, both still equal to 1/2. The prior indexical weights are 2/3 for Monday and 1/3 for Tuesday. Now we learn ‘today is Monday’. The likelihood that this observation is made is 1 if we are the Monday observer, so the Monday provisional credences are multiplied by a likelihood of 1. The likelihood that this observation is made is 0 if we are the Tuesday observer, so the Tuesday provisional credence in the Long theory is eliminated. However, we have 100% confidence in our being the Monday observer if today is Monday, and the indexical weights shift to 1 for Monday and 0 for Tuesday. As a result, the probability lost in the Tuesday provisional distribution is irrelevant. Our credence in the Long theory is (1 × 1/2 + 0 × 0)/[(1 × 1/2) + (1 × 1/2 + 0 × 0)] = 1/2, just as before.

There may be some situations like SB-B when we do want an indication effect: for example, if SB-B is carried out with two already extant observers, Alice and Bob, who are released after the experiment. If Alice wakes up during the experiment instead of afterwards, she can arguably conclude the Long scenario is more likely. Garisto (Reference Garisto2020) emphasizes this distinction between situations where one outcome is ‘Picked’ and those where each outcome is simply a different location one can ‘Be’. Indexical weights representing trajectories instead of static locations allow for this distinction. In the Alice-first Long microhypothesis, the Monday trajectory connects with Alice's Sunday location instead of Bob's Sunday location; in the Bob-first Long microhypothesis, the Monday trajectory connects with Bob's Sunday location. There are thus four possible indexical weights, and each microhypothesis is compatible with only some of them.

SB-D, virtual observers, and the nature of the provisional distributions

Explaining SB-D is a challenge for FGAI, and the resolution I will now sketch provides insight into the nature of the provisional distributions.

The issue in SB-D arises when a participant builds the provisional distributions as if they are an external, timeless observer X of the experiment. Then, one may reason that Short and Long are equally likely, so the two Short microhypotheses (waking on Monday and waking on Tuesday) each inherit half of Short's credence. Short-Monday's entire credence of 1/4 is given to the Monday provisional distribution, Short-Tuesday's entire credence of 1/4 is given to the Tuesday provisional distribution, and the weights normalize so that Short and Long have equal prior credence.

Learning that it is Monday, however, rules out the Short-Tuesday microhypothesis but the weights shift so that the Long theory is unaffected. As a result, we now favour Long 2:1 (Fig. 8). Crucially, unlike SB-C, this also happens if one learns it is Tuesday. Any data about one's location favours the Long theory, a clear absurdity. Even if one initially favoured Short 2:1, the Bayesian shift still occurs, so the problem remains.

Fig. 8. Illustration of an indication paradox resulting from SB-D. Whether we are told it is Monday or Tuesday, we seemingly favour the conclusion we are in the Long experiment (top). Virtual observers can be introduced to prevent this paradox (bottom).

Clearly, we cannot use this ‘external’ credence distribution. For X, the statement ‘it is Monday’ has no relevance. Presumably, X asks something like ‘is the participant awake on Monday?’, but an affirmative reply is informative because it is possible for the answer to be ‘No’, whereas there is a selection effect for the participant. Or, if X asks ‘what is one day the participant is awake on?’, then there are equal chances of the result being Monday or Tuesday in the Long case, so the credence of the Long theory is reduced accordingly (essentially converting SB-D into SB-B′).

But in fact the provisional prior distributions are the prior distributions one would have used if one were entirely certain of one's location. For each location, the Short and Long prior probabilities should be equal. This is supported by the analogy between AB-B and AB-D: surely our present belief in the diversity of ETIs should not depend on whether we learned of Earth's environment before or after the question was first posed. One's situation after learning ‘today is Monday’ in SB-D is the same as if one had learned ‘today is Monday’ in SB-B.

How, then, are we to avoid the Bayesian shift that occurs? I propose that we add virtual observers to each Short microhypothesis, expanding ${\cal O}_{S, M}$ and ${\cal O}_{S, T}$ to {M,  T}. The observational model is extended to allow virtual observers to ‘observe’ the same data that a real observer at the same location would. The provisional prior credence of a microhypothesis for a virtual observer is zero. The indexical distributions emerge from the provisional credences, so they still give the correct result. The virtual observers dilute the Short microhypotheses’ credences by spreading the indexical weights. Furthermore, now Short's weights are as responsive as Long's, allowing the remaining Short microhypothesis to remain on an even level with Long (Fig. 8).
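The effect of the virtual observers can be checked numerically. In the sketch below, the first prior assignment is my reading of the ‘external observer’ construction described above (no virtual observers); the second uses location-certain priors together with zero-credence virtual observers. All names are illustrative:

```python
from fractions import Fraction as F

def credences_after_monday(prior, O):
    """WFG credences after learning 'today is Monday' (likelihood 1 at Mon, 0 at Tue).
    Virtual observers, where present, appear in O with zero provisional prior."""
    post = {m: {i: (prior[m].get(i, F(0)) if i == 'Mon' else F(0)) for i in O[m]}
            for m in prior}
    raw = {i: sum(post[m].get(i, F(0)) for m in post) for i in ('Mon', 'Tue')}
    total_raw = sum(raw.values())
    Xi = {i: r / total_raw for i, r in raw.items()}
    unnorm = {}
    for m in post:
        Xi_hat = sum(Xi[i] for i in O[m])
        unnorm[m] = (sum(Xi[i] * post[m][i] for i in O[m]) / Xi_hat) if Xi_hat else F(0)
    total = sum(unnorm.values())
    return {m: c / total for m, c in unnorm.items()}

# Without virtual observers, using the 'external observer' prior split of SB-D:
prior_ext = {'Short-Mon': {'Mon': F(1, 4)}, 'Short-Tue': {'Tue': F(1, 4)},
             'Long': {'Mon': F(1, 2), 'Tue': F(1, 2)}}
O_ext = {'Short-Mon': ('Mon',), 'Short-Tue': ('Tue',), 'Long': ('Mon', 'Tue')}
print(credences_after_monday(prior_ext, O_ext))   # Long is favoured 2:1 (the paradox)

# With virtual observers (zero provisional prior) and location-certain priors:
prior_vo = {'Short-Mon': {'Mon': F(1, 2), 'Tue': F(0)},
            'Short-Tue': {'Mon': F(0), 'Tue': F(1, 2)},
            'Long': {'Mon': F(1, 2), 'Tue': F(1, 2)}}
O_vo = {m: ('Mon', 'Tue') for m in prior_vo}
print(credences_after_monday(prior_vo, O_vo))     # Short-Mon and Long each at 1/2
```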

Virtual observers are placed to prevent Bayesian shifts upon learning any data (as in SB-D), but not to inhibit valid inferences when there are more outcomes in one theory than another (as in SB-C) – a distinction dependent on symmetry. The nature of these virtual observers is unclear, and a full theory of how they are placed must be developed. This is clear when we consider a hybrid thought experiment, where there is a Short variant with a wakening on Monday only, a Long variant with awakenings on Monday and Tuesday, and an Intermediate variant with awakenings on one day chosen at random. If one then learns the experiment is not-Short, not-Intermediate, or not-Long, one would expect the situation to reduce to SB-D, SB-B, or SB-C, respectively, but this is impossible if there is only one provisional distribution for each location. The reductions are possible if there are two Monday and Tuesday provisional distributions, one for Short-Intermediate and one for Long-Intermediate, but this is admittedly contrived. This hybrid thought experiment is unlike the types of problems that arise in astrobiology, though, and may itself be contrived.

Variants on weighting

The indexical weights are based on sums of the provisional credences for each position. Our evaluated credence in microhypotheses according to equation (12) therefore depends on the provisional credences in separate theories. This can lead to some odd results. Consider SB-B′, after one learns that the experimenters report a Heads outcome. Only one microhypothesis in each theory for each day survives this observation (Short-A and Long-A for Monday, and Long-B for Tuesday); thus neither the weights nor the relative credence in Short and Long change. However, because the weights favour Monday two-to-one, that means that $P^{{\rm pos}}({\rm Long + A}) = 2\, P^{{\rm pos}}({\rm Long + B})$ (c.f., Fig. 5). Of course, if the experimenters had announced Tails, the relative credences of the microhypotheses would be reversed; it is not as though any data leads us to favour Long+A over Long+B.

One ultimately problematic way to address this issue is to adopt a weight distribution that starts out not favouring any particular position: ${{\Xi }^{{\rm prior}}_i}^{\prime } = 1/\vert {\cal O}\vert$. The weights then are updated by multiplying with an indexical likelihood,

$${\sum_{\mu_{k, j}} p_{k, j; i} {\scr P}( o @ i \to D \vert \mu_{k, j}) \over \sum_{\mu_{k, j}} p_{k, j; i}}$$

and normalizing. In many situations this gives the same result as the standard weights. Unfortunately, the weights now retain a ‘memory’ of theories that are no longer viable, with two negative consequences. First, the weights can no longer directly serve as an effective indexical distribution. This can be seen in SB-B if the participant learns only that they are in the Long experiment without learning the day of the week. Then because the Short theory is ruled out, the Monday weight has an indexical likelihood of 1/2 while the Tuesday weight has an indexical likelihood of 1, leading the weights to favour Tuesday two-to-one. Second, this memory effect can still lead to undesirable asymmetry in credences. For suppose the participant in SB-B′ learns they are in a Long experiment and the otherwise irrelevant detail that the coin flip ‘outcome’ for the day is Heads. Now, because the weights favour Tuesday, the participant would conclude that $P^{{\rm pos}}({\rm Long + B}) = 2\, P^{{\rm pos}}({\rm Long + A})$. In contrast, with the standard weights, the now-unviable Short theory no longer influences ${\Xi }^{{\rm pos}}$ or $P^{{\rm pos}}$.

A different solution is to abandon universal sets of weights. Instead the weights are defined only for a subset of microhypotheses, most naturally individual macrotheories. Defining a separate set of weights for each macrotheory k,

(17)$${\Xi}_{k; i} = \displaystyle{\sum_{\mu_{k, j} \in {\Theta}_k} p_{k, j; i}\over \sum_{z \in {\cal O}_{k}} \sum_{\mu_{k, j} \in {\Theta}_k} p_{k, j; z}},\; $$

still allows for an averaging over microhypotheses, but now credence shifts in one theory have no effect on the weights in another. This eliminates the asymmetric microhypothesis credences in SB-B′. The main issue with macrotheory-level weights is their ad hoc nature. It leaves open the question of what even counts as a macrotheory, and why that level should be ‘correct’ other than that it gives seemingly correct results.

Using separate sets of indexical weights for every single microhypothesis may seem the most natural solution, with ${\Xi }_{k, j; i} = \xi _{k, j; i}$. Each microhypothesis’ credence is determined by the likelihood for the locations most compatible with observations. Essentially, microhypothesis-level weighting implements the simplest forms of Naive FGAI, without implicit indexing. And therein lies the problem: as long as there is any observer anywhere who observes one's data in the microhypothesis, that microhypothesis cannot be constrained. Thus in the Proximan life thought experiment, if life on Proxima b is independent of our existence, then L-B cannot be ruled out if we observe life there: it simply means we live on one of the Earths in an endless universe that just so happens to neighbour an ultrarare inhabited Proxima b, whether that is Earth 1 or Earth 492,155. Implicit indexing in Naive FGAI gets around this by fixing one position as the only possible location, preventing indexical weight shifts. A similar restriction of the reference class (e.g. only Earth 1 has nonzero indexical weight) is necessary if microhypothesis-level weighting is to be practical.

Boltzmann brains in WFG

The Boltzmann brain problem is deeply related to the notion of typicality and has guided previous treatments of the matter. Cosmological theories with large thermal baths – including the event horizons of black holes and de Sitter universes – predict the appearance of Boltzmann brains, observers with your memories and beliefs that assemble out of thermal fluctuations and flicker into activity briefly (e.g. Dyson et al., Reference Dyson, Kleban and Susskind2002; Albrecht and Sorbo, Reference Albrecht and Sorbo2004; Carroll, Reference Carroll, Dasgupta, Dotan and Weslake2017). Not only are they predicted to exist with probability 1, but current cosmological theories have the disturbing tendency to predict that Boltzmann brains with your memories vastly outnumber those of you who have evolved naturally. Furthermore, even in universes very unlike ours, there should exist observers with our memories who remember our ΛCDM cosmology with probability 1. So what justification do we have in concluding that the universe is actually anything like we observe it to be?

Boltzmann brains and similar problems motivated Bostrom (Reference Bostrom2002) to introduce the SSA and argue for the use of a wider reference class than observers identical to us. The vast majority of Boltzmann brains have experiences completely unlike ours; even those that momentarily have our memories almost always find their sensoria dissolving immediately afterwards as they are exposed to their environment (say, all but 1 in $\gg e^{10^{25}}$). We might hope that the persistence of our observed world favours us being evolved observers viewing a real universe. In the strictest interpretation of FGAI, this is of no help for inference because there always exist Boltzmann brains that observe a persisting universe (living on Boltzmann Earths, for example).

Let us now apply WFG to a simplified Boltzmann brain problem. In false cosmologies, only the Boltzmann brains (b) exist, while the true cosmology also contains evolved brains (e). We start with a set of indexical weights assigning equal credence to each possible location, Boltzmann or evolved. Every evolved observer sees a long-lived persistent universe, but only a small fraction η of the Boltzmann brains observes one. As the observed cosmology persists in our sensory data, we update both the indexical weights and each cosmology's provisional physical credence. Then we will find ourselves both favouring the true cosmology and, if we use provisional-prior updating for the weights, favouring our being an evolved observer within that cosmology.

For example, suppose we consider two cosmologies, a False one (F) and a True (T) one. The False one contains only $N_b$ locations with Boltzmann brains, while the True one contains those same $N_b$ locations with Boltzmann brains and an additional $N_e$ locations with evolved observers. The provisional credences of each microhypothesis start equal for each occupied position in each theory. This results in initial weights of $1/(N_e + 2N_b)$ for each evolved location and $2/(N_e + 2N_b)$ for each Boltzmann brain location.

Our not instantly dissipating is certain if we are evolved observers but unlikely if we are Boltzmann brains. It can be shown that the effective likelihood for each evolved location is then 1, but η for Boltzmann brain locations. When the provisional credences are updated, the weight for each evolved location shifts to $1/(N_e + 2\eta N_b)$ and the weight for each Boltzmann brain location falls to $2\eta/(N_e + 2\eta N_b)$. If we are an evolved individual, all microhypotheses in the T theory are consistent with the observation, while if we are any given Boltzmann individual, only a fraction η of the provisional credence in each theory remains. Using equation (12), we now evaluate a posterior credence in cosmology T of:

(18)$$P^{{\rm pos}}_T = {N_E + 2 \eta^2 N_B\over N_E ( 1 + \eta) + 4 \eta^2 N_B} .$$

As long as η ≪ 1 and $N_E \gg 2\eta^2 N_B$, we now almost entirely favour cosmology T ($P^{{\rm pos}}_T \approx 1 - \eta - 2 \eta ^2 N_B/N_E$) (see Fig. 9 for a minimal example). When these criteria are fulfilled, the provisional credence for the evolved locations dominates the evaluated credence in T, even if ${\Xi }^{{\rm pos}}_E \ll {\Xi }^{{\rm pos}}_B$.Footnote 12

Fig. 9. Minimal example of the Boltzmann brain problem treated in WFG with $N_B = 2$ and $N_E = 1$. Both the false and true cosmologies have two Boltzmann observers identical to ourselves, each with an equal probability of being long-lived (ℓ) and of decaying in a short time (s). The true cosmology also has an evolved observer E that always observes it correctly. An observation of persistence (ℓ) leaves the provisional physical distribution for E untouched and also shifts indexical weight to E, favouring T.

In the above example, we consider only observers with exactly our memories at the start of the observation (as in Neal, Reference Neal2006). There are two reasons this is allowed despite Bostrom (Reference Bostrom2013)'s objections. First, the indexical weights are averaged instead of being defined on the microhypothesis level, and they describe a location reference class instead of an observer reference class. This allows the vast majority of Boltzmann brains that later experience a dissipating universe to influence the final credence: although a given microhypothesis has some locations with long-lived Boltzmann brains, these are no more favoured than locations with dissipating Boltzmann brains. Second, unlike the SSA, WFG results in an uneven indexical distribution: we place much more weight on each evolved brain location than on the Boltzmann brain locations. This actually results in a faster convergence to the realism of the T cosmology than under the SSA.
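As a concrete check, the following minimal sketch reproduces equation (18) numerically under one reading of the procedure above: equal prior credence of 1/2 in F and T, each treated as a single microhypothesis, likelihood 1 (evolved) or η (Boltzmann) for the persistence observation, and the posterior indexical weights applied to the provisional credences of each cosmology's occupied locations. The function names and test values are illustrative, not taken from the paper.

```python
# Minimal sketch of the simplified Boltzmann brain problem in WFG (assumptions as stated above).

def wfg_boltzmann_posterior(N_E, N_B, eta):
    prior_F, prior_T = 0.5, 0.5
    like_evolved, like_boltzmann = 1.0, eta      # likelihood of observing persistence

    # Posterior indexical weights: prior-weighted likelihoods summed over the
    # cosmologies containing each location, normalized over all locations.
    w_E = prior_T * like_evolved                 # evolved locations exist in T only
    w_B = (prior_F + prior_T) * like_boltzmann   # Boltzmann locations exist in F and T
    norm = N_E * w_E + N_B * w_B
    w_E, w_B = w_E / norm, w_B / norm

    # Provisional (unnormalized) posterior credences in each cosmology at a location.
    p_T_evolved = prior_T * like_evolved
    p_T_boltz = prior_T * like_boltzmann
    p_F_boltz = prior_F * like_boltzmann

    # Weighted sums, renormalizing the weights over each cosmology's occupied locations.
    occ_T = N_E * w_E + N_B * w_B                # equals 1 by construction
    occ_F = N_B * w_B
    cred_T = N_E * (w_E / occ_T) * p_T_evolved + N_B * (w_B / occ_T) * p_T_boltz
    cred_F = N_B * (w_B / occ_F) * p_F_boltz
    return cred_T / (cred_T + cred_F)

def closed_form(N_E, N_B, eta):
    """Equation (18)."""
    return (N_E + 2 * eta**2 * N_B) / (N_E * (1 + eta) + 4 * eta**2 * N_B)

for N_E, N_B, eta in [(1.0, 2.0, 0.5), (1.0, 1e6, 1e-4), (10.0, 1e12, 1e-8)]:
    print(wfg_boltzmann_posterior(N_E, N_B, eta), closed_form(N_E, N_B, eta))
```

The two columns agree; for instance, with the hypothetical values $N_E = 1$, $N_B = 10^6$ and $\eta = 10^{-4}$, the posterior credence in T is about 0.98 even though the Boltzmann brains vastly outnumber the evolved observers, because $N_E \gg 2\eta^2 N_B$.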

FGAI and x-day arguments

Classes of x-day models in FGAI

One advantage of underpinning typicality arguments with fine-grained hypotheses is that doing so forces one to use a well-specified model that makes the assumptions explicit. In an x-day Argument, each microhypothesis corresponds to a possible permutation of people born as well as a complete set of possible observations by each observer (whether a human or an effectively independent external observer like an alien). This entails an observation model, a set of constraints on who can observe whom. In particular, realistic theories of observation are causal – one cannot ‘observe’ people living in the future – and local – one cannot ‘observe’ another person without some physical mechanism linking them.

I present four general schemas of fine-grained x-day theories, resulting in microhypotheses with different combinatorial properties. These fine-grainings are illustrated by a simplified model where we consider only two theories (cf. Leslie, Reference Leslie1996): a Short/Small theory where all humanities have a final population $N_S$ and a Long/Large theory with final population $N_L$ in all humanities, where $N_L > N_S$. Each human at x-rank r is drawn from a set of possible humans ${\cal H}$, and measures their x-rank to be r, specifying the observation model. The possible humans in ${\cal H}$ correspond to different genetic makeups, microbiomes, life histories, memories encoded in the brain, and so on. The details of how humans at rank r are selected from ${\cal H}$ form the basis for each of the different model classes (Fig. 10).

Fig. 10. Different fine-grainings of the x-day Argument have different assumptions about the interchangeability of humans. The clouds represent an unknown or random selection from a set. The shown configurations are chosen so that observer A (blue circle) exists with rank 1, but this is a contingent selection in the models.

The four schemas differ by their restrictions on the possible permutations:

  • Indistinguishable Observers (IO) – Every possible human is treated as identical, with ${\cal H} = \{ A\}$. There is only one possible microhypothesis for each theory because the permutations are indistinguishable. Any information about x-rank is treated as purely indexical. IO is essentially the same kind of scenario as SB-B.

  • Unconstrained Observers (UO) – Humans are drawn from a very large set ${\cal H}$ (with $\vert {\cal H}\vert = {{\cal N}}$) of distinguishable observers, and any $h \in {\cal H}$ can be born at any rank r. In UO, people with different names, identities, and memories could be treated as distinct members of ${\cal H}$, but these details would have no correlation with historical era.

  • Unconstrained Subset Ordering (USO) – The set of possible humans ${\cal H}$ is partitioned into ${\cal R}$ (≥$N_L$) mutually disjoint subsets ${\cal H}_r$, with one for each r. Humans with rank r can only be drawn from ${\cal H}_r$. This reflects the fact that each individual is the result of a vast constellation of historical circumstances that will never be repeated. In USO, we can specify the contents of these subsets, but we do not know which subset is assigned to each r. In an infinite Universe, every ‘copy’ of you will have the same r as you.

  • Known Subset Ordering (KSO) – As in USO, the set of possible humans is partitioned into disjoint subsets ${\cal H}_r$, with humans at rank r drawn only from ${\cal H}_r$. Unlike USO, we already know beforehand the ordering of these subsets – which one is ${\cal H}_1$, ${\cal H}_2$, and so on. All that remains to be discovered is the final human population N and which humans, in fact, are selected out of each ${\cal H}_r$. KSO models emphasize how we already know, by virtue of our historical knowledge, our place in history before applying an x-day Argument.Footnote 13

None of these models exactly corresponds to how we would approach the Doomsday Argument, since we do not actually know the set ${\cal H}$ or exactly how it is partitioned – these details are implicit for us. But by making explicit models, they illustrate when the Doomsday Argument is appropriate. Of these, KSO arguably is most analogous to our situation when we apply the Doomsday Argument, since humans who know they live in the year 2022 cannot be born in the Palaeolithic or an interstellar future (to the extent our memories are reliable). Our Short and Long theories start with our known history, and then propose an additional $N_{\rm future}$ people after us, with the likelihood of us existing at birthrank $10^{11}$ being 1 by assumption. This is not just a tautology because who you are is shaped by your place in history; realistic Small and Large models both predict that you, with all your personality, memories, and beliefs, could only appear at this point in history (as in Korb and Oliver, Reference Korb and Oliver1998).Footnote 14

Self-applied x-day arguments in FGAI

An x-day Argument to constrain final population may be self-applied by a member of the population being constrained (as in the Bayesian Doomsday Argument, or astrobiological Copernican arguments), or applied by an external observer who happens upon the population at some time (analogous to some of Gott Reference Gott1993's examples). Self-applied x-day Arguments are by far the more problematic.

Fine-graining the self-applied Doomsday argument when our rank is 1

Starting with a ‘single-world’ assumption that there is only one humanity in the cosmos, it is relatively simple to calculate the number of microhypotheses in IO, UO, USO, and KSO with combinatorial arguments. These are listed in Table 1, where the observer in question is labelled A and is located at $N_x = 1$ without loss of generality. We can also consider ‘many worlds’ models in which there are many copies of humanity out in the larger cosmos, each with different rank permutations of human individuals. This increases the number of microhypotheses, but as in the urn problem, our ignorance of which world we are in and the symmetry between the worlds yields the same results when our rank is 1.

Table 1. Microhypotheses counts for self-applied x-day arguments in single world models

Each macrotheory about $N_{\rm total}$ has its own set of indexical weights. In the single world versions, the locations (or trajectories) for these indexical weights are simply each possible x-rank from 1 to $N_{\rm total}$. In multiple world versions, there is a location for each x-rank and each world. These weights are initialized to be equal for each possible location. Every human can learn their x-rank but not which of the worlds they live in.

The four schemas all arrive at the same ultimate conclusion: the fraction of microhypotheses that survives when A measures their rank to be 1 is independent of $N_{\rm total}$. The self-applied x-day Argument is nullified in all of them, although they take different routes to reach this conclusion.

IO is qualitatively equivalent to SB-B. A is the only possible individual, and there only ever is one microhypothesis per theory, which can never be ruled out – A merely learns indexical information. Learning one has an x-rank of 1 shifts the indexical weights in the Long theory, and the credences do not change. In a many world case where all worlds have the same $N_{\rm total}$, there is uncertainty about which world one is in, but the symmetry results in the same conclusion. If there are many worlds and we consider theories where $N_{\rm total}$ is assigned randomly, we get a series of microhypotheses about which worlds are Small and which Large. A self-observed x-rank of 1 zeroes out the indexical weights for all positions with $N_x > 1$ in every world, inhibiting the Doomsday Argument just as for the single-world case.

For UO and USO, there are more locations for A to be born at in the Large theory, and thus more microhypotheses in which A observes themself to exist.Footnote 15 In WFG, indexical weights are initialized evenly for all observers, and the observation of one's existence provides no indexical information to change that. Thus, although the amount of prior credence that survives the self-observation is greater by a factor of $N_{\rm total}$ for a single world model, it is weighted by a factor of $1/N_{\rm total}$, as only that fraction of individuals has that specific self-data (see the sketch below). Upon learning your x-rank, large $N_{\rm total}$ theories are penalized by having fewer microhypotheses survive, but this is compensated by the indexical weights shifting to locate you at your actual x-rank. Thus you can conclude nothing from the self-applied x-day Argument. A worked example is given in the Appendix.Footnote 16
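The compensation between microhypothesis counts and indexical weights can be checked by brute force in a toy UO model. The sketch below assumes a hypothetical set of five possible humans and treats each microhypothesis as an ordered draw of $N_{\rm total}$ distinct humans; both the set and the labels are illustrative. The raw fraction of microhypotheses containing a specific individual A grows with $N_{\rm total}$, but once each surviving microhypothesis is weighted by $1/N_{\rm total}$, the result is the same for every theory.

```python
from itertools import permutations

H = ["A", "B", "C", "D", "E"]   # hypothetical set of possible humans (illustrative only)

def uo_survival(n_total):
    """Raw and 1/N-weighted fractions of UO microhypotheses containing human 'A'."""
    micro = list(permutations(H, n_total))        # every ordered draw of n_total humans
    with_a = [m for m in micro if "A" in m]
    raw = len(with_a) / len(micro)                # grows in proportion to n_total
    weighted = sum(1.0 / n_total for _ in with_a) / len(micro)   # constant across theories
    return raw, weighted

for n_total in (1, 2, 3, 4):
    print(n_total, uo_survival(n_total))
# raw fractions: 0.2, 0.4, 0.6, 0.8; weighted fractions: 0.2 in every case
```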

KSO proceeds more simply because learning your individual identity also fixes your x-rank with certainty in all microhypotheses. The self-observation eliminates the same fraction of prior weight in all theories for observers with your x-rank (Fig. 11). The self-applied x-day Argument is again rendered powerless.

Fig. 11. Illustration of how a self-applied Doomsday Argument fails in the KSO schema according to WFG. Two models are compared, with $N_S = 1$, $N_L = 2$, ${\cal H}_1 = \{ A_1,\; \, B_1\}$, and ${\cal H}_2 = \{ A_2,\; \, B_2\}$. The combinatorial properties of the microhypotheses, aided by the shifting indexical weights, ensure the prior is unaffected by an observer learning they are $A_1$ at rank 1.

The anthropic shadow: when our rank is bigger than 1

According to Large theories about possible futures where we are likely to survive for a long time on Earth, or spread across the stars, we are among the earliest people in our society, but our birthranks are not actually 1. This can become crucial when judging Small theories. Perhaps it was nigh inevitable that humanity would be annihilated shortly after the invention of nuclear weapons and we are one of the rare exceptions, for example. Does the birthrank of today's newborns serve as evidence against $N_{\rm total}$ distributions where we are an outlier?

WFG enforces a high level of agnosticism about the $N_{\rm total}$ distribution below our current $N_{\rm past}$ when we first apply the argument. Our initial application of the self-observation of a current x-rank fixes our indexical position in each world – really it selects a class of observer-trajectories that start at $N_x$. The result is that any microhypothesis where there is at least one world that is large enough for $N_{\rm total} \ge N_x$ survives unscathed. The only nonzero indexical weights are for our x-rank in those worlds that have them, all of which are completely compatible with our x-rank. This is the ‘anthropic shadow’ effect of Ćirković et al. (Reference Ćirković, Sandberg and Bostrom2010). Curiously, this leads to the credence in the distribution depending on the number of worlds, because with more worlds there is more likely to be one that survives long enough for our self-observation. In the limit that there are an infinite number of worlds, any distribution with nonzero probability of a world surviving as long as ours has a likelihood of 1.

After that initial self-observation, however, there is no more freedom to shift the indexical weights. Our observer-trajectory has a fixed beginning. As our world continues to survive, only some fraction of the worlds in each microhypothesis last as long. The provisional credence for worlds that are destroyed in a microhypothesis is zeroed out by our continued survival, reducing our overall credence in the microhypothesis and its parent distribution theory. WFG conditionalizes the credences on our initial x-rank, still allowing us to constrain Small theories by their large $N_{\rm total}$ tails.

This raises the question of which x-rank-trajectory we should use. Science is a collective enterprise, so when engaging in it we should not all be using our individual x-rank (c.f. the discussion in Garriga and Vilenkin, Reference Garriga and Vilenkin2008). Future generations can use our x-rank because the indexical weights are for trajectories which can start before they were born. We might ‘start the clock’ at some date corresponding to anywhere from the earliest time in cultural memory to the invention of the Doomsday Argument, or perhaps the beginning of humanity itself (or even earlier!). This is an open issue.

Why the self-applied Doomsday argument fails

WFG, with its emphasis on observations as physical events, requires exactly specified (if simplified) models instead of simply relying on analogies. Purely indexical problems, like SB-A or Leslie's ‘emerald’ thought experiment, do not accurately model the Doomsday Argument. Nor do purely physical problems like SB-C, urn problems, or the external Doomsday Argument (Fig. 7). The self-applied Doomsday Argument never yields new information within WFG because:

  • Indexical facts do not directly constrain physical distributions – In WFG, the indexical distributions emerge from the independently updating provisional credences, shrinking the reference class and the power of the $1/N_{\rm total}$ likelihoods, averting solipsism.

  • Large worlds have more positions for specific individuals to be born, balanced by the uninformative weights – In UO and USO, a greater fraction of physical microhypotheses is consistent with the existence of any specific individual in a Large world. Yet the smaller weights accorded to each position prevent any ‘Presumptuous Philosopher’ problems.

  • Physical observations obey causality, limiting outcomes – KSO provides an explicit model where one cannot treat the birthrank of yourself, a specific individual, as a uniform random variable. Physical distributions in themselves are updated solely by physical observations, which are constrained by causality. If an individual like you can only be born at birthrank $10^{11}$, there is only one possible outcome, with likelihood 1.

The rejection of the Doomsday Argument itself does not negate the many potential threats to ourselves and the Earth that we know about, especially when considering anthropic effects (Leslie, Reference Leslie1996; Ćirković et al., Reference Ćirković, Sandberg and Bostrom2010). Evaluating these threats could itself shift our beliefs to a short future, but this must be done through normal scientific investigation of their natures.

Implications for Copernican arguments in astrobiology

If we accept WFG as a theory of typicality, then the same arguments against self-applied x-day Arguments work against self-applied Copernican Arguments in astrobiology. Instead of birthranks, we may have habitats, and we are interested in constraining the existence of as-yet unobservable beings in other locations. But the number of microhypotheses where one individual specifically exists on Earth will be independent of the number of inhabited locations. The existence of exotic forms of alien life multiplies the number of microhypotheses, but does not change the probability ratios.

Thus in WFG, we can conclude nothing about the existence of alien life in un-Earthlike locations from our own existence on Earth, as required by Hartle and Srednicki (Reference Hartle and Srednicki2007). We must either constrain them based on physical plausibility or by observation, just like any other astrophysical phenomenon.

Externally-applied x-day arguments in FGAI

Externally-applied x-day Arguments are structurally similar to ‘urn’ thought experiments, while seeming to also justify the self-applied Doomsday Argument by analogy. Under some circumstances, one can infer something about the lifespan distribution of an external phenomenon by drawing a sample of it at random (as in the most popularized examples of Gott Reference Gott1993). But the key difference with self-applied Doomsday arguments is the existence of an additional ‘observer’ that is the one carrying out the experiment, who is completely outside the population being constrained. This adds another physical degree of freedom, which changes the microhypothesis counts in FGAI to allow for a Doomsday Argument.

Suppose an observer X happens upon humanity, with individuals drawn from ${\cal H}$ according to IO, UO, USO, or KSO. All of the possible sequences of human individuals listed by the microhypotheses still are valid, and would have the same relative prior probabilities. Yet there is a final observable quantity, which is actually what X measures: the relative time t between humanity's start and X happening upon us. Even if the evolution of X is deterministic enough to demand they arrive at a specific time, X does not have enough information beforehand to determine a particular t; presumably such factors are microscopic and chaotic. Therefore, every single one of the microhypotheses of the previous section must now be divided into T even finer microhypotheses, one for each possible t interval X can arrive in.

Whether a Doomsday Argument applies depends on the selection process, in particular how X found humanity. If X could plausibly have arrived before or after humanity's existence, then T is both much larger than humanity's lifespan ${\cal T}_H$ (or its equivalent) and also independent of ${\cal T}_H$. This selection method is quite common: if one looks for examples of a rare phenomenon at a particular location, then most likely one will be searching in the vast epochs before or after it passes. It is in fact less of a coincidence for X to arrive at the right time to observe humanity if N is large, favouring a Large history – the fraction of surviving microhypotheses scales as $\sim {\cal T}_H/T$. This is well known in SETI, where the abundance of currently observable extraterrestrial intelligences scales directly with their mean lifespan in the Drake equation. Now suppose X measures the age of humanity. The fraction of microhypotheses consistent with X arriving at this particular epoch of human history (~1/T) is the same in Small and Large models. Thus there is no Doomsday Argument as such; X cannot constrain humanity's lifespan.Footnote 17

But what if there is no possibility of X failing to make an observation of humanity? One could suppose that rather than searching a particular location, X seeks out the nearest technological society, scouring the infinite cosmos if they have to until they find one. (We will also assume all such societies have exactly the same lifespan for the sake of argument.) This would be more like picking someone on the street to ask them their age. The only possible values of t are values within humanity's lifespan – each Large history theory has a larger number of possible outcomes. The microhypothesis counts in Table 1 are multiplied by N to account for the extra degree of freedom in t. Only $\sim 1/{\cal T}_H$ of them are consistent with any particular measurement of humanity's current age.

X may assume there are several possible locations they can arrive at, one for each moment in human history, and create a provisional physical distribution for each. These provisional physical distributions should be constructed as in SB-C: they should favour Small theories at a ratio proportional to $1/{\cal T}_H$. This is initially compensated by additional sets of microhypotheses in Large theories where X arrives at later times. Note that in each microhypothesis there is only one possible location for X, thus an indexical weight of 1 is applied to all standing microhypotheses. X learning that they have arrived at an early time rules out microhypotheses in Long theories where they arrive late in history, but the indexical weights do not shift. Thus, X now favours Small histories, and the externally-applied Bayesian Doomsday argument is in effect. Alternatively, X may work under the assumption that their location is fixed, and that it is the start of human history that is the free quantity, to get the same result.
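The net effect is equivalent to ordinary Bayesian updating with an arrival likelihood proportional to $1/{\cal T}_H$. Here is a minimal sketch, assuming X cannot fail to observe humanity, arrives uniformly at random within its lifespan, and compares two hypothetical lifespans with equal priors; all numbers are illustrative.

```python
# Sketch of the externally-applied argument when X is guaranteed to find humanity
# and arrives uniformly at random within its lifespan T_H (hypothetical values).
T_small, T_large = 1.0, 2.0             # hypothetical lifespans (arbitrary units)
prior = {"Small": 0.5, "Large": 0.5}
t_measured = 0.4                        # the age of humanity that X measures

def likelihood(T_H, t):
    return 1.0 / T_H if t <= T_H else 0.0   # uniform arrival within the lifespan

unnorm = {name: prior[name] * likelihood(T_H, t_measured)
          for name, T_H in (("Small", T_small), ("Large", T_large))}
total = sum(unnorm.values())
print({name: value / total for name, value in unnorm.items()})
# Small ends up with about 2/3 of the credence, Large with about 1/3.
```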

And what if X then announces their finding to Earth – should we agree that humanity's lifespan is short? Actually, we do not have enough information to tell. It would be strong evidence if X is the only entity carrying out a survey like this, or if there are one or more target societies per survey. If there are many such surveys/observers, however, then we expect an observer like X to show up frequently, even in the early epochs of a Large history. In fact, X being the first such observer implies our history is Long – otherwise, the multitudes of observers like X destined to choose our Earth would have had to squeeze into our short historical epoch before we went extinct.

These are not the only possible models for the time of X's arrival. Actual observations may deliberately be carried out at a specific time during its history, limiting the microhypotheses about t to a narrow range independent of ${\cal T}_H$ (as in the wedding example of Gott, Reference Gott1997). Furthermore, the age of the Universe sets an upper limit on the relative |t|. We cannot rule out astrophysical objects having lifespans of trillions of years – as we suspect most do (Adams and Laughlin, Reference Adams and Laughlin1997). Finally, the observational outcomes will be biased if the population varies with time. If we survey an exponentially growing population of human artefacts, for example, Gott (Reference Gott1993)'s Doomsday Argument necessarily underpredicts the lifespan.

Thus, an observation of a single ETI, for example, does give us information about the distribution of ETI properties (Madore, Reference Madore2010). We must be mindful of selection biases; in a volume-limited survey, we are more likely to observe a long-lived society, and we will only find life that we can recognize in places we look. In an unbiased survey, however, we can favour theories where a small sample is more typical.

Conclusion

Summary of WFG

I have developed WFG as a framework for interpreting typicality arguments. It shares elements with other frameworks: the auxiliary information about reference classes in Bostrom (Reference Bostrom2013), the use of extremely fine-grained data about observers in Neal (Reference Neal2006), and the distinction between indexical and physical questions from Srednicki and Hartle (Reference Srednicki and Hartle2010) and Garisto (Reference Garisto2020). It is based on two principles.

The first principle I have argued is that purely indexical facts cannot directly alter physical credence distributions. Indexical facts and physical facts are different types of data, and mixing them together into a joint prior leads to ‘Presumptuous Philosopher’ problems, biasing us towards Large worlds or Small worlds. The difference between indexicals and physical facts is elaborated by different variants of the Sleeping Beauty thought experiment, where the probability refers to different distributions. Yet both types of distributions always exist, even when perfect knowledge about the world or our location in it hides one. In WFG, the provisional credences are evaluated according to the likelihood of an observer at a specified position making a specific observation, a physical event. The indexical distribution emerges as a kind of ‘projection’ of these provisional credences, and the evaluated physical credences are a different projection using an averaged indexical distribution as weights.

The second principle is that high-level macrotheories about the world can be resolved into a multitude of physically-distinct microhypotheses. The consistency of evidence with each microhypothesis is calculated through physical distributions. In WFG, we do not necessarily know which observer we are, so there are provisional physical distributions for every possible trajectory we could be located along in a theory. These are averaged by the indexical weights, summarizing our current beliefs about our location. By defining reference classes by location rather than observer, observations in a large universe can nonetheless be constraining. Although updated separately, they are combined together to form a single physical credence and single indexical distribution for each microhypothesis. Our third-person credence in a macrotheory is then found by summing over all microhypotheses.

I have shown how WFG handles several problems involving ‘typicality’, avoiding fallacious Doomsday Arguments. Several issues remain. First, it is not immune to Bayesian shifts favouring Large theories (for example, in SB-D) unless ‘virtual observers’ are inserted into theories, trading physical credence with additional indexical weights. Further issues stem from the need to understand how indexical information is defined and handled. In particular, the indexical weights must be averaged between multiple microhypotheses, nominally all of them. Finally, the formulation in this paper assumes a finite number of possible observer locations. Nonetheless, it avoids solipsism and reduces to simple Bayesian updating when either indexical or physical data is fixed.

The role of the Copernican Principle in WFG

The Copernican insight that we are not unique miracles is deeply ingrained, but not all applications of the idea that ‘we are typical’ are defensible. The parable of the Noonday Argument describes a fallacious application of the principle, demonstrating the problem with the frequentist Doomsday Argument or trying to pick out ‘special’ points of history like its beginning. Other Copernican arguments can have more subtle flaws.

Bayesian typicality arguments about the distribution of observers rely on $1/N_{\rm total}$ likelihoods: the more possible outcomes there are, the less likely any specific one is observed. Although valid in some circumstances, applying this reasoning to rule out Large theories in general is unacceptable – it can lead to solipsism, in which you are the most typical being because you are the only being, negating science as an endeavour. Whereas the Copernican principle motivates us to consider the possibilities of a cosmos where we are living in but one world of many, unrestricted application leads to an epistemic arrogance where you decide your world is the only kind of thing that exists. Yet the mere existence of environments unlike ours already invalidates the strongest formulations of the Copernican argument: we are either atypical because we are not the only observers that exist, or we live in an atypical place because it contains observers unlike everywhere else in the Universe.

WFG is my attempt to deal with how to make inferences in a vast cosmos, and it provides two manifestations of the Copernican Principle. First is an indexical role as a prior, choosing the initial provisional credences for each microhypothesis. This role is not trivial because the indexical weights are averages of these credences across microhypotheses, which penalizes theories where our location is both rare and random.

The second manifestation of the Copernican Principle is embodied by our physical distributions. WFG fine-grains physical theories into microhypotheses describing the exact physical details of every possible outcome. These microhypotheses make no reference to indexical notions like ‘me’ or ‘us’, only specific physical observers who make specific physical observations at specific physical positions. Observer-relative typicality emerges from the combinatorial properties of these microhypotheses when there is symmetry with respect to position. In other cases, when specific individuals can only exist in certain locations, typicality can fail – this is the anthropic principle at work. Ultimately, the reason we may often assume ourselves to be typical is the uniformity of the physical laws of the cosmos, manifesting both in the symmetry and the panoply of microhypotheses.

Acknowledgements

I thank S. Jay Olson for comments on an earlier version of this paper. I also thank the referees for their comments. I acknowledge the Breakthrough Listen programme for their support. Funding for Breakthrough Listen research is sponsored by the Breakthrough Prize Foundation (https://breakthroughprize.org/).

Conflict of Interest

The author reports no conflicts of interest.

Appendix A. Explicit Doomsday problems in WFG

In this appendix, I work out explicit examples of Doomsday-type problems in WFG. Although extremely simple models are presented, they nonetheless illustrate several points: Doomsday failing for internal observers while working for an external observer, how to handle multiple ‘worlds’ of humanity, and how to treat hierarchical models in USO with unequal prior credence in the microhypotheses.

The Doomsday argument in a one-world USO model

This model consists of only one world (or, alternatively, a series of worlds that are exactly identical due to deterministic evolution). The world contains a short sequence of human observers. We wish to compare two extreme hypotheses for the distribution of $N_{\rm total}$. In the Small model, $N_{\rm total} = 1$, while in the Large model, $N_{\rm total} = 2$. These humans are drawn from two subsets, ${\cal H}_A = \{ A_a\}$ and ${\cal H}_B = \{ A_b,\; \, B_b\}$, of unequal size. This is a USO model: due to unknown historical contingencies, ${\cal H}_A$ humans can only be born at one of birthranks 1 or 2, and likewise for ${\cal H}_B$. In addition, an external alien observer X who knows all this deliberately seeks out this unique instance of humanity, though without knowing their arrival time beforehand. Each human observes which member of ${\cal H} \equiv {\cal H}_A \cup {\cal H}_B$ they are and measures their birthrank accurately, but cannot observe humans at the other birthrank. Likewise, if X is present at that time, they measure which human is present and what their birthrank is.

The microhypotheses in this problem are listed in Table 2. These microhypotheses indicate the birthrank order of ${\cal H}_A$ and ${\cal H}_B$, the exact sequence of individuals in the world, and the human observed by X, denoted by $\overline {Y}$. The arrival of X is a physical event and thus different locations of X are properly treated as physical microhypotheses. This example is somewhat complicated because ${\cal H}_A$ and ${\cal H}_B$ are of different sizes. Thus permutations with only ${\cal H}_B$ humans are more numerous than those with only ${\cal H}_A$ humans. The correct approach is to regard the ordering of a and b as intermediate-level parameters: it is just as likely that ${\cal H}_A$ has rank 1 as ${\cal H}_B$ has rank 1. Thus Small microhypotheses with only b humans have lower $P_{k, j}^{{\rm prior}}$ than those with only a humans for a balanced credence. In the Table, quantities referring to humans at different birthranks within a world are separated by a comma.

Internal observers – You are a human and you have worked out this model exactly. You wish to use a Doomsday Argument by determining which human in ${\cal H}$ you are and also your birthrank. To do this in WFG, you must establish provisional prior credences for each microhypothesis at each possible location or trajectory. Since you know you are human and not the alien, these locations are birthrank 1 and birthrank 2 in the Large model and only birthrank 1 in the Small model. Now, if you were already certain of which microhypothesis is true, you would want to have an uninformative indexical prior. Therefore, $p_{k, j; i}^{{\rm prior}} = P_{k, j}^{{\rm prior}}$. With the provisional prior credences in hand, you can calculate your indexical weights as ${\Xi }^{{\rm prior}}_1 = 2/3$ and ${\Xi }^{{\rm prior}}_2 = 1/3$ (equation (10)). Columns 5–8 then give likelihoods that a human at each position will make a given observation.

Now suppose you observe that you are $A_a$. Half of the provisional credence at each position is zeroed out because of incompatible microhypotheses. For the other half, the provisional credence is unchanged, since the likelihood of the observation is 1. The weights remain unchanged (${\Xi }^{{\rm pos}}_1 = 2/3$, ${\Xi }^{{\rm pos}}_2 = 1/3$). The credence in the Small theory updates to (1 × 1/4 + 2 × 1 × 0)/[(1 × 1/4 + 2 × 1 × 0) + (4 × (2/3 × 1/16 + 1/3 × 0) + 4 × (2/3 × 0 + 1/3 × 1/16))] = (1/4)/[(1/4) + (4/16)] = 1/2. Next, you happen to measure your birthrank to be 1. In all microhypotheses in both theories, this is only consistent with the observer having a birthrank of 1, so all provisional credences for birthrank 2 are zeroed out. The indexical weights update to ${\Xi }^{{\rm pos}}_1 = 1$ and ${\Xi }^{{\rm pos}}_2 = 0$. Using the likelihoods in the Table, the credence in the Small theory updates to (1 × 1/4 + 2 × 1 × 0)/[(1 × 1/4 + 2 × 1 × 0) + (4 × (1 × 1/16 + 0 × 0) + 4 × (1 × 0 + 0 × 0))] = (1/4)/[(1/4) + (4/16)] = 1/2. The Doomsday Argument has failed for $A_a$ because of the shifting indexical weights (Fig. 12).

Fig. 12. Illustration of the self-applied (top) and external (bottom) Doomsday Argument USO example in WFG when $A_a$ is observed at rank 1. The observed human is bold and underlined (self-observation) or overlined (external observation). For the self-applied argument, the distinctions about X's position are ignored, where each shown microhypothesis stands for one (Short) to two (Long) from Table 2. For the external argument, X's location is fixed. The case where the epoch of humanity's start is fixed proceeds similarly, with the two locations splitting the microhypotheses between them for their provisional distributions. This is an example where X can apply the Copernican Argument to make inferences – note that X has no possibility of failing to observe humanity in this example.

Table 2. Microhypotheses for one-world USO Doomsday problem

The Doomsday Argument also fails for $A_b$ (and $B_b$ by symmetry) if they attempt it. Upon discovering they are $A_b$, the weights remain unchanged (${\Xi }^{{\rm pos}}_1 = 2/3$ and ${\Xi }^{{\rm pos}}_2 = 1/3$) as does the credence in the Small theory, (1 × 0 + 1 × 0 + 1 × 1/8)/[(1 × 0 + 1 × 0 + 1 × 1/8) + (2 × (2/3 × 0 + 1/3 × 0) + 2 × (2/3 × 0 + 1/3 × 1/16) + 2 × (2/3 × 1/16 + 1/3 × 0) + 2 × (2/3 × 0 + 1/3 × 0))] = (1/8)/[(1/8) + (2/16)] = 1/2. If $A_b$ finds themself at birthrank 1, the Large theory indexical weights shift as the provisional credences update, and the credence in the Small theory is (1 × 0 + 1 × 0 + 1 × 1/8)/[(1 × 0 + 1 × 0 + 1 × 1/8) + (2 × (1 × 0 + 0 × 0) + 2 × (1 × 0 + 0 × 1/16) + 2 × (1 × 1/16 + 0 × 0) + 2 × (1 × 0 + 0 × 0))] = (1/8)/[(1/8) + (2/16)] = 1/2.
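As a cross-check on this arithmetic, the following minimal sketch replays the internal observer's updates. It uses my reconstruction of the microhypotheses (X's arrival is ignored, so each of the four Large sequences carries a prior of 1/8) and of the update rule described above: provisional priors equal the microhypothesis priors, indexical weights are the normalized sums of provisional credences at each rank, and each microhypothesis contributes its provisional credences weighted by the posterior weights renormalized over its occupied ranks. Names and structure are illustrative.

```python
from fractions import Fraction as F

micro = {   # name: (theory, prior, sequence of humans by birthrank)
    "S1": ("Small", F(1, 4), ("Aa",)),
    "S2": ("Small", F(1, 8), ("Ab",)),
    "S3": ("Small", F(1, 8), ("Bb",)),
    "L1": ("Large", F(1, 8), ("Aa", "Ab")),
    "L2": ("Large", F(1, 8), ("Aa", "Bb")),
    "L3": ("Large", F(1, 8), ("Ab", "Aa")),
    "L4": ("Large", F(1, 8), ("Bb", "Aa")),
}
ranks = (1, 2)
# Provisional credences p[name][rank], defined only at occupied ranks.
p = {name: {r: prior for r in ranks if r <= len(seq)}
     for name, (_, prior, seq) in micro.items()}

def weights():
    """Indexical weights: summed provisional credences, normalized over ranks."""
    raw = {r: sum(pj.get(r, F(0)) for pj in p.values()) for r in ranks}
    total = sum(raw.values())
    return {r: raw[r] / total for r in ranks}

def credences():
    """Evaluated theory credences, weighting each microhypothesis's occupied ranks."""
    w, cred = weights(), {"Small": F(0), "Large": F(0)}
    for name, (theory, _, _) in micro.items():
        occ = sum(w[r] for r in p[name])
        cred[theory] += sum((w[r] / occ) * p[name][r] for r in p[name])
    total = sum(cred.values())
    return {k: v / total for k, v in cred.items()}

def observe(consistent):
    """Zero out provisional credence wherever the observation is inconsistent."""
    for name, (_, _, seq) in micro.items():
        for r in p[name]:
            if not consistent(seq, r):
                p[name][r] = F(0)

print(weights(), credences())               # weights (2/3, 1/3); Small = Large = 1/2
observe(lambda seq, r: seq[r - 1] == "Aa")  # 'I am A_a'
print(weights(), credences())               # weights unchanged; Small still 1/2
observe(lambda seq, r: r == 1)              # 'my birthrank is 1'
print(weights(), credences())               # weights (1, 0); Small still 1/2
```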

External observers – You are the alien observer and you have worked out this model exactly. You wish to use a Doomsday Argument based on your observations of who is currently living and their birthrank. You are the only external observer. There are two possible relative locations where you may observe humanity, the two possible birthranks. You may use Earth-relative locations, in which case there are two possible positions. Alternatively, you may assume that your location is fixed at x, and these positions correspond to hypotheses about the epoch of humanity's origin. The different possible locations of X – a physical distinction – also correspond to different groups of microhypotheses, each with only one indexical weight (Fig. 12). With only one possible position per microhypothesis, $p_{k, j; i}^{{\rm prior}} = P_{k, j}^{{\rm prior}}$ in each case. Columns 9–12 give likelihoods for the observations you may make.

Suppose you treat humanity's origin as uncertain while your own location is fixed, and your sample reveals $A_a$ from ${\cal H}_A$. There is only one indexical weight, and one provisional physical distribution following the $P_{k, j}^{{\rm prior}}$ listed in the Table. WFG reduces to Bayesian updating in this case. The observation of $A_a$ in the sample does not by itself affect your credence in the Small theory: (1/4 + 2 × 0)/[(1/4 + 2 × 0) + (4 × 1/16 + 12 × 0)] = (1/4)/[(1/4) + (4/16)] = 1/2. But if you measure $A_a$'s birthrank to be 1, that credence updates to (1/4 + 2 × 0)/[(1/4 + 2 × 0) + (2 × 1/16 + 14 × 0)] = (1/4)/[(1/4) + (2/16)] = 2/3. The same results are found if you observe $A_b$ at rank 1: first, a Small credence of (0 + 1/8 + 0)/[(0 + 1/8 + 0) + (2 × 1/16 + 14 × 0)] = (1/8)/[(1/8) + (2/16)] = 1/2 when you observe $A_b$, updated to (0 + 1/8 + 0)/[(0 + 1/8 + 0) + (1 × 1/16 + 15 × 0)] = (1/8)/[(1/8) + (1/16)] = 2/3 when measuring $A_b$'s birthrank of 1. For you, the Doomsday Argument is validated by WFG.
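The external case thus reduces to ordinary Bayesian updating over the rows of Table 2. A short sketch using my reconstruction of those rows (each Large human sequence split into two rows according to the rank at which X arrives) recovers the credences quoted above; the labels are illustrative.

```python
from fractions import Fraction as F

# Rows: (theory, prior, human X observes, that human's birthrank) -- my reconstruction
# of the external-observation structure of Table 2, for illustration only.
rows = [("Small", F(1, 4), "Aa", 1),
        ("Small", F(1, 8), "Ab", 1),
        ("Small", F(1, 8), "Bb", 1)]
for seq in (("Aa", "Ab"), ("Aa", "Bb"), ("Ab", "Aa"), ("Bb", "Aa")):
    for arrival_rank in (1, 2):
        rows.append(("Large", F(1, 16), seq[arrival_rank - 1], arrival_rank))

def small_credence(keep):
    """Credence in Small after keeping only the rows consistent with the observation."""
    cred = {"Small": F(0), "Large": F(0)}
    for theory, prior, human, rank in rows:
        if keep(human, rank):
            cred[theory] += prior
    return cred["Small"] / (cred["Small"] + cred["Large"])

print(small_credence(lambda h, r: h == "Aa"))             # 1/2 after observing A_a
print(small_credence(lambda h, r: h == "Aa" and r == 1))  # 2/3 after also measuring rank 1
```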

If you instead treat humanity's origin as fixed while your own location is uncertain, these same conclusions are reached. There are two indexical weights, but only one of them is in ${\cal O}_{k, j}$ for each microhypothesis, since each microhypothesis specifies which location you observe. Thus all the relevant weights remain 1, and an observation of $A_a$ at rank 1 leads to a credence in Small of 2/3.

The difference between the internal and external cases results from two factors. First, there is an extra temporal degree of freedom for X's arrival. Second, X's position is entirely fixed in each microhypothesis – knowing they are the external observer, X can predict exactly who they will observe simply based on time of arrival, since the time of arrival is entirely a physical statement. In contrast, there is no single ‘time of observation’ for the humans in the Large model, since one observes early and one observes late. For the internal case, all microhypotheses are present in the Long provisional physical distributions, because there are observers at all locations. A priori, the internal observer places equal provisional prior credence in the Short and Long theories for the rank 1 positions. But in the external case, the rank 1 and rank 2 positions constrain completely different Long microhypotheses, splitting the Long prior credence between them. A priori, the external observer places twice as much provisional prior credence in Short than in Long for the rank 1 positions. This is illustrated in Fig. 12.

A case with two worlds

In a large enough universe, there are likely to be multiple instances of humanity (or technological societies in general). These may have varying properties, increasing the number of microhypotheses exponentially. A larger number of worlds makes it more likely that any particular viable world in a theory is realized, but otherwise the worlds may not be in contact with each other. To demonstrate how to handle cases with more than one world, I present a simple indistinguishable observer (IO) model with two worlds. These are labelled i and ii, with positions/trajectories consisting of a world and a birthrank. This time there will be three possible distributions for $N_{\rm total}$ that the observers will try to compare: a Small theory in which all worlds have $N_{\rm total} = 1$; a Large theory in which all worlds have $N_{\rm total} = 2$; and an Intermediate (I) theory in which each world has an even probability of being small or large, independently of the other. All three macrotheories start with equal credence ($P^{\rm prior}(S) = P^{\rm prior}(I) = P^{\rm prior}(L) = 1/3$). Because of the indistinguishability of the observers, there is only one possible pair of sequences each in the Small and Large theories, and four in the Intermediate theory corresponding to the four possible choices for $N_{\rm total}({\rm i})$ and $N_{\rm total}({\rm ii})$. Table 3 lists all the microhypotheses in this scenario, with ‖ separating the observables and likelihoods of worlds i and ii.

Table 3. Microhypotheses for two-world IO Doomsday problem

Internal observers – If you are a human who knows all of this but are otherwise ignorant of your location and $N_{\rm total}$ for your society, there are now four possible starting locations you could occupy: world i at ranks 1 (i,  1) and 2 (i,  2), and world ii at ranks 1 (ii,  1) and 2 (ii,  2). In the Small theory, only the rank 1 starting locations are occupied, and in the Large, all are occupied. In the Intermediate theory, none, one, or both of the rank 2 starting locations may be occupied. Within each microhypothesis, you start with an uninformative indexical distribution, with all $p_{k, j; i}^{{\rm prior}} = P_{k, j}^{{\rm prior}}$. By summing the provisional credences, you find prior weights ${\Xi }^{{\rm prior}}_{( {\rm i}, 1) } = {\Xi }^{{\rm prior}}_{( {\rm ii}, 1) } = 1/3$ and ${\Xi }^{{\rm prior}}_{( {\rm i}, 2) } = {\Xi }^{{\rm prior}}_{( {\rm ii}, 2) } = 1/6$.

Suppose you learn your birthrank is 1. Now all provisional credences for (i,  2) and (ii,  2) are zero, because observers at these positions necessarily have birthranks of 2. However, the provisional credences for the (i,  1) and (ii,  1) are untouched. There remains a complete symmetry between worlds i and ii, with ${\Xi }^{{\rm pos}}_{( {\rm i}, 1) } = {\Xi }^{{\rm pos}}_{( {\rm ii}, 1) } = 1/2$, as nothing you have learned leads you to favour one over the other. Furthermore, (i,  1) and (ii,  1) are always within ${\cal O}_{k, j}$ for every microhypothesis, since every inhabited world necessarily has at least one observer. Therefore, the sum of ${\Xi }^{{\rm pos}}_i p_{k, j; i}$ over all trajectories is unchanged. There is no Bayesian shift; $P^{\rm pos}(S) = P^{\rm pos}(I) = P^{\rm pos}(L) = 1/3$ (Fig. 13).

Fig. 13. Two world example of the IO Doomsday problem. Each microhypothesis lists the sequence of realized birthranks, with a double vertical bar separating the sequences for worlds i and ii. The Small, Intermediate, and Large theories are compared after observing a human at rank 1.

External observers – Each microhypothesis specifies the location that X observes, and thus has a single provisional credence associated with it. When X samples an observer and finds their birthrank to be 1, all microhypotheses where X observes rank 2 have zero evaluated credence. This removes half the total credence in the Large theory, and (1/4) × (0 + 1/3 + 1/3 + 1/2) = 7/24 of the credence in the Intermediate theory. As a result, the Short theory has twice the credence of the Long theory.

The credences depend slightly on the number of worlds, because of the variance in $N_{\rm total}$ among the different Intermediate microhypotheses. Leaving all other model parameters fixed, the effective likelihood of the Intermediate distribution is

$${1\over 2^{{\cal N}_w}} \sum_{n = 0}^{{\cal N}_w} {{\cal N}_w \choose n} {{\cal N}_w\over {\cal N}_w + n},\;$$

which decreases from 3/4 when ${\cal N}_w = 1$ to 2/3 as ${\cal N}_w \to \infty$.
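A quick numerical check of this expression (illustrative only), confirming the decline from 3/4 towards 2/3:

```python
from math import comb

def intermediate_likelihood(n_worlds):
    """Effective likelihood of the Intermediate theory for the external observer."""
    return sum(comb(n_worlds, n) * n_worlds / (n_worlds + n)
               for n in range(n_worlds + 1)) / 2 ** n_worlds

for n_worlds in (1, 2, 5, 20, 200):
    print(n_worlds, round(intermediate_likelihood(n_worlds), 4))
# 1 -> 0.75, 2 -> 0.7083, ..., approaching 2/3 as the number of worlds grows
```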

The anthropic shadow and the number of worlds

Suppose the rank 1 humans in the previous model live long enough to see the next generation being born, if it exists. Now, because they already have measured their birthranks, and because the weights actually parameterize temporal trajectories, the weights cannot shift any further without learning which world they are in. If such an observer witnesses the birth of rank 2, then the provisional credences of those microhypotheses where $N_{\rm total} = 1$ for the world the observer is located in update to 0. This results in an effective likelihood that equals the fraction of worlds that survive to $N_{\rm total} = 2$, which is 0 in the Small theory, 1/2 in the Intermediate theory, and 1 in the Large theory, with $P^{\rm pos}(I) = 1/3$ and $P^{\rm pos}(L) = 2/3$ (Fig. 14). Indeed, this holds regardless of the number of worlds due to the underlying symmetry, with all worlds having a rank 1 human.

Fig. 14. The anthropic shadow effect in WFG, demonstrated by the IO model, with one (top) and two (bottom) worlds. When an early human's posterior is adopted, the survival of their world to rank 2 produces a relatively strong update (left) compared with the case in which the later human uses an independent posterior (right). Furthermore, the later human's independent posterior constraints on the Intermediate theory weaken as the number of worlds increases.

Now, perhaps a rank 2 human adopts the rank 1 human's posterior as their own prior, if they see themselves as continuing a scientific programme initiated by their ancestors. But it is also possible that a rank 2 human starts from scratch, ignoring or ignorant of the ‘result’ of the predecessor in their world. If they begin with uncertainty about their starting location but otherwise aware of the setup of the model, they too begin with ${\Xi }^{{\rm prior}}_{( {\rm i}, 1) } = {\Xi }^{{\rm prior}}_{( {\rm ii}, 1) } = 1/3$ and ${\Xi }^{{\rm prior}}_{( {\rm i}, 2) } = {\Xi }^{{\rm prior}}_{( {\rm ii}, 2) } = 1/6$. Upon measuring their birthrank of 2, these update to ${\Xi }^{{\rm pos}}_{( {\rm i}, 1) } = {\Xi }^{{\rm pos}}_{( {\rm ii}, 1) } = 0$ and ${\Xi }^{{\rm pos}}_{( {\rm i}, 2) } = {\Xi }^{{\rm pos}}_{( {\rm ii}, 2) } = 1/2$. All of the indexical weight is in rank 2 starting locations. In microhypotheses where there is at least one large world, all of the weight is placed in provisional credences that are unscathed by the birthrank measurement. Thus, the effective likelihood for any microhypothesis with at least one long-lived world is 1. The effective likelihood of the Intermediate theory is $1 - 1/2^{{\cal N}_w}$, which approaches 1 as ${\cal N}_w \to \infty$ (Fig. 14).

This is a demonstration of the anthropic principle in WFG. Later observers necessarily only are born in worlds long-lived enough to host them, even if such worlds are rare. Unless they adopt their ancestor's posterior, they draw no conclusions in an infinite universe from their birthrank except that the probability of their own existence in any given world is nonzero.

Indeed, we can imagine this scenario playing out in humanity's future if we regard the ‘observers’ not as individuals but as entire species. Suppose our current society collapses in the next few centuries, with humanity reduced to a few thousand survivors, all records of our science vanishing. A million years from now, an intelligent posthuman species evolves from our scattered descendants. They come to realize that their ancestors built a technological society comparable to or exceeding their own. Should they conclude that humanity was destined to survive the catastrophe after all instead of going extinct? Not if they believe in a large universe. As per the anthropic shadow of Ćirković et al. (Reference Ćirković, Sandberg and Bostrom2010), the remnants of humanity surviving is a necessary condition for their existence. In a large universe, for every Earth where humanity hangs on after such a catastrophe, there could be a billion where it perished, or practically none – to the posthuman descendants of the survivors, the result will look much the same.

Footnotes

1 Bayes’ theorem can be adapted to continuous parameters by replacing P prior and P pos with probability distributions and the sum in the denominator with an integral over all possible parameter values.

2 Neal (Reference Neal2006) briefly discusses a case related to x = 1, with a deathrank motivated by a hypothetical asteroid impact.

3 Hartle and Srednicki (Reference Hartle and Srednicki2007) argue the unrestricted SSA is absurd because we could rule out aliens on Jupiter this way. But a proponent could argue that this example counts as a success: our modern evidence is consistent with Jupiter being uninhabited, the majority view now.

4 The SSSA does allow there to be multiple copies of you, exactly identical to yourself, since you have no way of telling which of these selves you are, but that is hardly any better.

5 To emphasize the potential for unrealistic Bayes factors, consider the case where you are immortal and the Long version lasts for a trillion days. If you use the SIA, could you ever be convinced the experimenters are truthful if they come in and tell you the coin landed on Heads?

6 This paper uses thought experiments where there is a simple binary choice between Small and Large models, for which 1/2 is the uninformative prior probability. If there are many choices for $N_{\rm total}$ spanning a wide range of values, a flat log prior in $N_{\rm total}$ is more appropriate.

7 Garisto (Reference Garisto2020) notes the distinction between an ‘inclusiverse’ like SB-A where all possibilities exist and an ‘exclusiverse’ like SB-B and SB-C where only some do, arguing there is a weighting factor in inclusive selections that changes the prior.

8 This is a toy model. In reality, one would need a very rigidly defined mechanism with no possibility for unanticipated factors (e.g. directed panspermia) to calculate a probability this low. A modified version of L-B may very well yield higher probabilities.

9 Leslie (Reference Leslie1996) takes the position that indeterminism blunts SSA-like arguments, but only if the result has not been decided yet, because it is obvious the probability of an indeterministic future event like a dice throw cannot be affected by who we are. I believe this attempt to soften Doomsday fails, because we are considering the probability conditionalized on you being you, an individual whose possible location is contingent on those indeterministic events.

10 To use a loose analogy, the combination of indexical weights and provisional physical distributions can be compared to density matrices in quantum mechanics. Density matrices are generally needed to describe the mixed state of a part of a system. Metaphorically speaking, WFG is needed when an observer sees only a part of the consequences of a microhypothesis and no single pure distribution can describe credences.

11 Additionally, the weights are assigned to individual locations rather than indexical probability distributions. Srednicki and Hartle (Reference Srednicki and Hartle2010) first proposed that we test indexical distributions (called xerographical distributions) by comparing predictions in physical theories with evidence. The problem is that an indexical distribution expressing uncertainty in our location can be consistent with evidence even when we know our exact location, as pointed out by Friederich (Reference Friederich2017). WFG avoids this issue because indexical locations themselves are weighted – the xerographical distributions associated with each weight are orthogonal.

12 This does not solve the Boltzmann brain problem in general. First, Boltzmann brains may so outnumber us ($2\eta^2 N_B \gg N_E$ in this example) that the credence shift is negligible (Dyson et al., Reference Dyson, Kleban and Susskind2002). The measure problem is the lack of consensus on how these probabilities should be calculated in multiverses (e.g. Bousso et al., Reference Bousso, Freivogel and Yang2008; Freivogel, Reference Freivogel2011). Second, our memories of even a few moments ago are probably unreliable if we are Boltzmann brains. Boltzmann brains who ‘remember’ observing the world a few seconds ago and thus conclude it is stable may outnumber the evolved observers (Carroll, Reference Carroll, Dasgupta, Dotan and Weslake2017). The implication is that we should reset our cosmological priors every moment. All we can say is that if some distribution, apparently dating back to some past date, is valid, it is updated to another distribution given our current observations.

13 The Doomsday Argument yields a ‘correct’ result for most people in history, leading Leslie (Reference Leslie1996) (and with more caveats, Bostrom Reference Bostrom2013) to conclude that we should use it as well, as should have early humans. This logic fails in the KSO model. Of course, the Doomsday Argument works for most people by construction, but in KSO, it always fails for early humans in their Large future theories. We are not interested in whether Doomsday ‘works’ for most people in history, but whether it applies to us specifically.

14 All measurements are physical events. When someone learns an indexical fact, they are actually interacting with a physical environment that is location-dependent and changing as a result. Therefore, a truly physical theory cannot regard observers as identical if they have different indexical knowledge; observers with different (and reliable) indexical knowledge are necessarily found at different locations, which makes USO and KSO more physically grounded. For example, hypotheses about individual A of Figure 10 really should be fine-grained into hypotheses about $A_1$, who can only exist at rank 1, and $A_2$, who can only exist at rank 2, and so on. UO or IO may be regarded as coarse-grainings of these hypotheses.

15 A having more locations to be born is a physical statement about the number of actual physically occurring trials, with each outcome being a physically distinct history (contra the ‘souls waiting to be born’ characterization of SIA in Leslie, Reference Leslie1996; Bostrom, Reference Bostrom2013).

16 Actually, USO can be much more complicated if the different ${\cal H}_r$ have different sizes. Then the fraction of surviving microhypotheses (and posterior probabilities) is N-dependent if each possible permutation of humans is given equal initial credence within each macrotheory (e.g. if N = 1, ${\cal H}_A = \{ A_a\}$, and ${\cal H}_B = \{ A_b,\; \, B_b\}$, then the microhypotheses of $A_a$ only, $A_b$ only, and $B_b$ only are given equal credence). A more natural weighting is to first divide each macrotheory into ‘mesohypotheses’ about the ordering of the ${\cal H}_r$, and each permutation microhypothesis inherits an equal fraction of its mesohypothesis's credence (e.g. $A_a$ only would get twice as much initial credence as $A_b$ only).

17 For this reason, if we specifically observe Proxima b and discover a very young technological society, we are not justified in concluding they typically live for a short period before intelligence goes extinct on a planet forever, because of the sheer improbability of making that discovery in any theory. We could legitimately suspect that every planet gives rise to a long procession of many such short-lived societies, however.

The order column lists the relative order of ${\cal H}_A$ and ${\cal H}_B$, with ab meaning ${\cal H}_A = {\cal H}_1$ and ${\cal H}_B = {\cal H}_2$ and ba meaning the reverse. The μ column lists the microhypotheses: each possible sequence of humans drawn from ${\cal H}_A$ and/or ${\cal H}_B$, along with which human is observed by the external observer X. The human observation likelihoods indicate the probability that an observer o observes themself to be a particular human ($A_a$ or $A_b$), possibly at a specified rank, according to each microhypothesis. The external observation likelihoods indicate the probability that X observes a particular human when they arrive, and possibly measures their birthrank to be 1.

Each microhypothesis (listed under μ) consists of two sequences of observers, one for each world, separated by a ‖. The worlds are indexed i and ii. Under human observation likelihoods, o @ (w,  1) indicates the probability that each human in the sequences observes themself to have rank 1, while o@(w,  2) indicates the probability they observe themself to have rank 2. o @ (w,  1) → 2 is the probability that an immortal rank 1 human observes the world surviving long enough for rank 2 to be born.

References

Adams, FC and Laughlin, G (1997) A dying universe: the long-term fate and evolution of astrophysical objects. Reviews of Modern Physics 69, 337–372.
Albrecht, A and Sorbo, L (2004) Can the universe afford inflation? Physical Review D 70, eid063528.
Barrow, JD (1983) Anthropic Definitions. Quarterly Journal of the Royal Astronomical Society 24, 146.
Benétreau-Dupin, Y (2015) Blurring out cosmic puzzles. Philosophy of Science 82, 879–891.
Bostrom, N (2001) The Doomsday Argument, Adam & Eve, UN++, and Quantum Joe. Synthese 127, 359–387.
Bostrom, N (2002) Self-locating belief in big worlds: Cosmology's missing link to observation. The Journal of Philosophy 99, 607–623.
Bostrom, N (2003) Astronomical waste: The opportunity cost of delayed technological development. Utilitas 15, 308–314.
Bostrom, N (2007) Sleeping Beauty and self-location: A hybrid model. Synthese 157, 59–78.
Bostrom, N (2013) Anthropic bias: Observation selection effects in science and philosophy. New York: Routledge.
Bostrom, N and Ćirković, MM (2003) The Doomsday argument and the self-indication assumption: reply to Olum. The Philosophical Quarterly 53, 83–91.
Bousso, R (2002) The holographic principle. Reviews of Modern Physics 74, 825–874.
Bousso, R, Freivogel, B and Yang, IS (2008) Boltzmann babies in the proper time measure. Physical Review D 77, eid103514.
Bracewell, RN (1960) Communications from Superior Galactic Communities. Nature 186, 670–671. doi:10.1038/186670a0
Carroll, SM (2017) Why Boltzmann Brains Are Bad. In Dasgupta, S, Dotan, R and Weslake, B (eds), Current Controversies in Philosophy of Science. New York: Routledge, pp. 7–20.
Carter, B (1974) Large number coincidences and the anthropic principle in cosmology. In Longair, MS (ed), Confrontation of Cosmological Theories with Observational Data, International Astronomical Union Symposium vol. 63. Dordrecht, Holland; Boston, USA: Reidel, pp. 291–298.
Carter, B (1983) The Anthropic Principle and its Implications for Biological Evolution. Philosophical Transactions of the Royal Society of London Series A 310, 347–363. doi:10.1098/rsta.1983.0096
Ćirković, MM (2004) Forecast for the Next Eon: Applied Cosmology and the Long-Term Fate of Intelligent Beings. Foundations of Physics 34, 239.
Ćirković, MM (2004) Is Many Likelier than Few? A Critical Assessment of the Self-Indicating Assumption. Epistemologia 27, 265–298.
Ćirković, MM and Balbi, A (2020) Copernicanism and the typicality in time. International Journal of Astrobiology 19, 101–109.
Ćirković, MM, Sandberg, A and Bostrom, N (2010) Anthropic shadow: observation selection effects and human extinction risks. Risk Analysis: An International Journal 30, 1495–1506.
Crowe, MJ (1999) The Extraterrestrial Life Debate 1750–1900: The idea of a plurality of worlds from Kant to Lowell. Mineola, NY, USA: Dover Publications.
Dayal, P, Cockell, C, Rice, K and Mazumdar, A (2015) The Quest for Cradles of Life: Using the Fundamental Metallicity Relation to Hunt for the Most Habitable Type of Galaxy. Astrophysical Journal Letters 810, eidL2.
Dieks, D (1992) Doomsday–or: The dangers of statistics. The Philosophical Quarterly 42, 78–84.
Dyson, FJ (1960) Search for Artificial Stellar Sources of Infrared Radiation. Science 131, 1667–1668. doi:10.1126/science.131.3414.1667
Dyson, L, Kleban, M and Susskind, L (2002) Disturbing Implications of a Cosmological Constant. Journal of High Energy Physics 2002, eid011.
Elga, A (2000) Self-locating belief and the Sleeping Beauty problem. Analysis 60, 143–147.
Forgan, DH and Nichol, RC (2011) A failure of serendipity: the Square Kilometre Array will struggle to eavesdrop on human-like extraterrestrial intelligence. International Journal of Astrobiology 10, 77–81.
Freivogel, B (2011) Making predictions in the multiverse. Classical and Quantum Gravity 28, eid204007.
Friederich, S (2017) Resolving the observer reference class problem in cosmology. Physical Review D 95, eid123520.
Garisto, R (2020) How to select observers. Physical Review Research 2, eid033464. doi:10.1103/PhysRevResearch.2.033464
Garriga, J and Vilenkin, A (2008) Prediction and explanation in the multiverse. Physical Review D 77, eid043526.
Gott, JRI (1993) Implications of the Copernican principle for our future prospects. Nature 363, 315–319. doi:10.1038/363315a0
Gott, JR (1997) A grim reckoning. New Scientist, pp. 36–39.
Griffin, DR and Speck, GB (2004) New evidence of animal consciousness. Animal Cognition 7, 5–18.
Haqq-Misra, J, Kopparapu, RK and Wolf, ET (2018) Why do we find ourselves around a yellow star instead of a red star? International Journal of Astrobiology 17, 77–86.
Hartle, JB and Srednicki, M (2007) Are we typical? Physical Review D 75, eid123523.
Jefferys, WH and Berger, JO (1991) Sharpening Ockham's razor on a Bayesian strop. Technical Report 91-44C. Department of Statistics, Purdue University.
Kaneda, T and Haub, C (2020) How many people have ever lived on Earth? https://www.prb.org/howmanypeoplehaveeverlivedonearth/.
Kardashev, NS (1964) Transmission of Information by Extraterrestrial Civilizations. Soviet Astronomy 8, 217.
Knobe, J, Olum, KD and Vilenkin, A (2006) Philosophical implications of inflationary cosmology. The British Journal for the Philosophy of Science 57, 47–67.
Kopf, T, Krtous, P and Page, DN (1994) Too Soon for Doom Gloom? Preprint, arXiv:gr-qc/9407002.
Korb, KB and Oliver, JJ (1998) A refutation of the Doomsday argument. Mind 107, 403–410.
Laplace, PS (1902) A philosophical essay on probabilities (F.W. Truscott and F.L. Emory, Trans.). http://www.gutenberg.org/ebooks/58881.
Leslie, J (1996) The end of the world: the science and ethics of human extinction. New York: Routledge.
Madore, BF (2010) Sigma One. Astronomical Journal 139, 2052–2055.
Monton, B and Roush, S (2001) Gott's Doomsday argument. http://philsci-archive.pitt.edu/1205/1/gott1f.pdf.
Neal, RM (2006) Puzzles of Anthropic Reasoning Resolved Using Full Non-indexical Conditioning. Preprint, arXiv:math/0608592.
Nielsen, HB (1989) Random dynamics and relations between the number of fermion generations and the fine structure constants. Acta Physica Polonica, Series B 20, 427–468.
Olum, KD (2002) The Doomsday argument and the number of possible observers. The Philosophical Quarterly 52, 164–184.
Olum, KD (2004) Conflict between anthropic reasoning and observation. Analysis 64, 1–8.
Sagan, C (1973) On the Detectivity of Advanced Galactic Civilizations. Icarus 19, 350–352. doi:10.1016/0019-1035(73)90112-7
Srednicki, M and Hartle, J (2010) Science in a very large universe. Physical Review D 81, eid123524.
Tarter, J (2001) The Search for Extraterrestrial Intelligence (SETI). Annual Review of Astronomy and Astrophysics 39, 511–548.
Vilenkin, A (1995) Predictions from Quantum Cosmology. Physical Review Letters 74, 846–849.
Whitmire, DP (2020) The habitability of large elliptical galaxies. Monthly Notices of the Royal Astronomical Society 494, 3048–3052.
Zackrisson, E, Calissendorff, P, González, J, Benson, A, Johansen, A and Janson, M (2016) Terrestrial Planets across Space and Time. Astrophysical Journal 833, 214.
Fig. 1. The Sleeping Beauty variants SB-A, SB-B, and SB-C, illustrating how different theories of typicality handle Bayesian credence, before and after learning it is Monday (m) instead of Tuesday (t). Ruled out hypotheses are coloured in black and do not count towards the normalization. The SSA, with or without the SIA, leads to presumptuous conclusions in SB-B. In FGAI, indexical and physical distributions are not mixed. Instead, there is an overarching physical distribution, and each model has an associated indexical probability distribution (indicated by the arrows).

Fig. 2. SB-D is a variant of Sleeping Beauty that is challenging for theories with separate indexical and physical distributions. More outcomes are instantiated in the Long theory than in either Short microhypothesis, thus seemingly favouring the Long theory unless the SSA is applied.

Fig. 3. If the Sleeping Beauty experiment is run many times, with each possible sequence of Long and Short a priori equally likely, a vast number of microhypotheses about the sequence of Short and Long is generated. Shown here are the 64 microhypotheses when there are n = 6 runs. Attached to every single one of these fine models is an indexical distribution.

Fig. 4. Extremely simplified representation of how FGAI treats theories like L-B. Probabilities of observing a particular outcome in a macrotheory may be presumed to result from some unknown fine-scale division of parameter space into regions delineating different equivalence classes. Then microhypotheses would be constructed by considering all possible outcomes for all regions. For L-B, we observe the red dwarf nearest to Earth (Proxima $\oplus$) and see whether it is inhabited (flower symbol). The observation of an inhabited Proxima $\oplus$ reduces the number of allowed microhypotheses. Left: observers on other planets distinguishable from Earth would observe red dwarfs with different properties (Proxima X, Y, and Z) and probe different classes. Right: many identical Earths observe distinct Proxima $\oplus$ (red stars). These are treated by assuming there is some indexing that allows microhypotheses to be constructed, and their likelihoods calculated by symmetry.

Fig. 5. The indication paradox of SB-B′, and how it is treated with different modifications of FGAI. Each box, representing the credence in a microhypothesis, lists the observed outcomes, ordered by index in the used coordinate system. An underlined, bold outcome is treated as the one the observer measures for the purposes of calculating likelihoods. In the Weighted Fine Graining treatment, the normalized weights applied to each microhypothesis are given to the left of each provisional physical credence.

Fig. 6. Treatment of the ‘urn problems’ U-I and U-II in WFG. In U-I (top), there is one urn. We and one other participant try to determine if it is Small or Large by drawing one ball with replacement. The different indices correspond to the order in which the participants draw. In U-II (bottom), there are two urns at different locations. We and one other participant are assigned randomly to the different urns and try to determine which urn location we are at based on that draw.

Fig. 7. Treatment of the fundamental variations of the Sleeping Beauty problem in WFG. In SB-A, only the indexical weights shift, while in SB-C, the physical credence distributions entirely drive the conclusion. In SB-B, the indexical information shifts the indexical weights assigned to each provisional distribution, ensuring that neither the Short nor Long distributions are favoured.

Fig. 8. Illustration of an indication paradox resulting from SB-D. Whether we are told it is Monday or Tuesday, we seemingly favour the conclusion we are in the Long experiment (top). Virtual observers can be introduced to prevent this paradox (bottom).

Fig. 9. Minimal example of Boltzmann brain problem treated in WFG with $N_b = 2$ and $N_e = 1$. Both the false and true cosmologies have two Boltzmann observers identical to ourselves; in this case, each with an equal probability of being long-lived (ℓ) or decaying in a short time (s). The true cosmology also has an evolved observer E that always observes it correctly. An observation of T leaves the e provisional physical distribution untouched and also shifts indexical weight to e, favouring T.

Fig. 10. Different fine-grainings of the x-day Argument have different assumptions about the interchangeability of humans. The clouds represent an unknown or random selection from a set. The shown configurations are chosen so that observer A (blue circle) exists with rank 1, but this is a contingent selection in the models.

Table 1. Microhypothesis counts for self-applied x-day arguments in single-world models

Fig. 11. Illustration of how a self-applied Doomsday Argument fails in the KSO schema according to WFG. Two models are compared, with $N_S = 1$, $N_L = 2$, ${\cal H}_1 = \{ A_1,\; \, B_1\}$, and ${\cal H}_2 = \{ A_2,\; \, B_2\}$. The combinatorial properties of the microhypotheses, aided by the shifting indexical weights, ensure the prior is unaffected by an observer learning they are $A_1$ at rank 1.

Fig. 12. Illustration of the self-applied (top) and external (bottom) Doomsday Argument USO example in WFG when $A_a$ is observed at rank 1. The observed human is bold and underlined (self-observation) or overlined (external observation). For the self-applied argument, the distinctions about X's position are ignored, where each shown microhypothesis stands for one (Short) to two (Long) from Table 2. For the external argument, X's location is fixed. The case where the epoch of humanity's start is fixed proceeds similarly, with the two locations splitting the microhypotheses between them for their provisional distributions. This is an example where X can apply the Copernican Argument to make inferences – note that X has no possibility of failing to observe humanity in this example.

Table 2. Microhypotheses for one-world USO Doomsday problem

Table 3. Microhypotheses for two-world IO Doomsday problem

Fig. 13. Two-world example of the IO Doomsday problem. Each microhypothesis lists the sequence of realized birthranks, with a double vertical bar separating the sequences for worlds i and ii. The Small, Intermediate, and Large theories are compared after observing a human at rank 1.

Fig. 14. The anthropic shadow effect in WFG, demonstrated by the IO model, with one (top) and two (bottom) worlds. When an early human's posterior is adopted, the survival of their world to rank 2 has a relatively strong update (left) compared with the case where the later human uses an independent posterior (right). Furthermore, the later human's independent posterior constraints on the Intermediate theory weaken as the number of worlds increases.