1. Introduction
A finding of guilt in criminal trials requires proof beyond a reasonable doubt (BARD). This gold standard of proof, the law’s strictest, is widely known but poorly understood. Attempts to make the standard more precise have met with little success. Recently, attempts to replace it have been proposed.Footnote 1
An underlying issue in clarifying legal proof – both generally and for BARD, in particular – concerns the extent to which it is best understood in terms of mathematical probabilities. Can the strength of the evidence offered at trial be quantified? Is a justified finding of guilt supposed to be reflected by this quantity? Is there a threshold probability, above which guilt is proven BARD?
I offer an account of BARD, compatible with Subjective Bayesianism. The account does not reduce BARD to a threshold probability of guilt: critics of that reduction, I believe, are correct. Nor will I defend the view that naked statistical evidence suffices for conviction, a view that plausibly follows from the acceptance of threshold probabilities.
What I offer instead is an explication of BARD in Bayesian terms. The explication treats BARD as a judgment, not merely about the credal state the factfinder endorses as her own (i.e. not merely as one’s own credence in guilt), but as a judgment about alternative possible credences; specifically those the factfinder does not actually endorse, but still finds reasonable. To this end, I employ a Bayesian framework, expounded by Lange (Reference Lange1999), that permits revision of past prior probability assignments. In determining whether the evidence proves guilt beyond reasonable doubt, a trier-of-fact asks not merely about her own credence, but whether sufficient doubt persists among any reasonable starting point one might have taken; if it does, acquittal is warranted.
The proposed understanding of BARD also aids in the elucidation of two features of proof in criminal trial, both puzzling in Subjective Bayesian terms: the presumption of innocence and the dictum that factfinders must determine guilt exclusively on the evidence formally presented at trial. The account, if successful, will not only supply the correct account of these three principles in Bayesian terms but will vindicate Bayesian or probabilistic accounts of evidence at trial generally.
Furthermore, beyond criminal legal proof, the account sheds light on reasonable doubt more generally and on the norms governing the doxastic attitude of doubt, which are of independent interest in many facets of epistemology and inquiry. BARD, in other words, is not just a legally constructed notion; the law is responding to a feature of our epistemic life.
While the account is cashed out in Bayesian terms, aspects should be exportable to non-Bayesian or even non-probabilistic accounts as well. The account treats a finding of guilt as beyond reasonable doubt when the finder of fact can state that the assumptions one would have to make to defeat a conclusion of guilt are unreasonable ones to have made. It is not enough that the factfinder herself doesn’t share those assumptions, or that she finds those assumptions improbable. There must be something sufficiently defective about these assumptions such that making these assumptions would be unreasonable; otherwise, reasonable doubt is not defeated, and an acquittal is warranted.
2. Reasonable doubt
Historically, BARD grew out of the concern that a false judgment of guilt would result in damnation of the judge or factfinder ( F ), for visiting wrong upon an innocent. To remedy this, the idea was that if the factfinder were morally certain, she would be absolved of moral culpability for an erroneous judgment to convict.Footnote 2 Massachusetts’ jury charge is typical:Footnote 3
What is proof beyond a reasonable doubt? The term is often used and probably pretty well understood, though it is not easily defined. Proof beyond a reasonable doubt does not mean proof beyond all possible doubt, for everything in the lives of human beings is open to some possible or imaginary doubt. A charge is proved beyond a reasonable doubt if, after you have compared and considered all of the evidence, you have in your minds an abiding conviction, to a moral certainty, that the charge is true. When we refer to moral certainty, we mean the highest degree of certainty possible in matters relating to human affairs — based solely on the evidence that has been put before you in this case.
I have told you that every person is presumed to be innocent until he or she is proved guilty, and that the burden of proof is on the prosecutor. If you evaluate all the evidence and you still have a reasonable doubt remaining, the defendant is entitled to the benefit of that doubt and must be acquitted.
It is not enough… to establish a probability, even a strong probability, that the defendant is more likely to be guilty than not guilty. That is not enough. Instead, the evidence must convince you of the defendant’s guilt to a reasonable and moral certainty; a certainty that convinces your understanding and satisfies your reason and judgment as jurors who are sworn to act conscientiously on the evidence.
This is what we mean by proof beyond a reasonable doubt.Footnote 4
The verbal formulae used by courts, signifying nebulous standards spelled out in arcane ways is a serious problem. On the other hand, the general BARD standard has great intuitive and moral appeal. It would be a mistake to give up on it, but we must be clearer about what it requires.
As a first gesture, it is clear that BARD or moral certainty is a weaker standard than absolute certainty. BARD, whatever it is, requires less than a demonstrable proof of guilt. On the other hand, it must be stronger than mere suspicion or a hunch. In particular, we can calibrate BARD by comparing it to alternative standards the law employs, such as the preponderance of evidence standard (also known as the “balance of probabilities” or “more likely than not”) employed in civil cases, or clear and convincing evidence employed in some contexts in American law. BARD must be a higher degree of certainty or confidence than either of those, but lower than demonstrable proof.
Speaking of BARD in terms of degrees of certainty lends itself to a quantitative interpretation. One tempting path to doing this is to think of proof in terms of probabilities. This is the path recommended by legal probabilists.
3. Legal Probabilism
Legal probabilismFootnote 5 uses probability theory to “analyze, model and improve the evaluation of evidence and the process of decision-making in trial proceedings.” (Urbaniak & Di Bello, Reference Urbaniak and Di Bello2021).
Of particular interest is the Bayesian interpretation of probabilism, which works with subjective probabilities. Bayesians represent partial beliefs or degrees of beliefFootnote 6 as credences obeying the probability calculus. This means, at the very least, that these credences obey the Kolmogorov axioms. On the extreme subjective interpretation of Bayesianism, meeting this condition suffices for rational belief.Footnote 7
Under Orthodox Subjectivist Bayesianism,Footnote 8 agents begin with a prior probability;Footnote 9 the only constraint is probabilistic coherence according to the axioms. As the agent learns new information, she updates her beliefs via the rule of conditionalization, which updates her new credence in a hypothesis (H) to what her prior probability accorded that hypothesis, conditional upon the learning of this new evidence (E): PrNew(H) = PrOld(H|E). This can be calculated using Bayes’ theorem:

If rationality merely requires obeying the axioms, as subjectivists claim, these requirements are permissive; they admit any possible states satisfying the axioms. If conditionalization is required for updating,Footnote 10 subsequent posterior probabilities will be thereby constrained, but they are still a function of what the prior probability is.
Are there rational constraints on the prior probability distribution beyond adherence to the axioms? Subjectivists say “no.” Any prior satisfying the axioms is rational. Objective Bayesians, on the other hand, claim that there is a unique rational prior probability function for an agent to have.Footnote 11 Whether such a prior exists and how it is determined raises serious difficulties. Regardless, jurors don’t actually begin with such a prior and certainly do not all begin the trial with the same credences. On the other hand, Subjective Bayesians inherit the problem of the priors. The prior probability function is purely subjective: it is not subject to any rational constraint except consistency with the axioms.
If F updates using conditionalization, where F ends up after hearing the evidence is a function of where she began. Two triers-of-fact with different assumptions – and starting points – at the outset of the trial could reach different conclusions about guilt; two triers-of-fact might even rationally disagree whether the very same evidence weighs for or against H. On subjectivist views, this divergence can be entirely rational, as long as each factfinder responds to the new evidence consistent with the axioms.Footnote 12
A Bayesian trier of fact, will, at the conclusion of trial, upon hearing the evidence, assign a probability to the question at hand: of liability (in a civil case) and of guilt (in a criminal one). A natural extension of the Bayesian model would then render the preponderance standard as met when Pr(Liability|Evidence)>0.5. A further extension might render BARD as met when Pr(Guilt|Evidence) > t where t is some threshold (0.5 < < t ≤ 1). I’ll call this last extension the Threshold view.
Much of the debate pertaining to Bayesian analyses of factfinding at trial traces to an influential paper by Laurence Tribe on “trial by mathematics.”Footnote 13 The debate is most heated around the use of statistics to determine guilt.Footnote 14 Statistics, of course, are relevant for many inferences in trial, and nobody denies that a factfinder should be attentive to them, but naked statistical evidence conjoined with the Threshold view, delivers perverse results, no matter how high we adjust the threshold (unless we require 100%).
Consider a lottery-style case, Gatecrasher.Footnote 15 In Gatecrasher, we stipulate that (104 – 1) people at a show crashed the gate, with 104 in attendance, such that (104 – 1) committed a crime; exactly one ticket was sold and subsequently collected; we know that Pr(¬Guilt) = 10−4 for any given defendant, quite plausibly beyond the threshold. Gatecrasher can be raised as an objection to threshold views more generally. For any threshold shy of 1, a lottery/gatecrasher scenario can be constructed where the threshold is met, and yet a verdict of guilt would not be warranted.Footnote 16
3.1. What legal probabilism requires
We should, following (Di Bello Reference Di Bello2013), distinguish between two claims:Footnote 17
QUANTIFICATION: a probabilistic quantification of the case for defendant’s guilt can be given through an appropriate weighing of all the fallible evidence available (that is, of all the evidence against, and of all the evidence in defense of, the accused);
THRESHOLD: an appropriately high threshold guilt probability should be the decision criterion for criminal convictions.
In defending Bayesianism, I endorse Quantification but reject Threshold. I will also resist the sufficiency of naked statistical evidence for conviction, although I only devote a little bit of space to it here.
But while I reject Threshold, it will not do to simply state that one should assign a credence of guilt according to Bayes’ rule. For probabilism to be illuminating, it should go beyond how evidence is weighted, to how, ultimately, that weighting determines a verdict of guilt.
The basic commonsense story of the factfinder’s job is that she begins with a presumption of innocence, hears the admissible evidence at trial, and forms a judgment based on that evidence as to whether guilt has been proven. A judgment that guilt has been proven is not exactly the same as a judgment that the defendant is guilty. It is a judgment that the weight of the evidence is sufficient to sustain a judgment of guilt. A judgment of guilt requires BARD. The judgment of the evidence is an evidential probability. A probabilist will capture it with Pr(Guilt|Evidence). If our account of factfinding at trial is a Subjective Bayesian one, using subjective probabilities, the story presumably is as follows: the factfinder begins the trial with some prior probability of guilt, she then learns new information (the evidence at trial) and subsequently updates her credences via Bayes’ rule, arriving at a posterior probability of guilt.
The above is a tempting story of what a Bayesian factfinder does. But it raises three challenges for the Bayesian:
-
1. Presumption of Innocence. What is the presumption of innocence for a Bayesian? It is not obvious what a prior that respects this presumption is. Some have suggested a very low probability of guilt (how low?), and others that the probability of guilt is 1/n, where n denotes the other possible suspects (which creates a reference class problem).Footnote 18 But this will not suffice, since adjusting one’s probabilities in this manner to accommodate the presumption has spillover effects. One cannot just isolate credence in one particular proposition and adjust it. All other evidentially relevant propositions must be adjusted as well. This can be done, of course, via conditionalization, but the resulting “prior” will not be the epistemic state of the actual factfinder. An account that adjusts the prior in this way has already strayed from the standard subjectivist story.Footnote 19 Additionally, these adjustments require “unlearning” propositions that the subject has already learned and assigned probability 1. This cannot be undone by conditionalization.
-
2. Restriction to Admitted Evidence. How are we to understand the injunction to conditionalize only on the evidence at trial? Real factfinders learn and infer many things. In what sense is the probability of guilt a subjective probability, if it neither emanates from the subject’s own prior (due to the presumption of innocence) nor conditionalizes exclusively on the information the subject actually learns (due to the restriction to evidence at trial)? One could formalize this as an evidential probability of E, conditional on some artificially constructed prior probability, but this would neither correspond to the factfinder’s own posterior probability of guilt nor would it truly be free of the subject’s priors, since, that conditional probability itself will be informed by other features of the subject’s priors. Unless, that is, the prior is entirely artificially constructed, at which point, the account is no longer one of subjective probability at all.
-
3. BARD. The threshold picture is too simplistic, even if we ignore problems 1 and 2. But these issues play off each other as well. Ultimately, the question is not simply what credence the factfinder has in guilt. It is a question of what credence an ideal fact finder who began with the presumption of innocence and only updated on the evidence would have in guilt. Even if such a credence could be established, the BARD question remains: what facts about a credal state or otherwise determine whether BARD has been achieved?
3.2. Thresholds
It is tempting to assign a particular value of credence as a threshold for BARD. Substantial disagreement exists as to what the threshold probability actually is.Footnote 20 While it must clearly be above 50%, no obvious reason favors selection of any particular value in the closed interval between 50% and 100%.
Suppose the threshold was set at 100%. This would avert the charge of arbitrariness. But would arguably be too strong, since guilt is rarely (if ever) established at that level. Furthermore, it would be contrary to the spirit of the BARD/moral certainty standard, which was to allow for a lesser standard than certainty itself. Arguably, it would also be too weak. Couldn’t a factfinder harbor no doubts but still find it reasonable for someone else to doubt? I’ll come back to this objection, which characterizes all threshold accounts of BARD, later.
Suppose instead we settled on a threshold less than 100%, but higher than 50%. This causes several problems. For one, we get the gatecrasher/lottery problem issue, discussed earlier.Footnote 21 No threshold, shy of 100% avoids that problem. Secondly, the threshold seems arbitrary: why, e.g. 0.9 and not 0.91?
The arbitrariness concern is, perhaps, not hopeless. A similar debate surrounds the analysis of full belief for Bayesian and the relationship between full beliefs and credence. Not all Bayesians have need for a category of “full belief.”Footnote 22 Among those that do are those that identify belief with probability 1.Footnote 23 Others advocate versions of the Lockean Thesis,Footnote 24 identifying belief as some probability in the range of 0.5–1. The Lockean Thesis can be interpreted either descriptively (what credence constitutes belief?) or normatively (what credence justifies belief?); as an objective standard (setting the same threshold for all believers, sometimes referred to as the Strong Lockean Thesis) or a subjective standard (determining what threshold a particular believer believes at, referred to as the Weak Lockean Thesis), possibly incorporating contextualist or pragmatic considerations as well.Footnote 25 I don’t intend to delve into the debates concerning the Lockean Thesis, except to note that many of the issues that arise with threshold arise there as well. Perhaps it’s plausible to locate a threshold for belief.Footnote 26
What about doubt? Can doubt be identified with a threshold? Doubt might be scalar: whatever credence the agent has in the negation of the hypothesis considered (doubt of guilt = credence of innocence). Alternatively, it could be categorical: doubt is any non-zero credence in innocence or credence in innocence above a particular threshold. Alternatively, doubt is a doxastic attitude, distinct from credence. Still, if this attitude connects to a credal state, a threshold would still be possible, involving bridge principles (descriptive or normative) between one’s credence in ∼p and doubt that p. Perhaps there is no threshold for doubt, but, at the very least, if one disbelieves the hypothesis in question, then one at least doubts it.
Reasonable doubt, on the other hand, is a normative judgment on doubting. The norms that govern BARD are the norms beyond which doubt is no longer warranted. Just as with the normative Lockean principle for belief, a normative principle for doubt cashed out exclusively in terms of a threshold looks unpromising. A better understanding of BARD is one that goes beyond one’s own credence.
4. What is BARD for a Bayesian?
My suggestion is that judging doubt as beyond reasonable requires going beyond the actual credence you have in guilt (which may be a necessary condition but not a sufficient one for conviction), referring to other credences it would be reasonable to have. This requires a slightly different perspective on Subjective Bayesianism; one that involves stepping outside of your own credences. This stepping outside is limited, in that it retains subjectivism’s insight that you are always judging probabilities from your own subjective point of view. But it affords you the ability to look at alternative credal functions as reasonable or relevant alternatives.
Rejecting thresholds as an account of BARD, need not imply rejecting them as an account of doubt. It is one thing to doubt a claim, it is another to judge doubt as unreasonable. It is the latter that BARD is after. Thresholds are inadequate at capturing BARD and are at best, a necessary, but not a sufficient condition for BARD. Moreover, the problem of the priors exacerbates that inadequacy: credence in guilt (or innocence) is a function of one’s prior, i.e., of where one began.
Perhaps doubt can be defined adequately in terms of the degree of belief in innocence. Our question is whether such doubt, however defined, is reasonable. For this, F doesn’t determine merely whether F doubts, which is a function of where F began; instead, F must ask about other starting points as well, thinking about which would be reasonable and which unreasonable. My suggestion is that, for BARD to obtain, doubt must be ruled out from all reasonable starting points. A judgment of guilt consists in the judgment that all such reasonable starting points lead beyond doubt.
Except for cases of deductive certainty, there will always be priors, starting from which, and conditional upon evidence adduced at trial, doubt would be warranted. In other words, on any body of evidence, there is a coherent prior probability (consistent with the axioms) for which, conditional on the evidence, Pr(guilt) is low. If BARD meant ruling out such rational doubt, one could never convict, since there will always have been at least one other rational possible starting point that would have led to doubt. BARD cannot mean the absence of any rational doubt, at least not in a Subjective Bayesian framework.
4.1. Reasonable doubt v. rational doubt
What we require is a distinction between rational doubt and reasonable doubt.Footnote 27 Rational doubt just is doubt that would be rationally permissible per the axioms to have. In Orthodox Bayesianism, this just means doubt that is consistent with updating on some prior probability. This is too easy. BARD does not require ruling out rational doubt, but it does require ruling out reasonable doubt. This is more demanding than ascertaining that Pr(∼Guilt) is low but less demanding than ascertaining that doubt is rationally impermissible: it requires that F judge the (rationally possible) alternative conclusion (that Pr(Guilt) is low) as unreasonable. In what can this unreasonableness consist? It cannot be in the updating itself, since this follows mathematically via conditionalization. There is nothing unreasonable about the alternative conclusion given the alternative prior. Rather, the reasonableness must be in the prior itself. It is this, rationally permissible prior that the finder of fact needs to be able to rule out as an unreasonable one to have had.
Accordingly, the BARD test traces back to all possible starting points (priors) and asks: of all the possible starting points that lead to doubt, is there at least one that is reasonable? If there is, then guilt is not proven, since doubt is reasonable by the factfinder’s own lights; even if the factfinder herself doesn’t doubt the defendant’s guilt.
On the view being put forth, some alternative priors are reasonable alternatives, whereas others are unreasonable (even though they are rationally permissible). Only if doubt emanates from reasonable priors is doubt reasonable; if it only results from unreasonable priors, it is not reasonable doubt.
4.2. Reasonable credence functions and conditionalization
In developing this model, I am employing an account of conditionalization due to Lange (Reference Lange1999). Lange views the requirements of Bayesianism as steps in a justificatory argument from a set of reasonable priors to an argued-for conclusion, rather than as a description of how an agent must update her views given some actual past credal state. Lange shows how an agent can step outside of her particular priors and ask if and how a belief state that would have been reasonable at time t1 could have rationally evolved, given the evidence, into a different credal state at t2, regardless of whether the agent possessed that credal state at t1. What matters is the calibration of credal states across time relative to the available evidence at that time. On this view, an agent can recognize at some later time that her earlier credal state was unreasonable and adjust it.
Traditionally, Bayesianism represents a subject’s epistemic state via her credence function. In Orthodox Subjective Bayesianism, credences are determined by the prior probability, a historic, prior belief state. Subsequent belief states are permutations of this function via conditionalization. Subjective Bayesians afford the subject no recourse to any vantage point outside her own subjective probability function by which to judge whether her beliefs are reasonable: as long as they are probabilistically coherent and obey the axioms, they are rational. This means that subjectivists allow a degree of arbitrariness as to what the subject believes: the rational thing to believe at t2 depends on what the subject believed at t1, which depends on the prior probability, of which there is no unique rational answer as to what the subject should have believed. Each step is just the product of conditionalization, ultimately tracing back to the subject’s arational determined prior.
All subsequent beliefs that the subject has are determined by this prior. This creates trouble as an account of factfinding at trial, both because it suggests a high degree of dependence on the idiosyncrasies of the particular factfinder, as well as because it fails to account for the factfinder’s duty to be impartial and afford defendants the presumption of innocence.Footnote 28 A factfinder who begins with a highly biased prior against the defendant could rationally end up with a high probability of guilt through no epistemic fault of her own.Footnote 29
The view I am proposing does not treat the prior as an incorrigible starting point to which the subject is bound for all time. It does not deny the subjectivist insight that one’s credences at any moment rationally involve a degree of subjectivity, and hence that rational agents might disagree. It does, however, assume that agents can rethink their own credences and adjust them. A prior, on this view, is not necessarily a historic actual subjective probability that the agent affirmed at a prior time; nor is it a randomly assigned credal state that is entirely free of judgment. Rather, it is a probability distribution that an agent now affirms as the correct probability distribution to have held at a previous time, given what was in fact known at that time.
Upon learning new evidence, an agent might reconsider her prior, just as, in non-probabilistic contexts, an agent might reconsider her premises upon discovering their unwelcome conclusion. It is possible that at t1, the agent actually believed Pr(H)=p1 and Pr(H|E)=p2, but upon learning E, instead of adjusting her credence to p2 (as orthodox subjectivists require), the agent may come to realize that she should not have had those priors. Thus, suppose the agent adjusts her posterior Pr(H) to p3. This is permissible, so long as she simultaneously affirms that her prior probability Pr(H|E) should have been p3 as well. The agent may adjust her prior or her posterior, but the adjustments must be in tandem, keeping her well-calibrated. Rationality, on this view, requires conditionalization, not because you are bound by your actual priors; but rather because affirming that something would have been rational constrains you to believe what follows from it conditional on the new evidence.
In other words, the requirements of probabilistic rationality involve determining what a reasonable thing to believe would have been and showing how, given such a starting point, where the agent ended up was just a stepwise process of updating on the evidence (via conditionalization) from what is now taken as a reasonable starting point. Such an agent is not a prisoner of her prior, since she can learn that her actual prior was, in fact, unreasonable and retroactively adjust it accordingly.
A similar process can occur between jurors when deliberating. Their deliberation is not restricted to how to conditionalize on their own priors, but on what the appropriate priors should have been.
More importantly, for our purposes, the ability to judge what a reasonable prior is involves a modal judgment, not only of the agent’s actual prior but of the relevant reasonable alternatives to the agent’s priors. When an agent judges an alternative prior as reasonable, she is still doing so by her own lights. But this judgment need not involve endorsing that prior. She still represents the world via her own credences, which are determined by her own prior (i.e., the prior she currently endorses, and which she judges now as most reasonable), but judging any prior as reasonable involves the ability to make (some) comparisons with priors other than her own. This is the sense of reasonableness we are after: not merely by the lights of one’s own prior, but by the lights of other priors one deems as reasonable alternatives to one’s own. If an agent can rule out doubt on all reasonable priors, she is beyond reasonable doubt.
What counts as a reasonable prior? It is not algorithmic like the constraints of rationality. The details can be filled in by how objectivist or subjectivist your favorite account of Bayesianism is. On the objective end of this spectrum, the range of reasonable priors is narrow, perhaps constrained by indifference principles; on the subjectivist end, we can say at least this: the reasonableness of the prior is determined by the subject herself. This is subjective, indeed, but no more so than the selection of a prior probability function in the first place. Unlike the single prior, however, it allows for a range of possibilities, an assumption that seems not unrealistic: Don’t we all distinguish between those with whom we disagree and those whom we find unreasonable?Footnote 30 By the lights of one’s own actual credences, all other priors are simply mistaken. The ability to consider alternative priors means that one has the ability to consider a prior from a vantage point outside of one’s own actual specific commitments and consider whether such a prior is intelligible or acceptable.Footnote 31 Absent this ability, there is no sense in which an agent can consider whether a particular prior is worthy of adoption.
Some alternative priors might seem wrong but reasonable; others are ruled out as not worthy of consideration. It is important to understand that a rejected but reasonable prior is different from a mere credence. I might have credence C in a particular proposition, leaving a remaining credence (1–C) to express my doubt. This is not the same as judging an alternative credence as reasonable. I might doubt the testimony I hear in a case, while still judging someone who takes that testimony to be credible as reasonable; contrast this with the claim that I might see how a particular theory, if true, would undermine my confidence in guilt, but that lending credence to such a theory would not merely be mistaken but unreasonable. This involves judging a prior that gave high credence to this claim as itself unreasonable.
The reasonable doubt standard is one in which you look to those alternative credences that it would have been reasonable to have adopted, even though they are not your own, and ask whether any of them lead to doubt. It is not enough to remove doubt on your own credences – that would just be a threshold view – rather, you must do so along all credence functions within the set of reasonableness. When this happens, guilt is proven BARD.
The reasonable set is a proper subset of the rationally permissible set: the set of all consistent credence functions. BARD is a property of the reasonable set as a whole, rather than of a particular credence function. In other words, BARD does not supervene on your credences. Two agents might agree on all aspects of their credence function (particularly, they might agree on the probability of guilt) but still disagree on whether guilt is proven beyond reasonable doubt because they disagree about which alternative credences are reasonable.
This lack of direct dependence of BARD on probability is a feature of any view that rejects thresholds. It is brought out nicely in a recent paper by Moss (Reference Moss, Gendler, Hawthorne and Chung2022), which shares an important insight with the view presented here: namely that BARD requires ruling out relevant alternative hypotheses. In Moss’ account, this is captured by a knowledge requirement. The present account takes no stand on whether knowledge itself is required, but shares with Moss the idea that it is the relevant salience of alternatives (which are always conceivable) that makes or breaks legal proof. Moss’ account, as an externalist knowledge-based account, treats the context dependence of knowledge as the central guiding criterion. On my subjective Bayesian account, the relevance of an alternative is captured in the subjective assessment of the priors themselves and requires no externalist features.
For this account to work, all that is required is for the agent to be able to judge priors other than her own as reasonable alternatives. Presumably, this would also involve treating some alternatives as unreasonable as well. The sense of reasonableness here is by the agent’s own lights. This goes beyond orthodox Subjective Bayesianism in granting coherence to the judgment that some alternatives are consistent but unreasonable, without committing to the Objectivist Bayesian claim that there are objective constraints on priors (although such objective constraints are consistent with the view).
On the other hand, if you like Moss’ account of credences, according to which one can have knowledge about the probability of a claim (e.g. knowledge that the probability of guilt = x or is larger than x), that can work with this account as well. On such an account, the agent must look not only at her own credence in the defendant’s guilt; she must be in a position to rule out alternative credences that would undermine her judgment. This means that she must look to alternative priors, on the basis of which, Pr(guilt) is low, and be in a position to rule them out. On Moss’ account, this means she must know these priors are false. To defeat BARD, in Moss’s view, it is not enough to know that there are internally consistent alternative priors. Rather, one must know that those priors are irrelevant and thus dismissible. The basic structure, according to which BARD is a matter of proof along a range of permissible credence functions, rather than merely following from the properties of one’s own credence (even if justified), remains the same.
Suppose at the beginning of trial, F has high credence in medical testimony. This could be for various reasons: perhaps she has had good experience with doctors, or perhaps she is related to (or is) a doctor. A second factfinder might have low credence in such testimony. If the evidence at trial involves a doctor’s testimony, when each factfinder conditionalizes on this evidence, they will arrive at different posterior probabilities. On the standard subjectivist view, as long as the conditionalization was carried out correctly, both credences can be rational. End of story.
On the proposed view, however, F ’s job does not end here. The question factfinders must ask is not merely whether their own credence of guilt is rationally attained, but whether a reasonable factfinder might have, on the very same evidence at trial, arrived at a different conclusion. For this, they must ask: what credences must one have had at the outset of the proceedings to rationally arrive at a different conclusion? This will yield a set of possible prior probabilities, each of which is rational. Among this set of priors, they must then ask: are any of these reasonable? In other words, even if the alternative prior probability distribution is not the factfinder’s own, the account still assumes that a factfinder can distinguish between a reasonable and unreasonable alternative point of view: if the alternative prior needed would be one that involves slightly higher or lower credence in medical testimony, the answer would presumably be “yes”: one could reasonably have lower or higher confidence in such testimony than one has. If this is so, that one reasonably believes in guilt or innocence based on such testimony does not place the matter beyond reasonable doubt, no matter how high the credence in guilt. Importantly, this judgment of a reasonable alternative is not captured in my current credences, or even in the weight of my evidence. I may have overwhelming evidence for a proposition and believe with very high credence, and yet still recognize the reasonableness of another starting point.
On the other hand, suppose that one’s high credence in guilt or innocence would only be defeated by very high credence in a conspiracy theory or some alternative belief to which one not only attaches low credence but also judges that attaching a high credence would be wholly unreasonable. In such a case, while one recognizes that rational doubt is possible (because one cannot rule out the conspiracy theory), the conditions under which such doubt could be possible, i.e. conditional upon a prior one deems unreasonable, are such that one reasonably rules it out, and thus one can deem such doubt unreasonable.
4.3. Applied to specific accounts of doubt
Turning back to the various candidates for doubt discussed earlier, my account is neutral between these, as it is not meant as an analysis of doubt per se. But any account of doubt can be plugged back into BARD. This is easiest to see with threshold accounts. But, even without a threshold, the account can still shed light.
-
1. Doubt as credence >0 in innocence. Doubt of guilt just means lack of certainty in guilt. But certainty of guilt, while necessary, is not sufficient for BARD. F might believe in guilt with credence 1, but nevertheless judge that a reasonable set of priors would have led to a credence lower than 1. This would be a very demanding standard.
-
2. Doubt as a threshold between 0.5 and 1. F must ascertain that all reasonable priors meet the threshold. This will hold regardless of whether the threshold is a strong/objective one or a weak/subjective one.
-
3. No threshold for doubt. If there is no threshold for doubt, the test will run on whatever else, other than a threshold, determines doubt. For example, if the additional factor is something akin to resilience,Footnote 32 then resilience must be met in every relevant alternative. On the other hand, if the relation between doubt and credence is one of complete independence, the account is no longer Bayesian in any interesting sense. Notice that, on my account, credences still determine a finding of guilt, it’s just that F looks beyond her own credences towards alternative credences she deems reasonable. Lacking even that, the credences lose much of their traction.
Still, there are lessons here for non-Bayesian accounts as well. Whatever it is that merits doubt must be met from every reasonable starting point. The injunction to go beyond your own free assumptions (assumptions you could have with reason modified) in determining whether something is reasonably doubted can be generalized.
I’ll add that even on views that eschew a threshold for doubt, another possibility presents itself:
-
4. BARD as reasonable belief in innocence. In other words, in the examination of alternative credences, F asks whether, in any of them, belief in innocence is warranted, i.e., whether in any of them credence(not-guilt)>0.5 (or any other range up to the Lockean threshold for belief). This is a far more permissive standard than one that rules out doubt, but, as a floor, it is difficult to resist, even for those who reject thresholds.
5. Applying the analysis to the other norms
What about the presumption of innocence? Notice that a satisfactory solution falls right out of our BARD standard: In order to satisfy BARD, proof must be established from all reasonable priors. This means that even if F’s own prior is too suspicious of the defendant, she must factor in priors that are not. So long as it is reasonable to presuppose such a prior, F must include it in the set, even if in her own credence she assigns it low probability. Even if such a set is not within the trier’s reasonable set, we can stipulate it as a member of the set, for normative reasons, without compromising the subjective Bayesian nature of the calculation. F needn’t abandon her priors, she must merely accept the restriction on the set that includes the presumption.
As for restriction to evidence at trial, that the actual trier of fact knows more than what was introduced at evidence no longer matters. We can exclude information not formally presented as evidence, because in order to establish BARD, the relevant question is not “what do I believe about the defendant’s guilt?”, it is whether all admissible priors (including those that lack my information) would have arrived at a judgment of guilt, conditional on the evidence presented. The subjectivist nature of this judgment is not in the updating, it is in the endorsement of priors as reasonable. Once the priors are established, the rest of the procedure is mechanical, conditional on the evidence. As long as we can determine what the admissible priors are, the rest of the procedure presents no special difficulties. The question is now properly framed as whether guilt was established on all of them, conditional on the evidence at trial.
6. A Note on Naked Statistical Evidence
As stated, the insufficiency of naked statistical evidence was presupposed. In fact, it formed part of the motivation for rejecting thresholds. Still, one might wonder whether my account excludes proof on the basis of naked statistical evidence after all. Couldn’t we construct a gatecrasher/lottery case large enough to establish “doubt” on every suitable prior? Would such a case suggest that BARD can be established on the basis of naked statistical evidence alone?
I think not. While it is true that for any threshold there is a large enough sample of gatecrashers to place the probability that the defendant crashed the gate above that threshold, if we assume that each defendant was equally likely to crash the gate (i.e. if we assume something like an indifference principle), this is insufficient for BARD. BARD requires that for every reasonable prior, doubt can be eliminated. The factfinder in a naked statistical evidence case has nothing herself to go on beyond the statistics. But when contemplating what the evidence shows, she must consider all reasonable alternative priors. Suppose that an alternative factfinder had a high prior that this defendant would never crash the gate. In this case, the statistical evidence would barely move the needle, if at all. The large number of gatecrashers would not put any pressure on the hypothesis that D did not crash the gate, so long as the other participants are deemed sufficiently likely to gatecrash. In other words, the larger number of participants renders the original hypothesis (that D would not crash the gate) hardly less likely or more surprising than it was before.Footnote 33 This would not be true if D had similar views about more than one defendant. In such a case, the evidence would put pressure on that original hypothesis and point to revision. But for BARD, the hypothetical prior that treats this particular defendant as special seems unimpeachable in terms of reasonableness, even if the factfinder herself has no particular reason or inclination to adopt it.
This account is similar to Moss’ in this case. On her telling of Gatecrasher, you do not know that this defendant crashed the gate, even if you have highly justified credence that he did. You do not know because you cannot rule out the relevant possibility that this defendant’s character distinguishes him from the rest of the group.
Still, if this doesn’t satisfy you, all the standard responses to statistical evidence remain available. In other words, one can accept my account, including thresholds for doubt, and still reject naked statistical evidence as inadmissible because, e.g., it cannot result in belief,Footnote 34 cannot result in knowledge,Footnote 35 it is uncausedFootnote 36 , or due to normalcyFootnote 37 or other moral or policy reasons.Footnote 38
7. A Note on Applicability
This account is not meant as jury instructions. Nor do I advocate that juries necessarily sit with a calculator. I am not, after all, offering a threshold for BARD. Rather, this is a model of what a factfinder is aiming at. Perhaps some factfinders would best achieve this aim by actually using the model. For (most) others, heuristics may be preferable. Perhaps in cases with explicit probabilities, such calculation should be actively encouraged. But even then, factfinders must be careful to proceed from the entirety of the set of reasonable priors.
In practice, a reverse form of inference might be more effective. Ask yourself: what need I assume to make doubt salient? Then ask whether that is an assumption you’d rule out as unreasonable. This in practice will matter far more than settling the precise thresholds.
8. Conclusion
I have defended a Bayesian approach to proof BARD, both as a concept in the law and in epistemic practice more generally. This sheds light on what Bayesians should say about the presumption of innocence and about the restriction to admitted evidence. My account rejects thresholds as an analysis of BARD. Rather, it calls for a judgment that all reasonable paths, not just one’s own, lead to a guilty verdict.
By emphasizing the modal epistemic profile, the approach makes the BARD standard clearer and, as such, enables evaluation of whether the standard was met (e.g. via appellate review). One tries to identify a prior that a reasonable factfinder could have had that would have led to acquittal.Footnote 39