Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Sally Clark’s two baby boys, Christopher and Harry, were found dead on separate occasions.Footnote 1 Christopher’s death was at first treated as sudden infant death syndrome (‘SIDS’, also known as cot death).Footnote 2 However, when Harry died two years later, the autopsy revealed suspicious injuries and the findings from Christopher’s autopsy were re-evaluated. Clark was then charged with and convicted of the murder of both babies. At the trial, the prosecution called Professor Meadow, an expert paediatrician, to counter a potential defence claim that both deaths were natural as due to SIDS. In his testimony, Professor Meadow said that the probability of two occurrences of SIDS in a family similar to the Clarks was 1 in 73 million. This calculation was found to be flawed, but the Court of Appeal upheld the two murder convictions after finding the case against Clark still ‘overwhelming’ on each account (Clark-I: [254], [272]). A few years later, Clark’s husband found evidence in the hospital archives of microbiological results indicating that Harry had died from natural causes. A second appeal was allowed, and Clark was set free on 29 January 2003 after serving more than three years in prison.Footnote 3 Sadly, four years later, she was found dead in her home as a result of alcohol poisoning (BBC 2007).

While the wrongful convictionsFootnote 4 of Clark raise various issues,Footnote 5 much of the public and scholarly attention given to this case focused on the use of the SIDS statistics (the probability of two cases of SIDS in the same family).Footnote 6 The SIDS statistics were commonly regarded in the public media as somehow responsible, at least in part, for the wrongful convictions of Clark.Footnote 7 One newspaper even went as far as to assert that the statistical evidence was almost the only evidence against Clark.Footnote 8 The existence of some causal connection between the (mis)use of the SIDS statistics and the wrongful convictions not only appeared in the public media but was also hinted at by expert statisticians, such as Professor DonnellyFootnote 9 and Professor Dawid.Footnote 10

This chapter examines whether it was wrong to use the SIDS statistics in Clark, and if so, why. After providing a more detailed factual description of Clark in Section 2, in Section 3 the chapter scrutinises and rejects four common explanations, each of which draws a connection, either explicitly or implicitly, between the SIDS statistics and the wrongful convictions. The first explanation concentrates on the flaws in the calculation made by Meadow that led to the 1 in 73 million figure. The second holds that the SIDS statistics had an overwhelming psychological effect which overshadowed other evidence more favourable to Clark. The third explanation focuses on a logical mistake called the ‘prosecutor’s fallacy’ (explained below). The last explanation considers the courts’ refusal to use Bayes’ Theorem (also explained below). This chapter seeks to show that each of these explanations suffers from its own specific weaknesses. More generally, it is shown that these explanations are unpersuasive, taken either separately or jointly, since any connection between the SIDS statistics and the wrongful convictions is hard to establish. This is because it is difficult to accept that the wrongful convictions could have been prevented if the SIDS statistics had been used correctly, or even if they had not been used at all.

Section 4 suggests an alternative explanation of why it was wrong to use the SIDS statistics in Clark. Drawing on the theory of contrastive explanation from the philosophy of science (van Fraassen 1980: 97–157; Lipton 1990), this chapter shows that the use of SIDS statistics in Clark was wrong in principle. This is because the only way to make the SIDS statistics meaningful is to compare them, even implicitly, to another piece of statistical evidence: in this case, the rate of mothers who smothered to death both of their babies among the population of mothers who are similar to Clark. Using this second piece of statistical evidence is clearly objectionable. Hence, even if all the difficulties highlighted by the existing explanations were rectified, it would still have been wrong to use the SIDS statistics in Clark.

This explanation provides a principled reason that does not hinge on the actual consequences of the use of statistics in the specific case of Clark. It is thus not limited to Clark and can be applied to any similar case in which such statistics are adduced to eliminate potential natural, accidental, or other types of non-culpatory causes. It is argued here that even when the statistical evidence is gathered, analysed, and presented in the utmost professional manner by experienced expert statisticians and interpreted correctly by the judges and jurors, it is still wrong in principle to use it in cases such as Clark. This chapter thus concludes that such statistics should not be used in criminal proceedings to prove the material fact of causation (i.e. the fact that the harm was caused by the accused’s misconduct).

This chapter focuses mainly on the decision of the first appeal to uphold the convictions, rather than on the initial trial or the second appeal, for several reasons. Firstly, trained and experienced judges, such as those who presided over the first appeal, are said to be less vulnerable to counsel’s trial tactics, logical fallacies and so on, than lay juries, such as those who convicted Clark in the actual trial and who, most likely, were participating in a trial setting for the first time in their lives. Focusing on the first appeal thus reduces the importance of factors external to the available evidence and to the legal reasoning. Secondly, while the jury in the actual trial did not provide any detailed account of its reasoning,Footnote 11 the decision of the first appeal contains immense detail about the judges’ reasons for upholding the convictions. Lastly, the convictions were upheld by the first appeal despite the immense attention to detail and argument.Footnote 12 It is the thoroughness of the judgment which makes its erroneous outcome so troubling and thus also makes it a more suitable source for analysis than the second appeal or the actual trial.

2 Factual Background

Before considering possible explanations, it is first necessary to provide a more detailed description of the case.Footnote 13 Clark, who was 35 years old at the time of the trial, was a solicitor with no previous criminal record. Christopher, her first child, was born healthy but died at the age of 11 weeks while her husband was out at an office party. Dr Williams, a pathologist, initially treated the death as a case of SIDS and considered the cause of death to be lower respiratory tract infection. He found bruises on the body and a small split in the frenulum, but he thought at the time that these were caused by resuscitation attempts. Before the body was cremated, Dr Williams took photographs and preserved slides of the lungs. Two years later, Harry, the second child, was also born healthy but was found dead at the age of 8 weeks by his mother when the husband was not in the room. The findings at the autopsy were indicative of nonaccidental injury, and Dr Williams determined the cause of death as shaking. He also revisited Christopher’s case, conducted further tests, and altered his opinion, concluding that Christopher’s death was also unnatural and that the evidence was suggestive of smothering (Clark-I: [2]–[3]).

The prosecution case was that Clark had murdered both babies (Clark-I: [6]–[7]). According to the prosecution, neither could be considered SIDS because of the existence of recent and old injuries that had been found in each case. There were several similarities between the cases: both babies were of similar age and found unconscious shortly after having been fed, in the same room, by Ms Clark, when she was alone with them, and when Mr Clark was absent or about to leave. Most importantly, in each case there was evidence of previous abuse and of deliberate injury recently inflicted. With regard to Christopher,Footnote 14 three pieces of medical evidence were adduced. Firstly, there was bleeding in his lungs, both old and fresh. The old bleeding is a marker for asphyxia and cannot be explained by the nosebleed Christopher had during a family trip to London because so much blood going into the lungs would have required urgent hospital treatment, but in fact Christopher recovered spontaneously. The nosebleed was also independently consistent with a prior attempt at smothering. Secondly, in the autopsy, Christopher’s frenulum was found to be torn. The prosecution alleged that this was a result of deliberately inflicted injury rather than of resuscitation efforts and thus consistent with smothering. Lastly, Dr Williams had no doubt that he saw bruises on the body.

The defence case was that the evidence available was insufficient to prove beyond reasonable doubt that the babies’ cause of death was unnatural (Clark-I: [10]–[15]). The defence therefore suggested that the babies must have died of natural causes, though it did not commit to any specific natural cause (including SIDS).Footnote 15 The defence emphasised that two of the prosecution’s expert pathologists (Professor Green and Dr Keeling) gave the cause of death as unascertained and that therefore the prosecution’s entire case hinged on the reliability of Dr Williams, who performed the autopsies. With regard to the medical evidence, the defence claimed that the marks on the body, which were interpreted by Dr Williams as bruises, were not examined under a microscope and were not seen at the hospital by other personnel who saw the baby. The injured frenulum could have been caused during insertion of the laryngoscope. As for the fresh bleeding, the defence claimed that it was only a marker for smothering and was often found both in suspicious cases and in cases of cot death. The old bleeding could have been a result of the nosebleed, the occurrence of which was not disputed. The defence argued that it was not caused by an attempted smothering because it was unlikely that Clark would have attempted to smother Christopher on the day she had brought him to London to show him to her friends.

3 Existing Explanations

3.1 The Flaws in Meadow’s Calculation

A common explanation of why it was wrong to use the SIDS statistics in Clark focuses on the way the statistical evidence was used or, perhaps more precisely, misused. The statistical calculation used by Professor Meadow, which aimed to establish that the probability of two cases of SIDS occurring in the same family was 1 in 73 million, drew substantial criticism from the statistician communityFootnote 16 and was even singled out as the cause of the wrongful convictions.Footnote 17 Meadow reached this figure by first calculating the probability of a SIDS death in a family similar to the Clarks (professional, non-smokers, and mother aged over 26) to be 1 in 8,543. He then multiplied this figure by itself to reach the probability of two cases of SIDS in the same family. This calculation was challenged on two grounds. Firstly, the probability of one case of SIDS (1 in 8,543) was contested (Clark-I: [138]), also with reference being made to another study (the CONI study, ibid.). Secondly, and more importantly, Meadow’s calculation was flawed because it assumed independence between the two events of death.Footnote 18 In other words, Meadow assumed the probability of a second SIDS death to be equal to the probability of a first SIDS death.Footnote 19 However, there are numerous potential genetic and environmental reasons why a family which has already experienced SIDS is at higher risk of experiencing (another) SIDS death than a family which has never previously experienced SIDS (e.g. if the parents have certain genes which increase the risk of SIDS).Footnote 20 Had the calculation been accurate, the correct figure would have been less dramatic than 1 in 73 million.Footnote 21

However, it remains questionable how the flaws in Meadow’s calculation, serious as they may be, can explain why it was wrong to use the SIDS statistics in Clark. On their own, the most that these flaws can establish is that the calculation should have been done more carefully. However, that leaves open both the question of whether the wrongful convictions would have been prevented and the question of whether such statistics, even if calculated correctly, should have been used in Clark.

Furthermore, the flaws in Meadow’s calculation were known and highlighted not only during the first appeal but also during the actual trial, before the involvement of the expert statisticians. During the trial, the defence referred to Professor Emery’s study which showed cases of a second cot death to be more frequent than was argued by Sir Meadow (Clark-I: [116]). Professor Berry, one of the defence experts, emphasised the possibility of unknown factors, which further undermined Meadow’s assumption of independence (Clark-I: [122]), as the court also noted (Clark-I: [155]). The jury was also warned by the trial judge that the 73 million figure should be treated with caution (Clark-I: [144]) and was reminded that the risks of SIDS were inherently greater in a family which had already experienced SIDS (Clark-I: [145]), a warning which undermined the assumption of independence made by Meadow in his calculation. Given that Meadow’s calculation was strongly disputed from the outset, it is difficult to accept that the flaws in the calculation were responsible for the wrongful convictions. Therefore, while such calculations should be made more carefully, it is hard to see how the flaws in Meadow’s calculation can offer an explanation of why it was wrong to use the SIDS statistics in Clark.

3.2 The Psychological Effect of the Statistical Evidence

Another related common explanation of why it was wrong to use the SIDS statistics in Clark refers to the psychological effect that the impressive figure of 1 in 73 million might have had on the jurors in Clark’s trial or on the judges in the first appeal. According to this explanation, the figure of 1 in 73 million had such a strong psychological impact that it caused the suppression and underappreciation of the nonstatistical evidence that was more favourable to Clark. Stephen Clark, her husband, for instance, commented on the statistics as being ‘an arrow through the fog’ that gave the jury a compelling case against his wife.Footnote 22 According to this explanation, it was wrong to use the SIDS statistics because had they not been used, other evidence would have received adequate attention and the wrongful convictions could have been prevented.Footnote 23

However, this explanation relies on the empirical assumption that, when considering statistical evidence, people tend to discount other pieces of nonstatistical evidence which are available. By contrast, several commentators have pointed out that empirical research reveals the exact opposite (Shaviro 1989: 545). For example, the influential psychological experiments conducted by Kahneman and Tversky show that people tend to disregard statistical evidence (‘background information’) when other specific evidence is also available.Footnote 24 This empirical research is particularly relevant to Clark, since there, the fact-finder had immensely detailed specific evidence to consider, mostly about the pathological findings found in the autopsies.Footnote 25 The empirical research may thus suggest that it is unlikely that this specific evidence was suppressed by the general base-rate statistics presented by Meadow. Furthermore, if a hypothetical fact-finder in a similar case to Clark were presented with specific evidence which clearly showed a natural cause of death, it is difficult to accept that she would be so overwhelmed by the 1 in 73 million figure that she would convict in spite of such specific evidence. If anything, she would be more likely to fail to give this base-rate statistic the weight it deserved.Footnote 26 Hence, if there is a concern about the evaluation of the SIDS statistics, it is more likely to involve the unjustified underweighting of the statistics than any overwhelming effect.

3.3 The Prosecutor’s Fallacy

Yet another common attempt to explain why it was wrong to use the SIDS statistics in Clark, one which also refers to the psychological effects of the SIDS statistics, concerns the ‘prosecutor’s fallacy’.Footnote 27 The term ‘prosecutor’s fallacy’ identifies the logical mistake of treating the probability of the occurrence of the available evidence given the innocence of the accused as if it were the probability of innocence given the available evidence.Footnote 28 In the first appeal, the defence argued that the probability of two deaths given Clark’s innocence was confused with the probability of Clark’s innocence given the occurrence of two deaths. The defence alleged that the jury fell prey to the prosecutor’s fallacy by understanding the SIDS statistics as showing that the probability of Clark’s innocence given these two deaths was 1 in 73 million, instead of understanding them as showing that the probability of two SIDS deaths given Clark’s innocence was 1 in 73 million.Footnote 29

However, to the extent that this explanation has strength, it is mainly with regard to the decision of the jury rather than of the judges of the first appeal.Footnote 30 In general, judges have more experience in dealing with complex and scientific evidence because such evidence appears in many cases and evaluating it is a frequent task in their day-to-day routine. More importantly, the judges of the first appeal were equipped with the expert opinions of two distinguished statisticians (Professor Phil Dawid and Dr Ian Evett).Footnote 31 Therefore, it is hard to accept that the judges of the first appeal fell prey to the prosecutor’s fallacy, especially since they had been warned about it specifically by the expert statisticians (even assuming such a warning was in any way necessary).

3.4 Bayes’ Theorem

A further explanation for why it was wrong to use the SIDS statistics in Clark considers the courts’ refusal to use Bayes’ Theorem. Bayes’ Theorem is a mathematical formula that aims to instruct an agent on how to rationally alter his or her initial (prior) probability in light of new evidence.Footnote 32 Several eminent statisticians support using Bayes’ Theorem in situations where the jury faces both statistical and nonstatistical evidence.Footnote 33 It could be argued that it was wrong to use the SIDS statistics in Clark because the figure was adduced on its own. Instead, Bayes’ Theorem should have been used to combine the statistical evidence with the nonstatistical evidence and to assess accurately the probability of Clark’s guilt given all the available evidence.Footnote 34 It should be emphasised that such an explanation does not imply that using the SIDS statistics in Clark was wrong in principle. On the contrary, such critics tend to be sympathetic to the use of statistical evidence in court, as long as it is used correctly,Footnote 35 which means, inter alia, using Bayes’ Theorem.Footnote 36

However, any explanation that refers to Bayes’ Theorem faces various difficulties, most evidently the need to quantify the numerous pieces of complex medical evidence into precise probabilities. Furthermore, the legal literature has noted several general difficulties in applying Bayes’ Theorem in criminal courts, and the question of whether this statistical method should be used in court is probably one of the most debated issues in the theory of evidence law.Footnote 37 It is worth mentioning some of the main difficulties in simplified form. Firstly, Bayes’ Theorem requires an assignment of prior probability of guilt, before any evidence is introduced (p(G)). However, it is questionable how assigning prior probability of guilt could be consistent with the presumption of innocence.Footnote 38 Secondly, it is questionable whether jurors, lawyers, and judges, all of whom usually lack any statistical training, would be able to deploy this method accurately.Footnote 39 Thirdly, once the calculation has produced a figure, it is contentious as to whether and how this figure could or should be translated into a guilty/not-guilty verdict (Nesson 1985).

Given these difficulties in quantifying the complex medical evidence and in applying Bayes’ Theorem in court, it is far from clear that using Bayes’ Theorem would have assisted the administration of justice to avoid the wrongful convictions. While the applicability of Bayes’ Theorem in court remains an important theoretical question, it is hard to see how the courts’ refusal to use Bayes’ Theorem can explain why it was wrong to use the SIDS statistics in Clark.

3.5 The Insignificance of the SIDS Statistics

All four explanations seem to assume, either explicitly or implicitly, that the use of the SIDS statistics in Clark was wrong because it was responsible, at least in part, for the wrongful convictions.Footnote 40 They mainly diverge on the details of exactly how the SIDS statistics led to the wrongful convictions: by exaggerating the rarity of two SIDS deaths in one family, by creating a psychological effect that overshadowed other exonerating evidence, by inducing the prosecutor’s fallacy or by requiring the use of Bayes’ Theorem, a method which the courts refused to adopt. However, all these common explanations seem to share the assumption that had the SIDS statistics been used correctly, or had they not been used at all, the wrongful convictions could have been prevented.

Contrary to this shared assumption, it is argued here that the role of the SIDS statistics in Clark is much overrated. Firstly, the prosecution had a strong case even without the SIDS statistics. In the weaker case of Christopher, the prosecution pointed to bruises, the torn frenulum and fresh bleeding in the lungs. The defence challenged the existence of each of these pieces of evidence, and they received thorough and repeated scrutiny both during the long trial and in the lengthy decision of the first appeal. Yet it is worthwhile to note that each of the defence medical experts ‘agreed that if there was bruising, the injury to the frenulum and bleeding in the lungs, it suggested asphyxia’ (Clark-I: [40]). The prosecution case about Christopher did not hinge on a single piece of medical evidence, while the evidence relating to Harry was even more worrying.Footnote 41 Even if the defence were successful in establishing a reasonable doubt about one of the pieces of evidence, the cumulative weight of these pieces of evidence was probably (and should have been) higher than the sum of its parts. It is therefore questionable whether using the SIDS statistics correctly, or not using them at all, would have, or should have, changed either the jury’s decision to convict or the first appeal’s decision to uphold the convictions. Given that the microbiological results which led to Clark’s release were not yet known at that stage, the convictions would probably have been reached by the jury and upheld by the court in the first appeal, with or without the SIDS statistics.

Secondly, the SIDS statistics were inessential to the prosecution case, since the defence experts accepted the very fact that this evidence was adduced to prove: that SIDS was not the cause of death. This was most evident in Harry’s case. Dr Whitwell, for the defence, testified that ‘[s]he would not classify this a SIDS death because a true SIDS death should be completely negative and would not normally occur at this time in the evening, after a feed, with the child in a bouncy chair’ (Clark-I: [77]). Dr Rushton, also for the defence, went even further and ‘agreed that there were features in both deaths that gave rise to very great concern and for that reason he would not class them as SIDS deaths’ (ibid.). Little wonder that, when considering the fallacies of Meadow’s statistics, the court concluded that:

[The statistical evidence] was very much a side-show at trial. The experts were debating the incidence of genuine SIDS (unexplained deaths with no suspicious circumstances) in a case where both sides agreed that neither Christopher’s death nor Harry’s death qualified as such.Footnote 42

4 The Contrastive Explanation

Having considered and rejected the existing common explanations of why it was wrong to use the SIDS statistics in Clark, this part proposes an alternative explanation. It is argued here that it was wrong in principle to use the SIDS statistics, regardless of their actual share in the responsibility for the wrongful convictions. The explanation here is also not connected to concerns regarding the correct use and presentation of statistics, which have been raised in relation to Clark and have been discussed more generally in the academic literature.Footnote 43 While objections to the use of statistical evidence which are based on such concerns are important, they are nevertheless practical in nature and may be overcome by better education and training of the legal professionFootnote 44 and/or by more assistance from expert statisticians.Footnote 45 Such concerns should be distinguished from the position that the use of the SIDS statistics in Clark was wrong in principle, which is what the following section seeks to establish.

The most appropriate point to begin is the way in which the expert statisticians in Clark thought the SIDS statistics should have been used. Consider the testimony of Professor Dawid, one of the two expert statisticians for the defence, who states that:

The laws of probability now focus attention on, not the absolute values of these probabilities of the two deaths in one family arising from the different causes considered, but on their relative values.Footnote 46

He then concludes that:

[The probability of two SIDS deaths in the same family] could only be useful if compared with a similar figure calculated under the alternative hypothesis that both babies were murdered.Footnote 47

To some extent, Dawid’s conclusion already includes an objection to the use of the SIDS statistics. According to this objection, the absolute probability that the deaths occurred for innocent reasons (the first hypothesis) is meaningless on its own. It does not matter how likely or unlikely it is that the deaths occurred for innocent reasons. All that matters is how low (or high) this probability is relative to the probability that the events occurred for culpable reasons (the second hypothesis). Even if the probability of two SIDS deaths is strikingly small, all that matters is how it stands relative to the probability that a mother would murder both her babies, which was estimated by Dawid to be about 1 in 2 billion.Footnote 48 As this probability is much lower than the probability of two SIDS deaths, the SIDS statistics actually support Clark’s innocence.Footnote 49 In addition to his criticism of Meadow’s amateur calculation, Dawid’s main point is that discussing the probability of two SIDS deaths in one family without referring to the alternative hypothesis is meaningless and misleading.Footnote 50

In the first appeal, the court dismissed the exercise of comparing the probabilities of the two hypotheses as ‘not realistic’ and rejected any suggestion of estimating the probability of the second hypothesis (Clark-I: [176]). This dismissal was too quick. The probability of the first hypothesis (two SIDS deaths), as given by Meadow, was accepted by the court in the first appeal, after it had concluded unequivocally that this evidence was ‘clearly relevant and admissible’ (Clark-I: [166]). So why was it unrealistic to consider the probability of the second hypothesis (two murders)? The difference between the two pieces of evidence cannot lie in a lack of information, because Dawid’s expert report offered preliminary statistical evidence for the second hypothesis.Footnote 51 Nor can the difference lie in a lack of statistical expertise, because the court in the first appeal was equipped with the expert opinions of two experienced statisticians, Professor Dawid and Dr Evett. Lastly, the difference cannot lie in a general antagonism to statistics.Footnote 52 Such an antagonism would apply equally to the SIDS statistics, which the court readily admitted. The approach of the court in the first appeal is therefore hard to understand or defend.

In the second appeal, the court reached a different conclusion, according to which the SIDS statistics probably should not have been admitted.Footnote 53 However, the court did not offer any explanation of why the first appeal’s ruling regarding the admissibility of the SIDS statistics should be reversed. Nor did the court in the second appeal give its reasons for concluding that the SIDS statistics should not have been admitted. No guidance was provided for distinguishing this piece of statistical evidence from other kinds of statistical evidence which are regularly used in court (e.g. DNA evidence). The inconsistency between the two decisions and the lack of judicial reasoning in the second appeal demonstrate the need for a principled approach to the use of statistical evidence in court. Developing such a general approach requires a separate and extensive research project,Footnote 54 and thus it is beyond the scope of this chapter. However, the following explanation for why it was wrong in principle to use the SIDS statistics in Clark can be applied to other similar cases, and thus this case study of Clark is concluded with a more general recommendation.

The starting point of the proposed explanation is the issue of competing hypotheses, to which the expert statisticians drew attention. Competing hypotheses are central to the theory of legal fact-finding.Footnote 55 One context in which competing hypotheses are particularly important is that of determining the role of causal explanations in legal fact-finding. The importance of explanatory aspects of legal fact-finding has recently been highlighted by Allen and Pardo in their discussion of the theory of inference to the best explanation and its application to evidence law (Allen and Pardo 2008). A related theory, one which can shed light on the way the SIDS statistics were used to prove the material fact of causation, is the theory of contrastive explanation (van Fraassen 1980: 97–157; Lipton 1990).

The theory of contrastive explanation can be described as follows. The idea, in outline, is that most, if not all, requests for explanation (‘why’ questions) contain a contrast: instead of understanding the request as ‘why P’, the theory of contrastive explanation suggests understanding such questions as ‘why P and not Q’. To borrow Lipton’s terminology, ‘why’ questions contrast between facts and foils.Footnote 56 Sometimes the contrast is explicit: ‘why did you order cheese quiche rather than beef lasagne?’ Here the foil (beef lasagne) is stated explicitly as part of the question. However, the theory of contrastive explanation argues that often (if not always)Footnote 57 the contrast is implicit. For example, when someone asks ‘why did the mercury in the barometer go up before the storm?’, the implicit contrast is ‘why did it go up rather than stay where it is’, and not ‘why did it go up rather than break the glass’.Footnote 58 If someone answered the question with a detailed explanation of the rigidity of the glass,Footnote 59 we would consider this explanation to be irrelevant and inadequate.Footnote 60

Exploring the full application of the theory of contrastive explanation to legal fact-finding in general, and to the proof of causation in particular,Footnote 61 would probably require yet another separate research project. Nonetheless, in the limited context of analysing the use of the SIDS statistics in Clark, this theory can be employed to substantiate the expert statisticians’ assertion that the SIDS statistics make sense only if they are compared with the probability that a mother would smother her two babies. The fact-finder in Clark sought to answer the question of why the babies died. The contrast class of this why question was constructed from possible causes of death. The prosecution needed to establish that the preferred contrast was unnatural (smothering in this case)Footnote 62 rather than being any other contrast of natural causes of death such as SIDS. On its own, the very low probability of SIDS does not provide support for preferring one specific contrast to another. The low probability of SIDS supports the prosecution’s preferred contrast only if this probability is lower than the probability of the prosecution’s preferred contrast (smothering). But this comparative claim can be established only with evidence about the probability of the prosecution’s preferred contrast. To support the prosecution’s answer to the question of why the two babies died, the SIDS statistics must be accompanied by evidence detailing the rate of mothers smothering both their babies in the population of mothers who are similar to Clark.

The probability of smothering may be referred to either explicitly or implicitly. According to the expert statisticians, this reference should be made explicit: one should use empirical data to calculate the rate of mothers who smother both their babies in the wider population of non-smoking professional mothers aged over 26.Footnote 63 One might reject this idea and hold that the fact-finder should not be provided with any evidence of that sort. Yet ample empirical research shows that fact-finders consider explanations of innocence by contrasting them with explanations of guilt, and vice versa, as a matter of course.Footnote 64 In the absence of empirical data, the fact-finder might resort to general knowledge and common sense at best, or to prejudice and arbitrary guesswork at worst. Therefore, as supported by empirical research, emphasised by the expert statisticians and substantiated by the theory of contrastive explanation, using the SIDS statistics requires the fact-finder to compare, either explicitly or implicitly, the probability of SIDS with the probability that a professional non-smoking mother aged over 26 would smother both her babies on two separate occasions.

Once the need for this comparison is recognised, the reason why using the SIDS statistics in Clark was wrong in principle becomes clear. Developing a comprehensive and general account to justify why courts should not use statistical evidence about the rate of similar misconduct in a population similar to the accused is outside the scope of this chapter. Yet it is difficult to underplay the fact that justice systems around the world hardly ever, if at all, use such statistical evidence to convict.Footnote 65 While explaining why this is the case is a challenging theoretical question, the probability of smothering seems a paradigmatic example of statistical evidence of a kind which should not be used in court.

Bearing in mind the scope of this chapter, a few preliminary remarks can nevertheless be made as to why evidence about the probability of smothering seems so intuitively objectionable. Using evidence about the rate of smothering among other similar mothers seems objectionable because it is inconsistent with regarding Clark as unique and morally autonomous individual. Using such evidence assumes that she would exhibit the typical behaviour of her peers (professional non-smoker mothers aged over 26). It also presupposes that these characteristics determine, either fully or partly, the individual’s behaviour. Without presupposing that such a common determining property is shared between these mothers, it is unclear why this is the relevant group from which an inference to Clark’s individual case should be made. Why not refer to mothers in general, parents of both genders, people whose last name starts with C and so on? The choice of this particular group as relevant to Clark’s individual case thus implies that the members of this group share a property which affects their behaviour.Footnote 66 The guilt of an individual should be proved based on the particular facts of her case rather than on the rate of similar misconduct among other people with similar characteristics to hers.Footnote 67 These remarks are necessarily preliminary. But the main point is that evidence regarding the probability of smothering is clearly objectionable, even if it is difficult to account for why this is so.

5 Conclusion

This chapter has questioned whether it was wrong to use the SIDS statistics in Clark, and if so, why. Four common explanations were discussed and rejected. The first explanation refers to the flaws in Meadow’s calculation of the 1 in 73 million figure. Yet, however serious these flaws are, it is difficult to accept that they can explain why it was wrong to use the SIDS statistics in Clark. The second explanation raises the suspicion that the SIDS statistics had an overwhelming psychological effect, which overshadowed other evidence more favourable to Clark. However, if there is a concern about the weight given to the SIDS statistics, it is more likely to be a concern of underweighting rather than overweighting. The third explanation focuses on the ‘prosecutor’s fallacy’, yet it is hard to substantiate the concern that the court fell prey to this fallacy, especially given it was specifically warned about it by the expert statisticians. The last explanation considers the courts’ refusal to use Bayes’ Theorem. However, the difficulties with quantifying the complex medical evidence and applying this statistical method in court make explanations which hinge on it difficult to sustain.

A more general difficulty with all four explanations is that the connection between the SIDS statistics and the wrongful convictions is hard to sustain. The role of this evidence in Clark is much overrated, since the prosecution case was strong without the SIDS statistics. Furthermore, even the defence experts agreed that the deaths should not be categorised as SIDS, making the SIDS statistics inessential for the prosecution case. It is therefore difficult to accept that the wrongful convictions could have been prevented had the SIDS statistics been used correctly, or even had they not been used at all.

However, the question still remains whether it was wrong in principle to use the SIDS statistics in Clark, and if so, why. This chapter has argued that even if the issues identified by the existing explanations were properly addressed, it would still have been wrong in principle to use the SIDS statistics in Clark. This position is based on the theory of contrastive explanation. Its application to Clark begins with a similar point to that of the expert statisticians, namely, that the SIDS probability makes sense only when it is compared with the smothering probability. However, unlike the expert statisticians’ approach, the theory of contrastive explanation does not require any commitment to Bayesian methods in order to reach this point. This is an advantage because Bayesian methods were explicitly rejected by the English Court of Appeals for ‘[plunging] the jury into inappropriate and unnecessary realms of theory and complexity deflecting them from their proper tasks’ (Adams-I: 482).

Yet the main difference between the approach of this chapter and that of the expert statisticians lies in the conclusion. The expert statisticians’ approach would require the application of Bayes’ Theorem in order to make sense of the SIDS statistics, because using this method is ‘the only logically sound and consistent approach to considering situations such as this’.Footnote 68 Their position thus implies that in order to make sense of the SIDS statistics, it would be necessary to use Bayes’ Theorem and thus also necessary to adduce further statistical evidence to assess the probability that a mother similar to Clark would smother both her babies.

This chapter, by contrast, has reached a different conclusion. The problem with the use of statistical evidence in Clark was more fundamental than merely a flaw in the statistical analysis. The SIDS statistics should not have been used, for they would have required the use of another piece of evidence which is clearly objectionable: statistical evidence on the rate of smothering. If one piece of evidence cannot be used without another piece of objectionable evidence, then neither of them should be used. Regardless of whether the exercise of comparing probabilities of SIDS and smothering is ‘realistic’ or not, it should not be conducted as part of a criminal proceeding.

This chapter has focused on only a single case study, and hence any attempt to generalise from Clark to the wider issue of the use of statistical evidence in court should be made with caution. However, perhaps there is one lesson from Clark that can be phrased in more general terms. It is sometimes tempting to ask how likely it is that a given unusual event (e.g. two baby deaths in one family) will happen randomly, by accident, naturally, or for any other non-culpatory cause. If the event in question happens so rarely in the course of nature, this fact alone seems to provide evidential support for the hypothesis that the event must have happened as a result of deliberate human intervention, namely, the accused’s misconduct. However, Clark alerts us to the fact that this type of inference is problematic, because the natural/random/non-culpatory causal explanation must be contrasted with a culpatory one. Using statistical evidence on the rate of a certain non-culpatory cause would thus require also using statistical evidence on the rate of the type of criminal behaviour attributed to the accused among other people similar to him or her.Footnote 69

There may be contexts in which it would be appropriate to refer to the probability of an unusual event happening naturally and to use statistical evidence and methods to calculate this probability. For example, in the decision whether to equip parents with an apnoea monitor which detects pauses in breathing in young babies,Footnote 70 it may be wise to consider the probability of SIDS among families with similar characteristics. However, when the same question arises in the context of determining whether an individual is guilty of an alleged crime, using this kind of evidence becomes problematic. Since statistical evidence regarding the rate of misconduct among other people similar to the accused should not be used to determine the accused’s guilt, the same goes for statistical evidence regarding the probability of non-culpatory causes. No doubt the use of statistical evidence in court is a complex issue, and the same logic would not necessarily apply to statistical evidence used to prove other facts (such as identification proven with DNA evidence) or to statistical evidence used in noncriminal proceedings. However, this case study of the use of statistics in Clark exemplifies the following general point. Statistical evidence on the probability of non-culpatory causes should not be used in criminal courts to prove the material fact of causation. Before delving into the technical complexities of how to gather, analyse and present statistical evidence which shows the exact value of such probabilities, one should first ensure that raising the question of probabilities (and using statistical evidence to answer it) is appropriate in the given context. At least when determining an individual’s guilt, it seems rather that it is not.