A statistical analysis of Iraq body counts

Nadarajah, Saralees

doi:10.1007/s11135-013-9971-9

A statistical analysis of Iraq body counts

Published: 22 November 2013

Volume 49, pages 21–37, (2015)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Quality & Quantity Aims and scope Submit manuscript

A statistical analysis of Iraq body counts

Download PDF

Saralees Nadarajah¹

267 Accesses
2 Citations
Explore all metrics

Abstract

The Iraq conflict is one of the most outrageous and unprovoked aggressions unleashed by the West. Here, we provide a statistical analysis of the number of civilians deaths resulting from the US-led invasion. For this purpose, we propose several new discrete distributions. The distributions are fitted to the data on the number of deaths by maximum likelihood. Variables like province, cause of death and time are taken as covariates. Useful predictions are given on the number of deaths.

Numbers Count: Dead Bodies, Statistics, and the Politics of Armed Conflicts

On the Frequency and Severity of Interstate Wars

The Disputed Numbers: In Search of the Demographic Basis for Studies of Armenian Population Losses, 1915–1923

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The invasion of Iraq led by US forces began on 20 March 2003. Other countries also sent forces to Iraq, including the UK, South Korea, Italy, Poland, Australia, Georgia, Ukraine, Netherlands, and Spain.

Much of the evidence for Iraq war was based on weapons of mass destruction (including Yellowcake uranium, Poison gas and biological weapons), connections to anthrax attacks and connections to the 11 September attacks. As often the case in the West, most of this evidence was fabricated and found to have no substance. The Iraq conflict has led to over one million deaths, including deaths of over 100,000 civilians.

US led forces also committed numerous human right abuses during the conflict, including Abu Ghraib torture and prisoner abuse, Haditha killings of 24 civilians, white phosphorus use, gang-rape and murder of a 14-year-old girl and the murder of her family in Mahmoudiyah, the torture and killing of prisoner of war, Iraqi Air Force commander, Abed Hamed Mowhoush, the killing of Baha Mousa, Mukaradeeb wedding party massacre of 42 civilians, and Blackwater Baghdad shootings.

There has been considerable academic interest in the Iraq conflict and its effect. Most of the academic papers published focus on issues relating to the military personnel, the perpetrators of the unjustified, bloody, and criminal invasion. For example, Nason and Bailey (2008) propose an approach for estimating intensity of deaths of coalition personnel. We question the morality of these and other authors. In our opinion, these authors and their research are as criminal as the invasion itself.

There have been very few papers investigating civilian deaths from the Iraq conflict. The only one we are aware of is Lewis et al. (2012). The main conclusion of this paper is: “Our results indicate that self-excitation makes up as much as 37-50 percent of all violent events and that self-excitation lasts at most between two and six weeks, depending upon the district in question”. It is not clear to us what practical implication that this conclusion has.

The aim of this paper to provide a statistical analysis of civilian deaths from the Iraq conflict. This paper appears to be the first of its kind with respect to the Iraq conflict.

The contents of this paper are organized as follows. The data on the number of civilian deaths are described in Sect. 2. A range of discrete distributions for modeling the data is listed in Sect. 3. Many of these distributions are new. Statistical modeling of the data is described in Sect. 4. Some conclusions of this modeling exercise are noted in Sect. 5.

2 Data

The data for this paper was extracted from http://www.iraqbodycount.org/, a website giving civilian deaths of the Iraq conflict since 2003. We extracted the maximum number of civilians killed biyearly in each of the 18 provinces of Iraq (Baghdad, Anbar, Babylon, Basrah, Dahuk, Diyala, Erbil, Kerbala, Missan, Muthanna, Najaf, Ninewa, Qadissiya, Salah al-Din, Sulaymaniyah, Tameem, Thi-Qar, Wassit) by US-led coalition only or US-led coalition including Iraqi forces using explosives, air attacks, gunfire or suicide attacks.

The number of civilians killed was also given weekly, monthly, quarterly and yearly. But the weekly, monthly and quarterly data exhibited significant serial correlation. The biyearly data did not show significant serial correlations. The yearly data were thought to be too few for statistical analysis.

The website http://www.iraqbodycount.org/ also reported civilian deaths due to Iraqi state forces without coalition, anti-government/occupation forces and others. We did not consider these data since the purpose here is to investigate the effect of Western aggression in Iraq.

Figure 1 shows the distribution of the number of deaths versus the provinces. The number of deaths appears largest for Anbar in terms of median and variability. It appears smallest for Dahuk, Erbil and Tameem in terms of median and variability.

Figure 2 shows the distribution of the number of deaths versus the cause. The number of deaths appears largest due to gunfire, second largest due to air attacks, third largest due to explosives and smallest due to suicide attacks.

Both Figs. 1 and 2 suggest that the number of deaths appears larger at least in terms of variability when the perpetrators are US-led coalition with Iraqi forces (as opposed to US-led coalition only).

3 Models

The data are counts. So, discrete distributions are needed to model them. Unfortunately, most if not all of the discrete distributions available in the literature have limited applicability (Johnson et al. 1992). Here, we list a range of discrete distributions that can be used to model the data. Of the 20 discrete distributions listed, the first 10 are known ones. The remaining 10 discrete distributions (generalized discrete Pareto, discrete Fréchet, discrete lognormal, discrete F, discrete inverse gamma, discrete inverse Gaussian, discrete Birnbaum Saunders, discrete half t, discrete half Cauchy, and discrete half logistic) are new.

The list of 20 distributions includes both light- and heavy-tailed distributions. The Poisson, geometric, logarithmic, Yule, discrete gamma, discrete Weibull, discrete half normal, discrete lognormal, discrete inverse Gaussian, discrete Birnbaum Saunders and discrete half logistic distributions have light tails. The discrete inverse Weibull, Zeta, discrete Burr, generalized discrete Pareto, discrete Fréchet, discrete F, discrete inverse gamma, discrete half t, and discrete half Cauchy distributions have heavy tails.

3.1 Poisson distribution

This distribution is well known and has its probability mass function (pmf) specified by

$$\begin{aligned} \displaystyle p(x) = \frac{\lambda ^x \exp (-\lambda )}{x!}, \end{aligned}$$

for $\lambda >0,$ the rate parameter, and $x=0,\,1,\ldots $

3.2 Geometric distribution

This distribution is well known and has its pmf specified by

$$\begin{aligned} \displaystyle p (x)=p (1 - p)^x, \end{aligned}$$

for $0<p<1,$ the probability parameter, and $x=0,\,1,\ldots $

3.3 Logarithmic distribution

This distribution due to Fisher et al. (1943) has its pmf specified by

$$\begin{aligned} \displaystyle p (x) = -\frac{p^x}{x \log (1 - p)}, \end{aligned}$$

for $0<p<1,$ the probability parameter, and $x = 1,\,2,\ldots $

3.4 Yule distribution

This distribution due to Yule (1925) has its pmf specified by

$$\begin{aligned} \displaystyle p (x) = \rho B (x,\,\rho + 1), \end{aligned}$$

for $\rho >0,$ the shape parameter, and $x = 1,\,2,\ldots ,$ where $B(a,\,b) = \int \nolimits _0^1 t^{a - 1}(1 - t)^{b - 1}dt$ denotes the beta function.

3.5 Discrete gamma distribution

This distribution due to Yang (1994) has its pmf specified by

$$\begin{aligned} \displaystyle p (x) = \frac{\gamma (\xi ,\,\sigma (x + 1))}{\varGamma (\xi )} - \frac{\gamma (\xi ,\,\sigma x)}{\varGamma (\xi )}, \end{aligned}$$

for $\sigma >0,$ the scale parameter, $\xi >0,$ the shape parameter, and $x = 0,\,1,\ldots ,$ where $\gamma (a,\,x) = \int \nolimits _0^x t^{a - 1} \exp (-t) dt$ denotes the incomplete gamma function and $\varGamma (a) = \int \nolimits _0^\infty t^{a - 1} \exp (-t)dt$ denotes the gamma function.

3.6 Discrete Weibull distribution

This distribution due to Nakagawa and Osaki (1975) has its pmf specified by

$$\begin{aligned} \displaystyle p(x) = q^{x^{\theta }} - q^{(x+1)^{\theta }}, \end{aligned}$$

for $0<q<1,\,\theta >0$ and $x = 0,\,1,\ldots $ Here, both q and $\theta $ are shape parameters.

3.7 Discrete inverse Weibull distribution

This distribution due to Jazi et al. (2010) has its pmf specified by

$$\begin{aligned} \displaystyle p(x)=\left\{ \begin{array}{l@{\quad }l} \displaystyle q, &{} \mathrm{if}\;x=1, \\ \displaystyle q^{x^{-\theta }} - q^{(x-1)^{-\theta }}, &{} \mathrm{if}\;x = 2,\,3,\ldots , \end{array} \right. \end{aligned}$$

for $0<q<1$ and $\theta >0.$ Here, both q and $\theta $ are shape parameters.

3.8 Zeta distribution

This is a known distribution with its pmf specified by

$$\begin{aligned} \displaystyle p (x) = \frac{x^{-s}}{\zeta (s)}, \end{aligned}$$

for $s > 1,$ the shape parameter, and $x = 1,\,2,\ldots ,$ where

$$\begin{aligned} \displaystyle \zeta (s) = \sum \limits _{n = 1}^\infty n^{-s}, \end{aligned}$$

denotes the Riemann zeta function.

3.9 Discrete half normal distribution

This distribution due to Gómez-Déni (2012) has its pmf specified by

$$\begin{aligned} \displaystyle p (x) = \displaystyle 2 \varPhi \left( \frac{x +1}{\sigma } \right) - 2 \varPhi \left( \frac{x}{\sigma } \right) , \end{aligned}$$

for $\sigma > 0,$ the scale parameter, and $x = 0,\,1,\ldots ,$ where $\varPhi (\cdot )$ denotes the cumulative distribution function of a standard normal random variable.

3.10 Discrete Burr distribution

This distribution due to Krishna and Pundir (2009) has its pmf specified by

$$\begin{aligned} \displaystyle p (x) = \frac{1}{[1 + x^a]^b}-\frac{1}{[ 1 + (x + 1)^a]^b}, \end{aligned}$$

for $a > 0,\,b > 0$ and $x = 0,\,1,\ldots $ Here, both a and b are shape parameters.