Abstract
Multiple-list capture–recapture data can be used to estimate the size of a population. In this manuscript two problems are studied and solved using a common solution. The first problem is that the lists refer to different but overlapping populations. An example is that lists refer to different but overlapping regions, different but overlapping periods in time, or different but overlapping age groups. The second problem is that each list has a set of covariates and the sets of covariates are not identical. By considering both problems as missing data problems, a solution is obtained through the EM algorithm. This approach is illustrated by two examples.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Bishop, Y.M.M., Fienberg, S.E., Holland, P.W.: Discrete Multivariate Analysis, Theory and Practice. McGraw-Hill, New York (1975)
Chao, A., Tsay, P., Lin, S., Shau, W., Chao, D.: The applications of capture–recapture models to epidemiological data. Stat. Med. 20, 3123–3157 (2001)
Cormack, J.M.: Log-linear models for capture-recapture. Biometrics 45, 395–413 (1989)
Fienberg, S.E.: The multiple recapture census for closed populations and incomplete 2k contingency tables. Biometrika 59, 591–603 (1972)
International Working Group for Disease Monitoring and Forecasting: Capture–recapture and multiple-record systems estimation 1: history and theoretical development. Am. J. Epidemiol. 142, 1047–1058 (1995)
Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data. Wiley, New York (1987)
Sinharay, S., Stern, H., Russell, D.: The use of multiple imputation for the analysis of missing data. Psychol. Meth. 6, 317–329 (2001)
Sutherland, J.M., Schwarz, C.J., Rivest, L.-P.: Multilist population estimation with incomplete and partial stratification. Biometrics 63, 910–916 (2007)
Tsay, P.K., Chao, A.: Population size estimation for capture–recapture models with applications to epidemiological data. J. Appl. Stat. 28, 25–36 (2001)
Van der Heijden, P.G.M., Zwane, E., Hessen, D.: Schatting van aantal in Nederland verblijvende Antillianen die niet ingeschreven zijn in de GBA. Een “capture–recapture”-analyse in opdracht van het Ministerie van Justitie. Utrecht, Utrecht University, Department of Methodology and Statistics (2006)
Van der Pal, K.M., Van der Heijden, P.G.M., Buitendijk, S.E., Den Ouden, A.L.: Periconceptional folic acid use and the prevalence of neural tube defects in the Netherlands. Eur. J. Obset. Gynecol. Reprod. Biol. 108, 33–39 (2003)
Zwane, E., Van der Heijden, P.G.M.: Analysing capture–recapture data when some variables of heterogeneous catchability are not collected or asked in all registrations. Stat. Med. 26, 1069–1089 (2007)
Zwane, E., Van der Heijden, P.G.M.: Capture–recapture studies with incomplete mixed categorical and continuous covariates. J. Data Sci. 6, 557–572 (2008)
Zwane, E., Van der Pal, K., Van der Heijden, P.G.M.: The multiple-record systems estimator when registrations refer to different but overlapping populations. Stat. Med. 23, 2267–2281 (2004)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
van der Heijden, P.G.M., Zwane, E. & Hessen, D. Structurally missing data problems in multiple list capture–recapture data. AStA Adv Stat Anal 93, 5–21 (2009). https://doi.org/10.1007/s10182-008-0098-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10182-008-0098-6