Abstract
Goodness-of-fit tests are proposed for the case of independent observations coming from the same family of distributions but with different parameters. The most popular related context is that of generalized linear models (GLMs) where the mean of the distribution varies with regressors. In the proposed procedures, and based on suitable estimators of the parameters involved, the data are transformed to normality. Then any test for normality for i.i.d. data may be applied. The method suggested is in full generality as it may be applied to arbitrary laws with continuous or discrete distribution functions, provided that an efficient method of estimation exists for the parameters. We investigate by Monte Carlo the relative performance of classical tests based on the empirical distribution function, in comparison to a corresponding test which instead of the empirical distribution function, utilizes the empirical characteristic function. Standard measures of goodness-of-fit often used in the context of GLM are also included in the comparison. The paper concludes with several real-data examples.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Chen G, Balakrishnan N (1995) A general purpose approximate goodness-of-fit test. J Qual Technol 27: 154–161
Cox DR, Snell EJ (1968) A general definition of residuals (with discussion). J R Stat Soc B 30: 248–275
D’Agostino M, Stephens R (1986) Goodness-of-fit techniques. Marcel Dekker. Inc., New York
Davison AC (2003) Statistical models. Cambridge University Press, Cambridge
Dunn PK, Smyth GK (1996) Randomized quantile residuals. J Comput Graph Stat 5: 236–244
Epps TW, Pulley LB (1983) A test for normality based on the empirical characteristic function procedures. Biometrika 70: 723–726
Faraway JJ (2006) Extending the Linear Model with R: generalized linear. Mixed effects and nonparametric regression models. Chapman & Hall/CRC
Faraway JJ (2011) Faraway: functions and datasets for books by Julian Faraway. http://cran.r-project.org/web/packages/faraway/index.html
Hallin M, Ingenbleek J-F (1983) The Swedish automobile portfolio in 1977. A statistical study. Scand Actuarial J 49–64
Hardin JW, Hilbe JM (2007) Generalized linear models and extensions, 2nd edn. Stata Press, College Station
Henze N (1990) An approximation to the limit distribution of the Epps-Pulley test statistic for normality. Metrika 37: 7–18
Hu B, Shao J (2008) Generalized linear model selection using R 2. J Stat Plan Infer 138: 3705–3712
Loynes RM (1980) The Empirical distribution function of residuals from generalised regression. Ann Stat 8: 285–298
McCullagh P, Nelder JA (1989) Generalized linear models. 2nd edn. Chapman and Hall, London
Meintanis SG (2009) Goodness-of-fit testing by transforming to normality: comparison between classical and characteristic function-based methods. J Stat Comput Simul 79: 205–212
Mittlböck M, Heinzl H (2002) Measures of explained variation in gamma regression models. Commun Stat Simul Comput 31: 61–73
Paul SR, Deng D (2002) Score tests for goodness of fit of generalized linear models to sparse data. Sankhya 64: 179–191
R Development Core Team (2011) R: a language and environment for statistical computing. R Foundation for Statistical Computing. http://www.R-project.org
Shayib MA, Young DH (2002) Modified goodness of fit of tests in gamma regression. J Stat Comput Simul 33: 125–133
Smyth G, with contributions from Hu Y, Dunn P, Phipson B (2011) statmod: Statistical Modeling. http://CRAN.R-project.org/package=statmod
Spinelli JJ, Lockhart RA, Stephens MA (2002) Tests for the response distribution in a Poisson regression model. J Statist Plan Infer 108: 137–154
Thode HC (2002) Testing for normality. Marcel Dekker. Inc., New York
Venables WN, Ripley BD (2002) Modern applied statistics with S, 4th edn. Springer, New York
Wood GR (2002) Generalized linear accident models and goodness of fit testing. Accident Anal Prevent 34: 417–427
Zheng B (2000) Summarizing the goodness of fit of generalized linear models for longitudinal data. Stat Med 19: 1265–1275
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Klar, B., Meintanis, S.G. Specification tests for the response distribution in generalized linear models. Comput Stat 27, 251–267 (2012). https://doi.org/10.1007/s00180-011-0253-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-011-0253-5