Abstract
We introduce a family of goodness-of-fit statistics for testing composite null hypotheses in multidimensional contingency tables. These statistics are quadratic forms in marginal residuals up to order r. They are asymptotically chi-square under the null hypothesis when parameters are estimated using any asymptotically normal consistent estimator. For a widely used item response model, when r is small and multidimensional tables are sparse, the proposed statistics have accurate empirical Type I errors, unlike Pearson’s X 2. For this model in nonsparse situations, the proposed statistics are also more powerful than X 2. In addition, the proposed statistics are asymptotically chi-square when applied to subtables, and can be used for a piecewise goodness-of-fit assessment to determine the source of misfit in poorly fitting models.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Agresti, A. (2002). Categorical data analysis (2nd edn.). New York: Wiley.
Bartholomew, D.J., & Knott, M. (1999). Latent variable models and factor analysis (2nd edn.). London: Arnold.
Bartholomew, D.J., & Leung, S.O. (2002). A goodness-of-fit test for sparse 2p contingency tables. British Journal of Mathematical and Statistical Psychology, 55, 1–5.
Bartholomew, D.J., & Tzamourani, P. (1999). The goodness of fit of latent trait models in attitude measurement. Sociolological Methods and Research, 27, 525–46.
Bentler, P.M. (1995). EQS. Encino, CA: Multivariate Software.
Bishop, Y.M.M., Fienberg, S.E., & Holland, P.W. (1975). Discrete multivariate analysis. Cambridge, MA: MIT Press.
Bock, R.D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37, 29–1.
Bock, R.D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443–59.
Cai, L., Maydeu-Olivares, A., Coffman, D.L., & Thissen, D. (2006). Limited information goodness of fit testing of item response theory models for sparse 2p tables. British Journal of Mathematical and Statistical Psychology, 59, 173–94.
Christoffersson, A. (1975). Factor analysis of dichotomized variables. Psychometrika, 40, 5–2.
Cochran, W.G. (1952). The X2 test of goodness of fit. Annals of Mathematical Statistics, 23, 315–45.
Collins, L.M., Fidler, P.L., Wugalter, S.E., & Long, J. (1993). Goodness-of-fit testing for latent class models. Multivariate Behavioral Research, 28, 375–89.
Diener, E., Emmons, R.A., Larsen, R.J., & Griffin, S. (1985). The Satisfaction with Life Scale. Journal of Personality Assessment, 49, 71–5.
Drasgow, F., Levine, M.V., Tsien, S., Williams, B., & Mead, A. (1995). Fitting polytomous item response theory models to multiple-choice tests. Applied Psychological Measurement, 19, 143–65.
D’Zurilla, T.J., Nezu, A.M., & Maydeu-Olivares, A. (2002). Manual of the social problem-solving inventory-Revised. North Tonawanda, NY: Multi-Health Systems.
Fraser, C., & McDonald, R.P. (1988). NOHARM: Least squares item factor analysis. Multivariate Behavioral Research, 23, 267–69.
Glas, C.A.W. (1988). The derivation of some tests for the Rasch model from the multinomial distribution. Psychometrika, 53, 525–46.
Glas, C.A.W. (1999). Modification indices for the 2-PL and the nominal response model. Psychometrika, 64, 273–94.
Glas, C.A.W., & Verhelst, N.D. (1989). Extensions of the partial credit model. Psychometrika, 54, 635–59.
Godambe, V.P. (Ed.) (1991). Estimating functions. Oxford: Oxford University Press.
Joe, H. (1997). Multivariate models and dependence concepts. London: Chapman & Hall.
Jöreskog, K.G. (1994). On the estimation of polychoric correlations and their asymptotic covariance matrix. Psychometrika, 59, 381–89.
Jöreskog, K.G., & Moustaki, I. (2001). Factor analysis of ordinal variables: A comparison of three approaches. Multivariate Behavioral Research, 36, 347–87.
Jöreskog, K.G., & Sörbom, D. (2001). LISREL 8. Chicago: Scientific Software.
Koehler, K., & Larntz, K. (1980). An empirical investigation of goodness-of-fit statistics for sparse multinomials. Journal of the American Statistical Association, 75, 336–44.
Kramp, U. (2006). Effects of the number of response options on personality rating scales. Unpublished doctoral dissertation. University of Barcelona.
Lee, S.Y., Poon, W.Y., & Bentler, P.M. (1995). A two-stage estimation of structural equation models with continuous and polytomous variables. British Journal of Mathematical and Statistical Psychology, 48, 339–58.
Lord, F.M., & Novick, M.R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149–74.
Maydeu-Olivares, A. (2001a). Limited information estimation and testing of Thurstonian models for paired comparison data under multiple judgment sampling. Psychometrika, 66, 209–28.
Maydeu-Olivares, A. (2001b). Multidimensional item response theory modeling of binary data: Large sample properties of NOHARM estimates. Journal of Educational and Behavioral Statistics, 26, 49–9.
Maydeu-Olivares, A. (2005). Further empirical results on parametric vs.non-parametric IRT modeling of Likert-type personality data. Multivariate Behavioral Research, 40, 275–93.
Maydeu-Olivares, A. (2006). Limited information estimation and testing of discretized multivariate normal structural models. Psychometrika, 71, 57–7.
Maydeu-Olivares, A., & Joe, H. (2005). Limited and full information estimation and goodness-of-fit testing in 2n contingency tables: A unified framework. Journal of the American Statistical Association, 100, 1009–020.
Muthén, B. (1978). Contributions to factor analysis of dichotomous variables. Psychometrika, 43, 551–60.
Muthén, B. (1984). A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators. Psychometrika, 49, 115–32.
Muthén, B. (1993). Goodness of fit with categorical and other nonnormal variables. In K.A. Bollen, & J.S. Long (Eds.), Testing structural equation models (pp. 205–34). Newbury Park, CA: Sage.
Muthén, L., & Muthén, B. (2001). MPLUS. Los Angeles: Muthén & Muthén.
Rao, C.R. (1973). Linear statistical inference and its applications. New York: Wiley.
Reiser, M. (1996). Analysis of residuals for the multinomial item response model. Psychometrika, 61, 509–28.
Reiser, M., & Lin, Y. (1999). A goodness-of-fit test for the latent class model when expected frequencies are small. In M. Sobel, & M. Becker (Eds.), Sociological methodology 1999 (pp. 81–11). Boston: Blackwell.
Reiser, M., & VandenBerg, M. (1994). Validity of the chi-square test in dichotomous variable factor analysis when expected frequencies are small. British Journal of Mathematical and Statistical Psychology, 47, 85–07.
Samejima, F. (1969). Calibration of latent ability using a response pattern of graded scores. Psychometrika Monograph Supplement, No. 17.
Schott, J.R. (1997). Matrix analysis for statistics. New York: Wiley.
Teugels, J.L. (1990). Some representations of the multivariate Bernoulli and binomial distributions. Journal of Multivariate Analysis, 32, 256–68.
Thissen, D., & Steinberg, L. (1986). A taxonomy of item response models. Psychometrika, 51, 567–77.
Tollenaar, N., & Mooijaart, A. (2003). Type I errors and power of the parametric bootstrap goodness-of-fit test: Full and limited information. British Journal of Mathematical and Statistical Psychology, 56, 271–88.
van der Linden, W.J., & Hambleton, R.K. (Eds.) (1997). Handbook of modern item response theory. New York: Springer-Verlag.
Zhao, Y., & Joe, H. (2005). Composite likelihood estimation in multivariate data analysis. Canadian Journal of Statistics, 33, 335–56.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research has been supported by the Department of Universities, Research, and Information Society (DURSI) of the Catalan Government, by grant BSO2003-08507 of the Spanish Ministry of Science and Technology, and an NSERC Canada grant. We are grateful to the referees for comments leading to improvements.
Rights and permissions
About this article
Cite this article
Maydeu-Olivares, A., Joe, H. Limited Information Goodness-of-fit Testing in Multidimensional Contingency Tables. Psychometrika 71, 713–732 (2006). https://doi.org/10.1007/s11336-005-1295-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11336-005-1295-9