Cognitive diagnosis models are used to provide diagnostic feedback to examinees and stakeholders at a finer grain size than a single test score. Many different models have been proposed, but they all share a common feature, the Q-matrix, which specifies the relationship between the J items and the K latent attributes (Tatsuoka 1983). Each entry qjk in the matrix indicates whether the kth attribute is necessary in the solution of the jth item. An examinee's performance with respect to what is measured is assumed to be influenced by a composite of the latent attributes, such that different combinations define profiles of distinct proficiency classes, characterized by the K-dimensional latent attribute vectors α1, α2, …, αC, with C = 2^K.

The validity of a cognitive diagnosis model depends on whether the K-dimensional latent attribute vector entirely determines classes of examinees so that the conditional distributions of item scores are all independent of each other after adjusting for the effect of the attributes. This property is often called local independence (e.g., Rupp et al. 2010; Lord and Novick 1968). The assumption of local independence is equivalent to the assumption that the K attributes α1, α2, …, αK span the complete latent space—that is, no latent attributes have been missed or left out. Said differently, violations of local independence indicate the possible misspecification of attributes.

The testlet effect also calls into question the assumption of local independence. A testlet is a cluster of items that share a common stimulus, such as a reading passage, and measure something additional in common (Wainer and Kiely 1987). One way to account for the testlet effect is to incorporate specific dimensions in addition to the K-dimensional latent attributes the Q-matrix specifies. Therefore, testing for local independence can serve as a diagnostic tool for detecting testlet effects as well as incorrect specifications of the latent attributes in cognitive diagnostic modeling.

In cognitive diagnosis modeling, evaluations of model-data fit provide information about the fit between the cognitive diagnosis model and the data as well as between the Q-matrix and the data (e.g., Chen et al. 2013). Various fit statistics and methods have been proposed for both types of evaluations. Some are conventional relative fit measures such as Akaike's Information Criterion (AIC), the Bayesian Information Criterion (BIC), the log likelihood, and the Bayes factor (e.g., Chen et al. 2013; Kunina-Habenicht et al. 2012; Rupp et al. 2010). Furthermore, absolute fit measures have been proposed, such as the residual between the observed and predicted Fisher-transformed correlations, the residual between the observed and predicted proportions correct, the residual between the observed and predicted log-odds ratios, and the G statistic (e.g., Chen et al. 2013; Rupp et al. 2010). These statistics and methods are limited to parametric cognitive diagnosis models because most are computed as functions of maximum likelihood estimates, and the predicted item responses are generated from the fitted model.

This article proposes the Mantel-Haenszel (MH) statistic as an index for detecting misspecification of latent attributes as well as testlet effects in nonparametric cognitive diagnosis methods. Under nonparametric methods, of course, evaluation of model-data fit is informative only about the fit between the Q-matrix and the data. The MH statistic is a well-researched tool for evaluating the conditional independence of binary variables that are stratified along the levels of a third random variable, for example, examining conditional independence of item pairs stratified along the levels of total test scores under an IRT model (Rosenbaum 1984).

The next section describes the assumption of conditional independence underlying cognitive diagnosis models and provides a brief review of nonparametric cognitive diagnosis methods. Then, the MH test of model fit is presented. Next, simulation studies are described with a wide range of conditions. Then, an analysis of real data is described. In the final section, applications and implications of the method are discussed.

1 Conditional Independence and Its Violations

Let Yij denote the binary item response of the ith examinee to the jth item, i = 1, ..., I, j = 1, ..., J. Cognitive diagnosis models describe the joint distribution of the item response vector Yi conditional on the binary attribute vector αic = {αick}, for c = 1, 2, ..., 2^K and k = 1, ..., K. Each entry αick indicates whether the ith examinee has mastered the kth attribute. Each binary entry qjk in the Q-matrix indicates whether the kth attribute is relevant for the jth item, with 1 meaning the attribute is relevant and 0 meaning it is irrelevant. The joint probability of a cognitive diagnosis model for the ith examinee is

$$ P\left({\boldsymbol{Y}}_i \mid {\boldsymbol{\alpha}}_i\right)=\prod_{j=1}^J P\left({Y}_{ij} \mid {\boldsymbol{\alpha}}_i\right). $$
(1)

Therefore, most models are required to satisfy the assumption of conditional independence among the item responses Yij given the attribute vector αic (e.g., Rupp et al. 2010), because this assumption makes it possible to evaluate the joint probability, or likelihood, of the models.
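To make the factorization in (1) concrete, the following minimal sketch computes the joint probability of a response vector under conditional independence, using the DINA model (employed later in the simulations) as the item response function; the function names and interfaces are illustrative, not from the original study.

```python
import numpy as np

def dina_item_prob(alpha, q, guess, slip):
    """P(Y_ij = 1 | alpha) under a DINA-type model: 1 - slip if all
    attributes required by the item are mastered, guess otherwise."""
    eta = np.all(alpha >= q)                    # ideal response for this item
    return 1.0 - slip if eta else guess

def joint_prob(y, alpha, Q, guess, slip):
    """Equation (1): the joint probability of response vector y given alpha
    factors into a product of item-wise probabilities."""
    prob = 1.0
    for j, y_j in enumerate(y):
        p1 = dina_item_prob(alpha, Q[j], guess[j], slip[j])
        prob *= p1 if y_j == 1 else 1.0 - p1
    return prob
```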

The assumption of conditional independence is violated when the dimensionality of the Q-matrix is incorrectly specified; more specifically, a necessary attribute may have been omitted. The assumption may also be a concern when the response to an item depends on the responses to previous items, or when items are grouped by a shared common stimulus such as a reading passage or a common scenario. Such a grouping of items is referred to as a testlet, and an additional dimension may be required to adequately model the data; this dimension would be considered a nuisance dimension because it is not substantively meaningful (e.g., Wainer and Kiely 1987). Most cognitive diagnosis models ignore the testlet effect, which may result in underspecified dimensions. Therefore, the existence of testlets calls into question the assumption of local independence. A single misfitting item j may indicate that the item is problematic or that qj is underspecified; several misfitting items may indicate that the dimensions of the Q-matrix are underspecified; misfitting items sharing a common stimulus may indicate a testlet effect.

2 Nonparametric Cognitive Diagnosis Methods

Nonparametric cognitive diagnosis methods assess examinees' mastery and nonmastery of attributes without regard to parametric form. These methods are useful in cognitive diagnosis modeling, especially when parametric model fitting is inefficient because sample sizes are too small or too large, or because the set of latent attributes is complex (Junker 2011).

One approach to nonparametric cognitive diagnosis is to apply cluster analysis to identify groups of examinees with similar patterns of latent attributes, given the assumption of a conjunctive relationship among attributes and a valid Q-matrix. Chiu et al. (2009) clustered the sum score vectors Wi = (Wi1, …, WiK) using hierarchical agglomerative and K-means clustering to produce the 2^K latent classes. Ayers et al. (2008) utilized the capability score vectors Bi = (Bi1, …, BiK), where \( B_{ik}={\sum}_j{Y}_{ij}{q}_{jk}/{\sum}_j{q}_{jk} \), instead of Wi. Park and Lee (2011) mapped item responses to an attribute matrix and then conducted K-means and hierarchical agglomerative clustering.
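As a rough sketch of the capability-score variant of Ayers et al. (2008), the fragment below computes Bik and clusters examinees into 2^K groups; the helper names are illustrative, and scikit-learn's KMeans stands in for whichever clustering routine is preferred.

```python
import numpy as np
from sklearn.cluster import KMeans

def capability_scores(Y, Q):
    """B_ik = sum_j Y_ij q_jk / sum_j q_jk: the proportion of items
    requiring attribute k that examinee i answered correctly."""
    return (Y @ Q) / Q.sum(axis=0)

def cluster_examinees(Y, Q):
    """Group examinees into 2^K classes based on their capability scores."""
    K = Q.shape[1]
    B = capability_scores(Y, Q)
    return KMeans(n_clusters=2 ** K, n_init=10).fit_predict(B)
```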

Another approach utilizes the Hamming distance technique that was originally proposed by Barnes (2003) with a valid Q-matrix. In this technique, examinees' latent attribute vectors are obtained by minimizing the Hamming distance between the observed item responses Yi and all possible ideal responses η1, η2, ⋯, ηC, C = 2^K,

$$ D\left({\boldsymbol{Y}}_i,{\boldsymbol{\alpha}}_c\right)=\sum_{j=1}^J\left|{Y}_{ij}-{\eta}_{cj}\right|. $$
(2)

Like Barnes (2003), Chiu and Douglas (2013) posited a conjunctive relationship among the attributes. Lim and Drasgow (2017) proposed an algorithm given the assumption of conjunctive, disjunctive, or compensatory relationships among attributes. The theoretical justification of this approach is that the true attribute pattern minimizes the expected distance between Yi and ηc regardless of what the true model is, under some regularity conditions (Lim and Drasgow 2017; Wang and Douglas 2015).
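A minimal sketch of this classification rule, assuming a conjunctive relationship among attributes as in Barnes (2003) and Chiu and Douglas (2013), might look as follows; the function names are illustrative.

```python
import numpy as np
from itertools import product

def ideal_responses(Q):
    """Ideal response eta_cj for each of the 2^K attribute patterns under a
    conjunctive rule: 1 iff the pattern masters every attribute item j requires."""
    K = Q.shape[1]
    patterns = np.array(list(product([0, 1], repeat=K)))   # all 2^K alpha_c
    eta = (patterns @ Q.T == Q.sum(axis=1)).astype(int)    # 2^K x J
    return patterns, eta

def classify(Y, Q):
    """Assign each examinee the attribute pattern minimizing Equation (2),
    the Hamming distance between observed and ideal responses."""
    patterns, eta = ideal_responses(Q)
    # D(Y_i, alpha_c) for every examinee-pattern pair: I x 2^K
    D = np.abs(Y[:, None, :] - eta[None, :, :]).sum(axis=2)
    return patterns[D.argmin(axis=1)]
```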

3 Mantel and Haenszel Test of Model Fit

The MH statistic χ2 introduced by Mantel and Haenszel (1959) is generally used to test for conditional independence of two dichotomous or categorical variables j and j′ by forming row-by-column contingency tables conditional on the levels of a control variable C. For IRT models, the MH statistic has commonly been used to detect differential item functioning, that is, items that function differently for two groups of examinees (called the focal and reference groups) with different experiences or backgrounds (Holland and Thayer 1988). In that procedure, the sample is stratified into C classes according to the observed total test scores.

In this study, the latent attribute vector αc = (αc1, αc2, ⋯, αcK), for c = 1, 2, …, 2^K = C, is proposed as the stratification variable. As discussed above, in cognitive diagnosis models, item responses are assumed to be independent given the correct αc, and a higher value of αc implies a higher probability that Yj = 1 for each j = 1, 2, …, J (e.g., Holland and Rosenbaum 1985). Then any pair of monotone nondecreasing functions gj(Y) and gj′(Y) of the vector of dichotomous responses Y to items j and j′ has a nonnegative conditional covariance given any monotone nondecreasing function h(αc), a result of Rosenbaum (1984).

Let \( \{ i_{jj^{\prime}c}\} \) denote the frequencies of examinees in the 2 × 2 × C contingency table for items j and j′. The marginal frequencies are the row totals \( \{ i_{1+c}\} \) and the column totals \( \{ i_{+1c}\} \), and \( i_{++c} \) represents the total sample size in the cth stratum. Strata with a total sample size \( i_{++c} \) equal to or larger than 1 are included. If any cell count in a table is 0, the Haldane correction is applied to each cell in that table to obtain a more accurate significance level for the MH test (e.g., Li et al. 1979). Under the null hypothesis of conditional independence between items j and j′, the following statistic is proposed:

$$ \mathrm{MH}\ {\chi}^2=\frac{\left(\left|\sum_c i_{11c}-\sum_c E\left(i_{11c}\right)\right|-1/2\right)^2}{\sum_c\operatorname{var}\left(i_{11c}\right)}, $$
(3)

where \( E\left(i_{11c}\right)= i_{1+c}\, i_{+1c}/ i_{++c} \) and \( \operatorname{var}\left(i_{11c}\right)= i_{0+c}\, i_{1+c}\, i_{+0c}\, i_{+1c}/\left[ i_{++c}^2\left(i_{++c}-1\right)\right]. \)

Under the null hypothesis, the test statistic approximately follows a chi-squared distribution with one degree of freedom when the sample sizes in the contingency tables become large and, in cognitive diagnosis models, when each examinee's true latent attribute vector αi is known. Mantel and Haenszel (1959) indicated that this summary chi-square reference distribution is suitable even when some strata have small counts. The statistic is therefore suitable for the analysis of sparse contingency tables, provided the overall counts for each cell in the combined table, obtained by collapsing across all C contingency tables, are sufficiently large. The null hypothesis of independence is equivalent to a common odds ratio equal to 1:

$$ {\mathrm{Odds\ ratio}}_{\mathrm{MH}\,j,{j}^{\prime}}=\frac{\sum_{c=1}^C\left(i_{11c}\, i_{00c}\right)/ i_c}{\sum_{c=1}^C\left(i_{10c}\, i_{01c}\right)/ i_c}, $$
(4)

where \( i_c = i_{11c}+i_{00c}+i_{10c}+i_{01c}. \)
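The following sketch assembles Equations (3) and (4) for a single item pair, stratifying on (estimated) attribute-pattern membership and applying the Haldane correction whenever a table contains a zero cell; the function name and interface are our own, not from the original study.

```python
import numpy as np
from scipy.stats import chi2

def mh_test(yj, yjp, strata):
    """Mantel-Haenszel test of conditional independence for two binary items.
    Returns the MH chi-square of Equation (3), its p value, and the common
    odds ratio of Equation (4)."""
    sum_obs = sum_exp = sum_var = 0.0
    or_num = or_den = 0.0
    for c in np.unique(strata):                 # strata with i_{++c} >= 1
        a, b = yj[strata == c], yjp[strata == c]
        # 2 x 2 cell counts i_11c, i_10c, i_01c, i_00c for this stratum
        i11 = np.sum((a == 1) & (b == 1))
        i10 = np.sum((a == 1) & (b == 0))
        i01 = np.sum((a == 0) & (b == 1))
        i00 = np.sum((a == 0) & (b == 0))
        if min(i11, i10, i01, i00) == 0:
            # Haldane correction: add 1/2 to every cell when any count is zero
            i11, i10, i01, i00 = i11 + .5, i10 + .5, i01 + .5, i00 + .5
        n = i11 + i10 + i01 + i00               # i_{++c}
        r1, r0 = i11 + i10, i01 + i00           # row totals i_{1+c}, i_{0+c}
        c1, c0 = i11 + i01, i10 + i00           # column totals i_{+1c}, i_{+0c}
        sum_obs += i11
        sum_exp += r1 * c1 / n                               # E(i_11c)
        sum_var += r0 * r1 * c0 * c1 / (n ** 2 * (n - 1))    # var(i_11c)
        or_num += i11 * i00 / n
        or_den += i10 * i01 / n
    stat = (abs(sum_obs - sum_exp) - .5) ** 2 / sum_var
    return stat, chi2.sf(stat, df=1), or_num / or_den
```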

4 Heuristic Justification of the Large Sample Chi-square Reference Distribution

The estimated test statistic \( \mathrm{MH}\ {\widehat{\chi}}^2 \) would have an asymptotic chi-square distribution with one degree of freedom as the true MH statistic MH χ2 would, if the true attribute vector α were known. Mantel and Haenszel (1959) asserted that under the null hypothesis, the MH χ2 has an asymptotic chi-squared distribution with one degree of freedom, under some general conditions.

It is assumed here that the number of items J is sufficiently large so that \( P\left[\widehat{\boldsymbol{\alpha}}=\boldsymbol{\alpha} \right] \) is close to 1, a result of previous theoretical studies (Lim and Drasgow 2017; Wang and Douglas 2015). A rigorous argument requires that the number of items J grows sufficiently fast with the sample size N. Note that

$$ \mathrm{MH}\ {\widehat{\chi}}^2=\mathrm{MH}\ {\chi}^2+\left(\mathrm{MH}\ {\widehat{\chi}}^2-\mathrm{MH}\ {\chi}^2\right), $$
(5)

where \( \left(\mathrm{MH}\,{\widehat{\chi}}^2-\mathrm{MH}\,{\chi}^2\right) \) represents the error due to using \( \widehat{\boldsymbol{\alpha}} \) rather than α. If the second term on the right of (5) converges in probability to zero, then the approximate MH test statistic \( \mathrm{MH}\,{\widehat{\chi}}^2 \) has the same asymptotic distribution as the desired MH statistic MH χ2. Specifically,

$$ \left( MH{\widehat{\chi}}^2-{MH\chi}^2\right)\overset{P}{\to }0\Longrightarrow MH{\widehat{\chi}}^2\overset{D}{\to }{\chi}_1^2. $$
(6)

The result in (6) is obtained if J is sufficiently large that, under the null hypothesis, the overwhelming majority of estimated attribute patterns are identical to the true attribute patterns. Finite test length and sample size properties are examined in the following simulation studies, and type I error and power rates are summarized.

5 Simulation Study

To investigate the performance of the MH statistic, a variety of simulation conditions were studied by crossing the number of examinees, the length of tests, the number of attributes, and the distribution of α under the nonparametric cognitive diagnosis model.

6 Simulation Design

For each condition, examinee attribute data for sample sizes of I = 500 or 2000 were drawn from a discretized multivariate normal distribution MVN(0K, Σ), where the covariance matrix Σ has unit variances and a common correlation of ρ = .3 or .6 (e.g., Chiu et al. 2009). The K-dimensional continuous vectors θi = (θi1, θi2, ⋯, θiK) were dichotomized by

$$ {\alpha}_{ik}=\left\{\begin{array}{cl}1,& \mathrm{if}\ {\theta}_{ik}\ge {\Phi}^{-1}\left(\frac{k}{K+1}\right);\\ 0,& \mathrm{otherwise}.\end{array}\right. $$
(7)

Test lengths of J = 20 or 40 items were studied with attribute vectors of length K = 3 or 5. The correctly specified Q-matrix for J = 20 is presented in Table 1; the Q-matrix for J = 40 was obtained by duplicating this matrix. Item response data sets were generated from the DINA model, with item parameters drawn from a uniform(0, .3) distribution. The Hamming distance-based nonparametric cognitive diagnosis method (Lim and Drasgow 2017) was used to estimate the latent attributes. A main advantage of the proposed MH procedure is that it can also be applied with parametric models, because only class membership information is necessary.
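For concreteness, a sketch of this data-generating design under the stated assumptions (attributes from a discretized MVN via Equation (7), DINA responses with slip and guess parameters drawn from uniform(0, .3)) is given below; the function name is illustrative, and the Q-matrix (e.g., from Table 1) must be supplied.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)

def simulate(I, J, K, rho, Q):
    """Generate correlated attributes via a discretized MVN (Equation (7))
    and DINA item responses with slip/guess ~ uniform(0, .3)."""
    Sigma = np.full((K, K), rho) + (1 - rho) * np.eye(K)   # unit variances, common rho
    theta = rng.multivariate_normal(np.zeros(K), Sigma, size=I)
    cut = norm.ppf(np.arange(1, K + 1) / (K + 1))          # Phi^{-1}(k / (K + 1))
    alpha = (theta >= cut).astype(int)
    slip, guess = rng.uniform(0, .3, J), rng.uniform(0, .3, J)
    eta = (alpha @ Q.T == Q.sum(axis=1)).astype(int)       # I x J ideal responses
    p = eta * (1 - slip) + (1 - eta) * guess
    return (rng.uniform(size=(I, J)) < p).astype(int), alpha
```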

Table 1 Correctly specified Q-matrix (K = 5)

6.1 Results

For each condition, sets of item response vectors were simulated for 100 replications. The proposed MH statistics and their corresponding p values were computed for all J × (J − 1)/2 item pairs in each replication. For each item pair, the proportion of the 100 replications in which its p value was smaller than the .05 significance level was recorded and summarized in the tables.
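As an illustration of this procedure for a single replication, the fragment below reuses the hypothetical simulate, classify, and mh_test helpers sketched earlier, with a placeholder Q-matrix standing in for Table 1, and computes the proportion of item pairs rejected at the .05 level.

```python
import numpy as np
from itertools import combinations

# Placeholder Q-matrix standing in for Table 1 (J = 20, K = 5); illustrative only.
Q = np.vstack([np.eye(5, dtype=int)] * 4)

Y, _ = simulate(I=500, J=20, K=5, rho=.3, Q=Q)
alpha_hat = classify(Y, Q)                           # Hamming distance classifier
strata = alpha_hat @ (2 ** np.arange(Q.shape[1]))    # encode each pattern as a stratum label
pvals = [mh_test(Y[:, j], Y[:, jp], strata)[1]
         for j, jp in combinations(range(Y.shape[1]), 2)]
rejection_rate = np.mean(np.array(pvals) < .05)      # proportion of J(J-1)/2 pairs rejected
```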

Type I Error Study

In this simulation study, the correctly specified Q-matrices (K = 5 or K = 3) were used to fit the data, to examine type I error rates. Table 2 shows that most type I error rates were near the nominal .05 significance level. Rejection rates were stable across all conditions when J = 40, consistent with the asymptotic argument above. In the condition with K = 5, J = 20, and I = 2000, the type I error rate was slightly inflated.

Table 2 Type I error study: correctly specified Q-matrix

The MH statistic with known true class membership α was also examined, because it is not confounded by estimation errors arising from the specific algorithm used to estimate the latent attributes. The rejection rates were very close to the nominal .05 significance level for all conditions.

Power Study with Underspecified Q-matrices

A data set was generated with the Q-matrix (K = 5) in Table 1, and in each replication the data were fitted with the embedded Q-matrix (K = 3; Table 3). One dimension (a total of 9 items) or two dimensions (4 items) were thus underspecified. The average power rate for item pairs in which both items were underspecified in the same dimension was .572, with power relatively consistent across all conditions. The average rejection rate for item pairs in which only one item was underspecified was .124. These findings indicate that, like the other statistics, the MH test is sensitive to Q-matrix underspecification and has moderately high power, particularly for the larger sample size.

Table 3 Power study: underspecified Q-matrices with true K = 5 and fitted K = 3

Power Study with Testlet-Dependent Data

For this simulation study, the fixed T-matrix in Table 4 was used to generate testlet-dependent data. The entry tmj of the T-matrix indicates whether the mth testlet, for m = 1, 2, ..., M, includes the jth item. For each replication, the transpose of the T-matrix was combined with the Q-matrix (K = 3) embedded in Table 1 to simulate item responses, and the model was fitted with the Q-matrix (K = 3) only. The T-matrix for J = 40 was obtained by duplicating the matrix.

Table 4 T-matrix: testlet specification (M = 2)

As shown in Table 5, high rejection rates were obtained for testlet-dependent item pairs (i.e., .922 or above), and the power rates were moderately consistent across conditions. The rejection rates of the MH statistic for item pairs in which only one item was testlet dependent were low (i.e., .088 or below). This implies that the MH test specifically flags item pairs in which both items are testlet dependent.

Table 5 Testlet-dependent data with Q-matrix (K = 3)

7 Fraction Subtraction Data

Fraction subtraction data (e.g., Tatsuoka 1983) were analyzed to investigate the performance of the MH statistic in practice. The data include the responses of 536 examinees to 20 items requiring 8 attributes. In this study, the Q-matrix (see Table 6) that originally appeared in de la Torre and Douglas (2004) was used. The specified attributes are interpreted as (1) convert a whole number to a fraction, (2) separate a whole number from a fraction, (3) simplify before subtracting, (4) find a common denominator, (5) borrow from the whole number part, (6) column borrow to subtract the second numerator from the first, (7) subtract numerators, and (8) reduce answers to simplest form.

Table 6 Q-matrix for fraction subtraction data

7.1 Results

The data were analyzed with seven different cognitive diagnosis models: the nonparametric model, the DINA model, the DINO model, the A-CDM, the saturated model, the log-linear model, and the R-RUM. Two additional fit statistics, the chi-squared statistic Xjj′ (Chen and Thissen 1997) and the absolute deviation of observed and predicted correlations rjj′ (Chen et al. 2013), were used to evaluate model-data fit. The average rejection rates over the 190 item pairs are summarized in Table 7. Interestingly, the MH statistic indicates substantially fewer model violations than the other two fit measures.

Table 7 Proportion of conditionally dependent item pairs

Table 8 reports the four most frequently rejected items for each statistic across all model settings. The results are consistent with those of Lim and Drasgow (2017): in their data-driven Q-matrix estimation study, the component-wise agreement rates between the Q-matrix implemented here and a data-driven Q-matrix were obtained as shown in Table 8. The items whose q-vectors may have been incorrectly specified were those most frequently rejected by the MH statistic. The disagreement across methods is especially noticeable for item 8, which may imply that this item was overspecified, in line with previous studies (e.g., Chen et al. 2013).

Table 8 Most frequently rejected items

8 Discussion

The significance of this study lies in proposing a test of model fit for detecting Q-matrix misspecifications and identifying testlet effects. The only requirement of the method is an estimate of the latent attributes, which serves as the stratification variable in the MH statistic. Several simulation studies investigated the usefulness and sensitivity of the MH statistic under a variety of conditions. The primary findings were that the MH test can play an important role in identifying underspecified q-vectors when the true model is unknown, and that it performs reasonably well in detecting testlet-dependent items. These results are important because ignoring such dependencies can lead to inaccurate estimates of model parameters, as shown in Table 9, as well as misclassifications of examinees (e.g., Chen et al. 2015; Rupp et al. 2010).

Table 9 Mean of absolute difference of estimated and true DINA item parameters (α with ρ = .3)

The real data analysis illustrated how the MH test can be used with different cognitive diagnosis models alongside other model fit statistics. The MH test found less misfit and was less sensitive to the choice of model. For q-vector misspecifications, it can effectively identify problematic items. When used with the other test statistics, the results can provide more detail about whether an item may be underspecified or a different model is needed for the data.

Whether the fit evaluation targets Q-matrix underspecification or testlet effects, the MH test is simple, easy to implement, and theoretically supported. The simulation results suggest that the MH statistic is a reasonably efficient test of model fit. Nevertheless, some consideration of other tests of model fit will always be desirable. Future research might include more attributes as well as more complex models. At present, however, the MH test appears to be a promising statistic for detecting local dependence in cognitive diagnosis models.