1 Introduction

Though the origin of unfolding analysis for attitude measurement is often credited to Coombs (1950), who coined the term, its history could be traced back to Thurstone’s (1927) works on attitude measurement. Despite such a long history, unfolding models have still been rarely used in applied studies until recently. One of the possible reasons is that the underlying response mechanism is slightly less intuitive than the cumulative one, which simply assumes that the higher the attitude level, the higher the response. The unfolding model of response assumes that an item rating will be higher around some ideal point on a latent attitudinal continuum, thus resulting in a single-peaked response function. The farther a person is from this ideal location, in either direction, the lower the item rating.

This response mechanism has been shown to be relevant in at least two main areas, evolutionary processes and conflictualized attitudes. Examples in the first field include: The development of moral judgements (Davison, Robins, & Swanson, 1978), of intellectual processes (DeMars & Erwin, 2003), of educational goals among students (Volet & Chalmers, 1992), and of processes of change in smoking cessation (Noel, 1999a, 1999b). Examples in the second area include: The structure of political opinions (van Schuur 1993), attitude toward capital punishment (Andrich & Luo 1993), abortion (Roberts & Laughlin 1996), and environment (Andrich & Styles 1998).

Though early developments in unfolding analysis already provided models for continuous data in the multidimensional scaling tradition (e.g., Ramsay 1980; Greenacre & Browne 1986), no probabilistic model exists (as far as we know) to unfold continuous bounded responses (CBR) in an item response theory framework. This paper aims at providing a model for CBR by which continuous responses with both a lower and an upper bound are designated. Such data may be collected by asking subjects to rate items by putting a mark at some point on a horizontal line segment with ends labeled, for instance, “0 % agreement” and “100 % agreement.” The response is then measured as the distance from the left end of the segment to the subject’s mark. But responses could also be proportions of time spent in one out of two tasks or, more generally, any kind of proportion data for which approximate continuity is arguable.

Modeling the distribution of CBR has many advantages over modeling categorical data: Their graphical representation is often more suggestive, the number of parameters to be estimated is lower in general, and the quality of estimation is better with smaller sample sizes and shorter tests (Noel & Dauvier 2007; Wang & Zeng 1998). Moreover, CBR are now very easy to collect through sliders on a computer interface, and it has been reported that subjects find continuous scales more pleasing to use (McKelvie 1978).

To model such data, one could think of dealing with the bounded nature of the response by using some ad hoc transformation (e.g., logit) that would change the response into an unbounded variable and that could then be modeled with some random-effects model (see, for instance, Samejima, 1973, 1974). By contrast, the approach adopted in the present paper is to directly model the response mechanism by which a bounded response is generated and to deduce a distributional model from this mechanism. This approach leads to a class of potentially more flexible models.

The paper is organized as follows: The interpolation response mechanism proposed by Noel and Dauvier (2007) to analyze CBR in a cumulative framework is first presented. The resulting Beta Response Model is then generalized to a Dirichlet Response Model, to deal with the situation where, besides item acceptability, two concurrent sources of refusal are potentially present in subjects’ responses. An unfolding response model is derived by pooling the two sources of refusal in the expected response. An EM algorithm for estimating the model parameters is described, and a simulation study is conducted to test the quality of parameter recovery. An application to real data (attitudes toward abortion) is finally presented.

2 An Interpolation Response Mechanism

The situation where a person rates an item in a questionnaire on a continuous bounded response scale is now considered. Define the response variable X, measured as a distance from the left boundary and then arbitrarily rescaled to lie in [0;1].

By contrast with Likert response scales, the continuous response format provides no semantic reference points except at the boundaries of the response segment. It is assumed that, in this situation, persons implicitly grant relevance values (or proximity judgments) to both extreme responses. Denote by v (D) and v (A) those psychological values, granted to the left and right ends, respectively (or Disagree and Agree extremities). By the interpolation response mechanism (Noel & Dauvier, 2007), the agreement response variable X on the unit-length segment is then assumed to be of the form

$$ X=\frac{v^{(A)}}{v^{(D)}+v^{(A)}}. $$
(1)

This may be viewed as an interpolation process: Respondents are expected to put a check at a distance of the left boundary that is proportional to the relative value they give to the extreme positive answer. This simple mechanism closely resembles commonly used models in choice theory (Luce 1959) or reinforcement learning (Herrnstein 1961).

The random part of the model is introduced by defining a density for v (D) and v (A). Because those implicit values are thought of as nonnegative quantities, it is assumed for convenience that

$$\begin{aligned} \begin{aligned}[c] v^{(A)} & \sim\varGamma(m,s), \\ v^{(D)} & \sim\varGamma(n,s). \end{aligned} \end{aligned}$$
(2)

Latent values are assumed to follow a Gamma distribution with a common scale parameter. This assumption of a common scale parameter is reasonable given that respondents have to take both reference points simultaneously into account to construct a single response. From these assumptions, assuming conditional independence, it is known that (Kotz & Johnson 1982, p. 229)

$$ X=\frac{v^{(A)}}{v^{(D)}+v^{(A)}}\sim\beta(m,n). $$
(3)

The beta density, with two parameters m and n, is defined as

$$ f(x;m,n)=\frac{\varGamma(m+n)}{\varGamma(m)\varGamma (n)}x^{m-1}(1-x)^{n-1}\quad\text{for}\ x \in[0;1],\ m,n>0, $$
(4)

with first moments

$$\begin{aligned} E(X;m,n) = & \mu=\frac{m}{m+n}, \end{aligned}$$
(5)
$$\begin{aligned} V(X;m,n) = & \frac{mn}{(m+n)^{2}(m+n+1)}=\mu(1-\mu) \biggl[\frac {1}{m+n+1} \biggr]. \end{aligned}$$
(6)

As the beta density is defined on [0;1], it is well suited to the modeling of CBR, which may be easily rescaled to lie in [0;1]. Expressions (5) and (6) show that it is relatively straightforward to define both mean and dispersion models within the beta, and this has already been derived in various flavors in the regression (Ferrari & Cribari-Neto 2004; Smithson & Verkuilen 2006; Verkuilen & Smithson 2012) or IRT (Noel & Dauvier 2007; Tamhane, Ankenman, & Yang, 2002) fields. An example of this ratio construction is illustrated on Figure 1. In this example, implicit values for both extreme responses (disagreement and agreement) are assumed to be drawn from a Γ(2,2) (expected value 1) and a Γ(8,2) (expected value 4), respectively. It is thus expected that more weight be put on agreement than on disagreement in this situation. From (3), the observed response will be generated from a β(8,2). From (5), the expected response is simply calculated as 8/(8+2)=0.8, and high ratings are expected on average.

Figure 1.
figure 1

Construction of an interpolated response. If the latent values for total disagreement and agreement are drawn from a Γ(2,2) (expected value 1) and a Γ(8,2) (expected value 4), respectively, the response will be generated from a β(8,2) (expected response 8/(8+2)=0.8).

Depending on the values of m and n, the beta may show very different shapes, as illustrated on Figure 2. For m=n=1, the density is uniform. Strong agreement becomes more likely as m is large relatively to n, and conversely. This legitimates an interpretation of m and n as “acceptance” and “refusal” parameters, respectively. For some parameter values (m<1 and n<1), it may also show a bimodal shape, which will be argued below to be especially relevant to model ambivalent attitudes.

Figure 2.
figure 2

The beta density for some parameter values.

Note that the choice of the Gamma for the value distribution is not the only one possible: Any density function defined on \(\mathbb{R}^{+}\) could be used in a similar construction. For instance, the ratio of lognormal variates is known to follow a logistic-normal (Aitchison & Shen, 1980). But the Gamma has several advantages. First, it can have a mode at zero, which does not occur with other alternative distributions (lognormal, Weibull). Second, the sum of independent Gamma is still a Gamma, and this class-invariance property will be especially useful in the unfolding construction below, where two sources of refusal will be added. Alternatives like the lognormal, Weibull, or Generalized Exponential (Gupta & Kundu 1999) do not enjoy this property. Third, the ratio construction above leads to a well-known distribution (the beta), with closed-form expressions both for the density and its first moments. This will help derive simple expressions for the expected rating curves, an important feature in the context of item response modeling. Fourth, the distribution of a ratio of Gamma variates can be bimodal, and bimodality will play a special role in the model to be presented below.

3 The Beta Unfolding Model

3.1 A generic Dirichlet Response Model

We consider the usual psychometric situation where N persons give ratings x ij on a set of J items (i=1,…,N, j=1,…,J). Following Andrich and Luo (1993) and Verhelst and Verstralen (1993), an unfolding response process is generated each time a person is likely to refuse an item for two opposite reasons (“I cannot wholeheartedly support either side of the abortion debate”). In this context, the response is assumed to be determined by an implicit assessment of three possible extreme attitudes: Fully Agree (A), Fully Disagree for the first reason (D 1), and Fully Disagree for the second (opposite) reason (D 2). These are what Andrich and Luo (1993) named the “Agree”, “Disagree from below”, and “Agree from above” attitudes, respectively. Assume that for each item j, subject i implicitly assesses these three potential attitudes and grants them latent affinity values \(v_{ij}^{(A)}\), \(v_{ij}^{(D_{1})}\), and \(v_{ij}^{(D_{2})}\).

In the line of the preceding section, the following conditional, subject-specific, gamma random model is considered:

$$\begin{aligned} \begin{aligned}[c] v_{ij}^{(A)} & \sim\varGamma(m_{ij},s), \\ v_{ij}^{(D_{1})} & \sim\varGamma(p_{ij},s), \\ v_{ij}^{(D_{2})} & \sim\varGamma(q_{ij},s). \end{aligned} \end{aligned}$$
(7)

As the scale parameter s will not play any role in the following, it is indifferent whether it is considered subject (or item) specific or not.

If made explicit in the response format (for instance, by instructing subjects to mark a dot in a unit-side ternary plot), these implicit assessments would lead subject i answering item j to give three observed responses, say x ij , y ij , and z ij (or 1−x ij y ij ). In this situation, a straightforward generalization of the binary interpolation response mechanism presented above assumes that subjects would pick a position in the response triangle that is at the weighted average of the triangle vertices, using latent values as weights. On a unit-side ternary plot, this defines a random response vector of the form

$$(X_{ij},Y_{ij},Z_{ij})'= \biggl( \frac {v_{ij}^{(A)}}{v_{ij}^{(A)}+v_{ij}^{(D_{1})}+v_{ij}^{(D_{2})}},\frac {v_{ij}^{(D_{1})}}{v_{ij}^{(A)}+v_{ij}^{(D_{1})}+v_{ij}^{(D_{2})}},\frac {v_{ij}^{(D_{2})}}{v_{ij}^{(A)}+v_{ij}^{(D_{1})}+v_{ij}^{(D_{2})}} \biggr)'. $$

This is illustrated in Figure 3 for three contrasted attitudes. From the distributional assumptions in (7), a well-known result states that the three-dimensional response vector will be Dirichlet-distributed:

$$ (X_{ij},Y_{ij},Z_{ij})\sim\operatorname{Dir}(m_{ij},p_{ij},q_{ij}). $$
(8)
Figure 3.
figure 3

Ternary plot of an unfolding response mechanism.

In the spirit of item response theory, a connection with a latent dimension and with person attitude (θ i ) and item location (δ j ) parameters is obtained by constructing the following structural model:

$$ \begin{cases} m_{j} = \exp\lambda_{j},\\ p_{ij} = \exp[\theta_{i}-\delta_{j} ],\\ q_{ij} = \exp\bigl[-(\theta_{i}-\delta_{j})\bigr], \end{cases} $$
(9)

where λ j is an item-specific acceptability parameter. The exponential in these expressions simply ensures that m j , p ij , and q ij parameters take on strictly positive values. The refusal parameters p ij and q ij are defined as increasing (p ij ) and decreasing (q ij ) functions of person-item distance on some latent continuum, modeling the interplay between two sources of refusal operating in opposite directions. In the line of Andrich and Luo (1993), the assumed response mechanism is essentially a double-refusal process. This takes into account that unfolding items are usually unlikely to be positively endorsed for themselves, but rather because refusability is low, for any one of two opposite reasons. But although acceptability is here modeled as a constant, item-specific characteristic, this will be shown below to lead to a subject-varying acceptance function, through the corresponding expectation function.

An illustrative plot of three-dimensional responses sampled from the Dirichlet Response Model is provided in Figure 4, for three items located at δ 1=+1, δ 2=0, and δ 3=−2, drawing latent attitudes θ i from a standard Gaussian. The contradictory nature of the two sources of refusal, as implied by this model, is apparent on the middle panel of Figure 4: When there is disagreement with the item, it is either on the “Disagree from below” or on the “Disagree from above” sides, but not on both (hence, the white area between bottom-left and top-right vertices).

Figure 4.
figure 4

Ternary density plots of sampled responses from the Dirichlet Response Model. Middle panel: When there is disagreement with the item, it is either on the “Disagree from below” or “Disagree from above” sides, but not both, hence the whiter area between both disagreement vertices.

3.2 An Unfolding Model

In the item rating context, where a simple Disagree–Agree response scale is provided, effects of the two sources of refusal are actually collapsed into a single observed response. From the properties of the Gamma, under the hypothesis of conditional independence across value assessments, the Dirichlet enjoys an interesting aggregation property: The response vector remains Dirichlet distributed upon aggregating some of the response components, the corresponding Dirichlet parameters simply being summed up.

Define the total disagreement value \(v_{ij}^{(D)}\) as

$$v_{ij}^{(D)}=v_{ij}^{(D_{1})}+v_{ij}^{(D_{2})}. $$

Under the conditions mentioned above, we then have

$$v_{ij}^{(D)}\sim\varGamma(n_{ij},s)\quad\text{with}\ n_{ij}=p_{ij}+q_{ij}, $$

and the observed response X ij on the unit response segment is such that X ij |θ i ∼Dir(m j ,p ij +q ij ), that is, a Beta distribution β(m j ,n ij ). Note that this aggregation property, inherited from the Gamma, does not exist in general for other distributions on the positive half-line, like the logistic-normal, as already mentioned by Aitchison and Shen (1980).

A Beta Unfolding Model (BUM) is thus fully specified as

$$ \begin{cases} X_{ij}|\theta_{i} \sim\beta(m_{j},n_{ij}),\\ m_{j} = \exp\lambda_{j},\\ n_{ij} = \exp[\theta_{i}-\delta_{j} ]+\exp\bigl[-(\theta _{i}-\delta _{j})\bigr]. \end{cases} $$
(10)

From the properties of the beta density, the expected response function for item j is

$$\begin{aligned} E(X_{ij}|\theta_{i}) = & \mu_{ij} \\ = & \frac{m_{j}}{m_{j}+n_{ij}} \end{aligned}$$
(11)
$$\begin{aligned} = & \frac{\exp\lambda_{j}}{\exp\lambda_{j}+2\cosh(\theta _{i}-\delta_{j})}. \end{aligned}$$
(12)

Interestingly, it is exactly of the hyperbolic cosine form independently derived for dichotomous data by Andrich and Luo (1993) and Verhelst and Verstralen (1993). Plots of the expected rating curve and the corresponding value functions are provided in Figure 6. The λ j parameter here has the same interpretation of an item-specific acceptability parameter. But this quantity does not take the same importance for all values of θ i in the expected response, and this leads to an attitude-varying model of acceptance. Note that, by contrast with Bernoulli models, the resulting response function in (12), although bounded by 0 and 1, is not a probability function, but an expectation function.

3.3 Generalized Forms of the Beta Unfolding Model

It may appear in practice desirable to model response expectation and variance in a (relatively) decoupled manner. Because the variance is directly related to the sum of the canonical parameters m j +n ij (see Equation (6)) in a beta model, the model variance is easily modulated by introducing an additional scaling parameter ϕ j =expτ j in m j and n ij such that

$$ \begin{cases} m_{j} = \exp(\lambda_{j}+\tau_{j}),\\ n_{ij} = \exp\bigl[(\theta_{i}-\delta_{j})+\tau_{j}\bigr]+\exp \bigl[-(\theta _{i}-\delta_{j})+\tau_{j}\bigr]. \end{cases} $$
(13)

This will leave the expectation function unchanged:

$$\begin{aligned} E(X_{ij}|\theta_{i}) = & \frac{\exp(\lambda_{j}+\tau_{j})}{\exp (\lambda_{j}+\tau_{j})+\exp[(\theta_{i}-\delta_{j})+\tau_{j} ]+\exp[-(\theta_{i}-\delta_{j})+\tau_{j} ]} \\ = & \frac{\exp\lambda_{j}}{\exp\lambda_{j}+2\cosh(\theta _{i}-\delta_{j})}, \end{aligned}$$
(14)

and the variance function now reads

$$\begin{aligned} V(X_{ij}|\theta_{i}) = \frac{\mu_{ij}(1-\mu_{ij})}{1+\phi_{j} [\exp\lambda_{j}+2\cosh(\theta_{i}-\delta_{j}) ]}. \end{aligned}$$
(15)

The effects of varying λ j and τ j on response density and expectation are illustrated in Figure 5. Each of them mostly affects response expectation and variance, respectively. Higher values of λ correspond to higher expected responses (item acceptability increases), and higher values of τ correspond to a smaller response dispersion, given the true attitude. As will be further commented below, this parameterization suffices to create a family of flexible response functions with varying peakedness.

Figure 5.
figure 5

Expectation rating curves and conditional response density functions. Varying λ (from upper to lower panels) and varying τ (from left to right panels) affects response expectation and conditional variance, respectively.

Of course, the λ j and τ j are both item-specific parameters that are not separable in (13). With no loss of generality, one may reparameterize as

$$ \begin{cases} m_{j} = \exp\lambda_{j}^{*},\\ n_{ij} = \exp\bigl[(\theta_{i}-\delta_{j})+\tau_{j}\bigr]+\exp \bigl[-(\theta _{i}-\delta_{j})+\tau_{j}\bigr], \end{cases} $$
(16)

with \(\lambda_{j}^{*}=\lambda_{j}+\tau_{j}\). For convenience, \(\lambda_{j}^{*}\) will be simply written λ j in what follows. The over-parameterization in n ij is practically dealt with by imposing ∑ i θ i =0.

It may also be convenient, for interpretation or comparison purposes, that the attitude distribution be scaled to a fixed unit, for instance, by imposing \(\sum_{i}\theta_{i}^{2}=N\). In this case, a common slope constant will play the role of a (positive) scaling factor:

$$ \begin{cases} m_{j} = \exp\lambda_{j},\\ n_{ij} = \exp\bigl[\alpha(\theta_{i}-\delta_{j})+\tau_{j} \bigr]+\exp \bigl[-\alpha(\theta_{i}-\delta_{j})+\tau_{j} \bigr]\\ \phantom{n_{ij}} = 2\phi_{j}\cosh\bigl[\alpha(\theta_{i}-\delta _{j}) \bigr]. \end{cases} $$
(17)

The expected response function becomes

$$\begin{aligned} E(X_{ij}|\theta_{i}) = \frac{\exp\lambda_{j}}{\exp\lambda _{j}+2\phi_{j}\cosh[\alpha(\theta_{i}-\delta_{j}) ]}. \end{aligned}$$
(18)

Although tempting, there is no need to free the scaling constant and make it item-specific (i.e., a discrimination parameter), as is common in Bernoulli response models. This would result in high parameter redundancy and lead to numerical estimation problems. Actually, the (λ,τ) parameterization already produces sufficiently flexible response curves (see Figure 5). Similar remarks may be found in Verhelst and Verstralen (1993) about a Bernoulli unfolding model with very similar equations.

3.4 Bimodality

An unusual feature of the BUM is the ability to model bimodal response density functions. This will occur each time one has simultaneously m j <1 and n ij <1 for an item j and some range of θ, that is,

$$ \begin{cases} \lambda_{j} < 0,\\ \tau_{j} < -\ln\bigl[2\cosh\bigl(\alpha(\theta_{i}-\delta_{j}) \bigr) \bigr]. \end{cases} $$
(19)

In the following, such items will be referred to as bimodal response density (BRD) items, as opposed to unimodal response density (URD) items. It should be noted that this terminology characterizes the shape of the conditional response density for a fixed level of attitude, and not the shape of the expected response function, which is of course always single-peaked in an unfolding model. Value and response functions for an URD item (δ j =0, λ j =2, α=1, τ j =0) and a BRD one (δ j =0, λ j =−0.8, α=1, τ j =−2) are displayed on the top and bottom rows of Figure 6. These plots deserve some attention, as they summarize most of the BUM features. On each row, the left panel shows the value functions (i.e., acceptability and refusal parameters as functions of attitude), and the right panel displays the corresponding expected and modal response curves. Contour lines of the response density have been added to the response function graph (as thin lines on the right panels), to help see the difference between both cases.

Figure 6.
figure 6

Value functions and response isodensity plots of unimodal (top row) and bimodal (bottom row) response density items. Note: Thick lines are expected (plain) and modal (dotted) response curves, thin plain lines are response density contours. Top row: “R” and “A” mark regions where refusal or acceptation dominates. Bottom row: “U” and “B” mark unimodal and bimodal response density regions, respectively.

In the URD case (top row), both value functions (m j and n ij ) remain above one (left panel), and the conditional response density (right panel) is never bimodal: The response density is high around the modal response curve (curve of higher density responses) and decreases as more extreme responses (given θ) are considered above and below this curve. Note that, as a result of the dissymmetry of the beta, the modal response curve will always lie at some distance of the expectation, except at the two attitudes values where acceptability and refusal have equal intensities (see dots on the plots); the beta is perfectly symmetric at these points (along the vertical dotted lines). This is to keep in mind to correctly interpret real data plots, where the expected rating curve will seem not to pass “through the middle of the data” (but the modal response curve will). Although somewhat puzzling at first glance, it should be noted that the very same phenomenon exists with binomial models but the effect (i.e., the difference between and (N+1)π in a B(N,π) model) is so small that it is almost invisible. From one end of the attitudinal continuum to the other, the response density evolves from shapes where refusal dominates (for the first reason), then acceptance, and then refusal again (for the second reason). The acceptance region (marked as “A” on the plot), defined by m j >n ij , is bounded by the critical attitude values δ j ±α −1acosh[0.5expλ j ], for which agreement and refusal are equal and E(X ij )=0.5.

In the BRD case (bottom row), the conditional response density function is unimodal only for low or high values of θ i δ j (refusal regions) but becomes bimodal when both m j and n ij fall below 1 (left panel). This defines a central interval on θ (marked as “B” in Figure 6), bounded by the critical values \(\delta_{j}\pm\alpha^{-1}\operatorname{acosh} [0.5\exp(-\tau_{j}) ]\). For any given θ i in this interval, both extreme responses are more probable than the expected one. This may be viewed from contour plots of response density (right panel), for instance, for θ=0; the expected response at this point (plain curve) has a density lower than 1 when both extreme responses (X ij =0 and X ij =1) have densities greater than 1. Within this interval, the density is bimodal and perfectly symmetric when X ij =0.5. At these points (see dots on the right panel), extreme responses (close to 0 or 1) have the same (highest) density. In this example, the modal response curve (thick dotted line) becomes a step function, suddenly switching from one extreme to the other of the response scale as θδ j increases (this is why it could not fully replace the expected response function for item characteristics description).

Bimodality, be it symmetric or not, is clearly an unusual feature for an IRT model, but it may help model response choices among ambivalent subjects. The model implies that if you do not have strong reasons to either endorse or reject an item (i.e., both m j and n ij are low), then your response choice is likely to be chaotic and suddenly bifurcate from one extreme to the other of the response scale. This also means that, in the bimodal case, agreement is considered more unstable than refusal. This seems reasonable in the unfolding context, as intermediate attitudes are conflicting by nature, and probably more difficult to endorse.

If bimodality is not expected to occur, from theoretical considerations, or simply not desired for the data at hand, one may want to impose either one of m j ≥1 or n ij ≥1 ∀i for a given item j, in the estimation process, that is,

$$ \begin{cases} \lambda_{j} \ge0,\\ \tau_{j} \ge-\ln\bigl[2\cosh\bigl(\alpha(\theta_{i}-\delta_{j}) \bigr) \bigr]. \end{cases} $$
(20)

Anyone of these conditions suffices. Note that the constraint on τ j in (20) should hold for any value of θ i . Because the hyperbolic cosine is U-shaped with a minimum of 2expτ j at θ i δ j =0, the minimal constraint on this second condition is to impose τ j ≥−ln2. In practice, we will look for the minimal scaling factor to apply simultaneously on m j and n ij (at θ i =δ j ) to achieve at least one of these conditions. This approach affecting both m j and n ij by the same scaling factor is not the only choice possible for preserving unimodality, but has the advantage to leave the expectation function unchanged.

In any case, that the bimodality feature is present has to be tested on the dataset at hand. In practice, both the constrained and unconstrained versions of the model may be fitted and compared on the basis of some goodness-of-fit measure (see below). If bimodality is a feature of the data, the unconstrained BUM is likely to capture it and should show better fit.

4 Parameter Estimation

4.1 Analysis of the Loglikelihood Surface

For a given item j with parameters Δ j and the response set X j , the BUM loglikelihood reads

$$ \ln L(\varTheta,\varDelta _{j}|X_{j})=-\sum _{i=1}^{N}\ln B(m_{j},n_{ij})+(m_{j}-1) \sum_{i=1}^{N}\ln x_{ij}+\sum _{i=1}^{N}(n_{ij}-1) \ln(1-x_{ij}), $$
(21)

where B(⋅) is the Eulerian Beta function.

The item log-likelihood function in (21) is basically a sum of conditionally independent beta log-likelihoods. As a member of the exponential family, the univariate beta has a globally concave log-likelihood with respect to its canonical parameters. The BUM log-likelihood is therefore globally concave with respect to m j and n ij parameters and has a unique maximum. A condition for the preservation of a unique maximum, when the log-likelihood is reexpressed as a function of BUM structural parameters, is that these parameters (up to an orthogonal transformation) all appear in one-to-one strictly monotone transforms. This is obviously the case for m j =expλ j . To see that this is also the case for δ j and τ j , it is convenient to reparameterize the (unscaled) model as

$$\begin{cases} m_{j} = \exp\lambda_{j},\\ n_{ij} = \exp\bigl[\theta_{i}-\delta_{1j} \bigr]+\exp\bigl[-(\theta _{i}-\delta_{2j}) \bigr], \end{cases} $$

where δ 1j and δ 2j threshold parameters are obtained by rotation/dilation in the (δ j ,τ j ) parameter subspace as

$$\begin{cases} \delta_{1j} = \delta_{j}-\tau_{j},\\ \delta_{2j} = \delta_{j}+\tau_{j}. \end{cases} $$

These parameters do appear through a strictly monotonic (exponential) transform, so unimodality of the log-likelihood surface is preserved. Of course, this ensures that a global maximum exists but does not preserve concavity. This is illustrated in Figure 7, where partial views of the BUM log-likelihood surface are plotted for each possible pair of item parameters, the other two parameters being held fixed at their true values, for a sample item.

Figure 7.
figure 7

Partial views of an item log-likelihood surface (λ=−1, δ 1=2, δ 2=−2, α=1). Note: True parameter values and the corresponding log-likelihood values are plotted as crosses at the bottom and top of the graph.

To construct these plots, a thousand attitude values have been generated, equally spaced between −4 and +4, and responses have been drawn from a BUM with λ=−1, δ 1=2, δ 2=−2, and α=1. Contour lines for a number of fixed log-likelihood values have been added at the bottom of each plot, to help see that the corresponding parameter values do not necessarily form strictly convex sets, for instance, in the (δ 1,δ 2) subspace (the stretching effect of the exponential transforms in the δ 1 and δ j directions is visible). However, concavity is not necessary to obtain good convergence properties, provided that the log-likelihood surface is unimodal, and the estimation procedure detailed below appears to work quite well in practice.

4.2 Estimation Procedure

As multimodal or nonsymmetric attitude distributions have sometimes been reported in previous unfolding studies (Andrich 1995; Roberts, Donoghue, & Laughlin, 2000a; Roberts, Rost, & Macready, 2000b), it seems unwise to assume a normal prior on θ in the general case. A finite mixture approach is adopted (Woodruff & Hanson 1996), where the ability random variable is discretized to a set Θ of a priori defined θ k (k=1,…,K) values with probabilities π k iteratively estimated along with item parameters (Mislevy, 1984). This approach may also be used with the π k weights held fixed at values from some a priori defined probability model, if desired.

Let f(x i |Δ,π) be the density of an observed response vector x i , given the whole set Δ of item parameters and the vector π of latent scores probabilities. When the latent variable is discrete, this density is expressed as a finite mixture:

$$ f(\mathbf{x}_{i}|\varDelta ,\pi)=\sum_{k}f( \mathbf{x}_{i}|\theta_{k},\varDelta )\pi_{k}. $$
(22)

An EM algorithm is used to estimate model parameters, which implies computing at each step s the expectation

$$ Q\bigl(\varDelta ,\pi|\varDelta ^{(s)},\pi^{(s)}\bigr)=E_{\theta} \Biggl\{ \ln\Biggl[\prod_{i=1}^{N}f( \mathbf{x}_{i},\theta_{i}|\varDelta ,\pi) \Biggr]|\mathbf{X}, \varDelta ^{(s)},\pi^{(s)} \Biggr\} $$
(23)

in the E step and maximizing it with respect to item parameters and latent score probabilities in the M step.

Woodruf and Hanson (1996) show that, in the finite mixture reformulation, this expectation may be rewritten as the sum of two quantities,

$$ \varPhi_{1}(\varDelta )=\sum_{k=1}^{K} \sum_{i=1}^{N} \Biggl\{ \ln\Biggl[\prod _{j=1}^{p}f(x_{ij}| \theta_{k},\varDelta _{j}) \Biggr]p\bigl(\theta_{k}| \mathbf{x}_{i},\varDelta ^{(s)},\pi^{(s)}\bigr) \Biggr\} $$
(24)

and

$$ \varPhi_{2}(\pi)=\sum_{k=1}^{K} \ln(\pi_{k})\sum_{i=1}^{N}p\bigl( \theta_{k}|\mathbf{x}_{i},\varDelta ^{(s)}, \pi^{(s)}\bigr), $$
(25)

each of which depending only upon Δ and π, respectively.

In the E step, the posterior probability that the ith response vector results from a θ k attitude level is computed as

$$ p\bigl(\theta_{k}|\mathbf{x}_{i},\varDelta ^{(s)}, \pi^{(s)}\bigr)=\frac{f(\mathbf{x}_{i}|\theta_{k},\varDelta ^{(s)})\pi_{k}^{(s)}}{\sum_{k'=1}^{K}f(\mathbf{x}_{i}|\theta _{k'},\varDelta ^{(s)})\pi_{k'}^{(s)}}. $$
(26)

In the M step, Φ 1 is maximized with respect to any item-specific η j parameter by setting to zero the corresponding partial derivative

$$ \frac{\partial\varPhi_{1}(\varDelta )}{\partial\eta_{j}}=\sum _{k=1}^{K}\sum _{i=1}^{N}\frac{\partial\ln f(x_{ij}|\theta_{k},\varDelta _{j})}{\partial\eta_{j}}p\bigl( \theta_{k}|\mathbf{x}_{i},\varDelta ^{(s)}, \pi^{(s)}\bigr), $$
(27)

where the quantity p(θ k |x i ,Δ (s),π (s)) computed in the previous step is taken as fixed.

Finally, maximizing Φ 2 with respect to the π k , subject to the constraint that ∑ k π k =1, leads to the updated estimates

$$ \pi_{k}^{(s+1)}=\frac{1}{N}\sum _{i=1}^{N}\frac{f(\mathbf{x}_{i}|\theta_{k},\varDelta ^{(s)})\pi _{k}^{(s)}}{\sum_{k'=1}^{K}f(\mathbf{x}_{i}|\theta_{k'},\varDelta ^{(s)})\pi_{k'}^{(s)}}. $$
(28)

The partial derivatives of lnf(x ij |θ k ,Δ j ) with respect to δ j , λ j , and τ j are (see Appendix):

$$\begin{aligned} \frac{\partial\ln f(x_{ij}|\theta_{k},\varDelta _{j})}{\partial\delta _{j}} = & -2\alpha\phi_{j}\sinh\bigl[\alpha( \theta_{k}-\delta_{j}) \bigr] \\ & {}\times\bigl\{ \varPsi(m_{j}+n_{kj})- \varPsi(n_{kj})+\ln(1-x_{ij}) \bigr\} , \end{aligned}$$
(29)
$$\begin{aligned} \frac{\partial\ln f(x_{ij}|\theta_{k},\varDelta _{j})}{\partial\lambda _{j}} = & m_{j} \bigl\{ \varPsi(m_{j}+n_{kj})- \varPsi(m_{j})+\ln x_{ij} \bigr\} , \end{aligned}$$
(30)
$$\begin{aligned} \frac{\partial\ln f(x_{ij}|\theta_{k},\varDelta _{j})}{\partial\tau _{j}} = & n_{kj} \bigl\{ \varPsi(m_{j}+n_{kj})- \varPsi(n_{kj})+\ln(1-x_{ij}) \bigr\} , \end{aligned}$$
(31)

where Ψ(x)=lnΓ(x)/∂x is the digamma function, and ϕ j =expτ j .

Because log-transformed scores appear in the expressions above, responses that are exactly 0 or 1 cannot be dealt with as such. Moreover, it should be noted that in the unconstrained version of the BUM (i.e., m j and n ij are potentially lower than 1), exponents on x ij and 1−x ij may be negative, and the density is no more defined for the extreme responses x ij =0 or x ij =1. Though it is not sure whether absolute agreement or disagreement with an item is psychologically plausible, extreme responses are of course observed in real data, and both issues are simply addressed by arbitrarily replacing 0 and 1 responses by numerical values arbitrarily closed to 0 and 1 (for instance, 10−3 and 1–10−3). Although other proposals have been made to deal with extreme responses in beta models (Smithson & Verkuilen 2005), we found in practice this slight correction to induce the smallest bias in the estimation of BUM parameters (see the simulation section).

Several iterative methods may be used to maximize ϕ 1 and ϕ 2. In the light of the discussion on the log-likelihood surface above, the Newton–Raphson algorithm does not seem appropriate. But the limited-memory quasi-Newton (L-BFGS-B) algorithm (Byrd, Lu, Nocedal, & Zhu, 1995) was found in practice to be very fast, numerically accurate, and flexible for managing parameter boundary constraints. In this algorithm, a negative definite approximation of the Hessian is numerically estimated at each iteration from the last t previous iterations (t=10 in the following). This is implicitly done by maintaining in memory a history of the last t parameter vectors and gradients.

Although only gradient functions are needed for using the L-BFGS-B algorithm, exact expressions of the expected second-order cross-derivatives of the mixture log-likelihood with respect to item parameters are provided in the Appendix. The resulting 3×3 matrices are used at the end of the estimation process to compute standard errors of estimates for each item, by computing the inverse of minus these matrices and taking the square roots of their diagonal elements.

4.3 Initial Values of Estimates

Row and column scores from a correspondence analysis of the data matrix, centered and scaled with respect to row scores mean and variance, were found to provide a very good initial solution (\(\theta_{i}^{(0)}\) and \(\delta_{j}^{(0)}\)) for θ i and δ j parameters. In the finite mixture formulation adopted here, the \(\theta_{i}^{(0)}\) are then used to estimate initial values of the latent probabilities π k , as relative frequencies of K fixed width intervals, over some a priori defined domain (e.g., [−4.5;+4.5]), outliers being included in the extreme intervals. The K interval midpoints are taken as the corresponding discrete θ k attitude values. At step 0, all λ j parameters are arbitrarily set to 1, and all τ j are set to 0.

At the end of each estimation cycle over all items, the estimated distribution of the θ k is centered around zero and scaled to unit variance (the corresponding observed standard deviation is used to fix the common slope α). The δ j parameters are consequently adjusted.

4.4 Estimation of Latent Attitudes

Once item parameters have been estimated, they are taken as fixed in the estimation of attitude parameters. First (and often very good) estimates are already provided by the expected value of the Θ posterior distribution:

$$ \hat{\theta}_{i}=\sum_{k=1}^{K} \theta_{k}p(\theta_{k}|\mathbf{x}_{i},\varDelta , \pi). $$
(32)

These can be still improved by maximizing the joint likelihood with respect to all θ i , all item parameters being fixed. Denoting by L(Θ,Δ|X) the joint likelihood of the BUM, we have:

$$ \ln L(\varTheta,\varDelta |\mathbf{X})=\sum_{i=1}^{N} \sum_{j=1}^{p}\ln f(x_{ij}| \theta_{i},\varDelta _{j}). $$
(33)

Person parameters are obtained by setting to zero the first derivative:

$$\begin{aligned} \frac{\partial\ln L(\varTheta,\varDelta |\mathbf{X})}{\partial\theta _{i}} = & 2\sum_{j=1}^{p} \alpha\phi_{j}\sinh\bigl[\alpha(\theta_{i}- \delta_{j}) \bigr] \\ & {}\times\bigl\{ \varPsi(m_{j}+n_{ij})- \varPsi(n_{ij})+\ln(1-x_{ij}) \bigr\} . \end{aligned}$$
(34)

A Fisher scoring scheme is obtained by using

$$ E \biggl[\frac{\partial^{2}\ln L(\varTheta,\varDelta |\mathbf {X})}{\partial\theta_{i}^{2}} \biggr]=4\sum_{j=1}^{p} \alpha^{2}\phi_{j}^{2}\sinh^{2} \bigl[ \alpha(\theta_{i}-\delta_{j}) \bigr] \bigl\{ \varPsi'(m_{j}+n_{ij})-\varPsi'(n_{ij}) \bigr\} $$
(35)

in an iterative procedure updating the θ i as follows:

$$ \hat{\theta}_{i}^{(t)}=\hat{\theta}_{i}^{(t-1)}-E \biggl[\frac{\partial^{2}\ln L(\varTheta,\varDelta |\mathbf {X})}{\partial\theta_{i}^{2}} \biggr]^{-1} \biggl[\frac{\partial \ln L(\varTheta,\varDelta |\mathbf{X})}{\partial\theta_{i}} \biggr]. $$
(36)

4.5 Information Functions

The test information function reads

$$\begin{aligned} I(\theta_{i}) = & -E \biggl[\frac{\partial^{2}\ln L(\varTheta ,\varDelta |\mathbf{X})}{\partial\theta_{i}^{2}} \biggr] \\ = & 4\sum_{j=1}^{p}\phi_{j}^{2} \alpha^{2}\sinh^{2} \bigl[\alpha(\theta_{i}- \delta_{j}) \bigr] \bigl\{ \varPsi'(n_{ij})- \varPsi'(m_{j}+n_{ij}) \bigr\} . \end{aligned}$$
(37)

The item information function

$$\begin{aligned} I_{j}(\theta_{i}) = & 4\phi_{j}^{2} \alpha^{2}\sinh^{2} \bigl[\alpha(\theta_{i}- \delta_{j}) \bigr] \bigl\{ \varPsi'(n_{ij})- \varPsi'(m_{j}+n_{ij}) \bigr\} \\ = & I_{j}^{(1)}(\theta_{i})\times\bigl\{ I_{j}^{(2)}(\theta_{i}) \bigr\} \end{aligned}$$
(38)

is displayed in Figure 8 for various values of λ j and τ j (item location is fixed to δ=0). From the properties of the hyperbolic sine and the trigamma, the functions

$$\begin{aligned} I_{j}^{(1)}(\theta_{i}) = & 4 \phi_{j}^{2}\alpha^{2}\sinh^{2} \bigl[ \alpha(\theta_{i}-\delta_{j}) \bigr], \\ I_{j}^{(2)}(\theta_{i}) = & \varPsi'(n_{ij})-\varPsi'(m_{j}+n_{ij}) \end{aligned}$$

are both second-order functions of θ i , and so I j (θ i ) is of fourth order, as has been observed with categorical unfolding models (Andrich & Luo, 1993; Roberts & Laughlin, 1996). It is null for θ i δ j =0 and increases, up to some optimum, with increasing values of |θ i δ j |. Because the trigamma is strictly monotone decreasing on \(\mathbb{R}^{+}\), higher values of ϕ j (or τ j ) and λ j are essentially associated with higher information levels.

Figure 8.
figure 8

Theoretical item information functions for varying values of λ and τ parameters. Note: λ j values are varied from left to right, and τ j values from top to bottom (note the different y-scale on the bottom row).

5 A Simulation Study

A simulation study was conducted to test the accuracy of parameter estimates obtained by the algorithm described in the previous section, which was implemented in the R programming language (R Core Team 2013). Three hundreds simulated datasets were generated for each of the 16 conditions resulting from factorially combining four levels of sample size (100, 250, 500, and 1000 observations) with four levels of test length (10, 15, 20, and 30 items). For each replication, subject parameters where drawn from a normal distribution and then exactly centered on zero and scaled to unit variance. Item locations were fixed equally spaced between −2 and +2. Dispersion and acceptance parameters were drawn at random from the uniform distributions in the [−3;1], and [−1.5;1.5] intervals, respectively. The scale parameter was fixed to 1.5. These choices resulted from preliminary experience with available datasets (see next section) but also ensured that bimodal response density was possible (τ j and λ j are potentially lower than −ln2 and 0, respectively). For the estimation of latent attitudes, the number of latent scores θ k was fixed to K=30, over the [−4.5;4.5] domain.

Several measures of estimation quality have been computed. Correlations (R) between true and estimated parameters are reported in Table 1. For a given item parameter (η), the root mean square error (RMSE), the signed bias (BIAS), and the standard error (SE) of estimates have been computed as follows:

$$\begin{aligned} \mathit{RMSE} = & \Biggl[\frac{1}{300}\sum_{r=1}^{300}( \eta-\hat{\eta}_{r})^{2} \Biggr]^{\frac{1}{2}}, \\ \mathit{BIAS} = & \frac{\sum_{r=1}^{300}(\eta-\hat{\eta}_{r})}{300}, \\ \mathit{SE} = & \Biggl[\frac{1}{300}\sum_{r=1}^{300}( \hat{\eta}_{r}-\bar{\eta})^{2} \Biggr]^{\frac{1}{2}}, \end{aligned}$$

where \(\hat{\eta}_{r}\) is the parameter estimate obtained at the rth replicate, and \(\bar{\eta}\) is the averaged estimate over all 300 samples. These quantities are reported in Table 2. The squared bias is easily recovered from Table 2 by computing BIAS 2=RMSE 2SE 2.

Table 1. Correlations between true and estimated parameters.
Table 2. Root mean square error, signed bias, and standard errors of item parameter estimates.

On the whole, the estimation algorithm performed very well, larger sample sizes being associated with increased accuracy in item parameter estimation, and larger test lengths with higher correlations between true and estimated attitude values. The average correlation between true and estimated attitude values was never lower than 0.96 (Table 1).

As to item locations, the RMSE values reported in the present analysis are lower than those usually reported for categorical unfolding models for the same sample size and test length. For comparison, Roberts and Laughlin (1996) report an RMSE=0.217 for item location estimates obtained in the Graded Unfolding Model for N=500 (J=20, 6 response categories). This is in the order of magnitude of what is observed for N=100 in the present simulation. Roberts, Donoghue, and Laughlin (1998) report RMSE values for δ estimates in the Generalized Graded Unfolding Model that fall below 0.10 for at least N=2000. This is achieved for N=500 in the present study. Wang and Zeng (1998) report RMSE values around 0.08 for item location recovery in Samejima’s Continuous Response Model for N=500, which is quite comparable to what is obtained here, for an unfolding model. So the algorithm seems to perform satisfactorily. We also note that the bias is quite negligible as far as the item locations are concerned.

The acceptability parameters had the smallest standard errors in the estimation but also the highest (positive) bias. This remains acceptable however (the larger bias value was 0.038 for N=100 and J=10) and seems to be well compensated by the sample size, the bias values falling below 0.01 as soon as N≥500. Conversely, dispersion parameters had the highest standard errors and a slight positive bias (which appears negligible with regard to the standard errors). These SE values fall below 0.10 only for N=1000, so τ j are the parameters in this model for which there is most uncertainty.

Finally, if for practical purposes, one mainly considers the quality of estimation of attitude and item location parameters, then in the light of this simulation study, the minimum requirements would be to take N≥500 and J≥10 to get good estimates. This is an encouraging result, in conformity with what Noel and Dauvier (2007) already observed with the cumulative Beta Response Model (BRM): Working on continuous data do help to get good levels of accuracy.

6 Application on Real Data

A fifty-item questionnaire on attitude toward abortion (Roberts et al. 2000a; Roberts, Lin, & Laughlin, 2001) was translated into French and submitted to 443 subjects. For these sample size and test length, results in the previous section suggest that the quality of estimation should be acceptable. The subjects had to report their responses on a web interface written in PHP/Javascript. The response scale appeared as a 480-pixel horizontal slider. On page loading, the initial positions of the sliders were drawn at random, as was the item ordering. The recorded responses were divided by 480 to lie in the [0;1] interval, and the responses that were exactly 0 or 1 were replaced by 10−3 and 1−10−3.

A succession of four BUM models were then fitted to the resulting dataset, constraining or not the response density to be everywhere unimodal and estimating the latent distribution or fixing it to be Gaussian. Goodness-of-fit measures (log-likelihood and Akaike Information Criterion) for each of these models are reported in Table 3, along with the numbers of estimated and fixed parameters. All parameters that had been constrained during the estimation process were considered as fixed in the computation of the AIC.

Table 3. Goodness-of-fit measures for several BUM models.

Estimating the latent distribution gave better fit than fixing it to be normal, in both conditions of constraint on the response density: A plot of the estimated score distribution (Figure 9) suggests a slight positive skewness. This dissymmetry is made apparent in Figure 9 by superimposing a kernel smoothed version of the estimated distribution with a standard Gaussian curve.

Figure 9.
figure 9

Observed and smoothed distribution of estimated latent scores. Note: A Gaussian curve is added for comparison purposes.

Relaxing the unimodality constraint on the response conditional density also resulted in better fits. This strongly suggests that some kind of bimodal response process does underlie the data, where for a given attitude (essentially in the ambivalence region illustrated in Figure 6, bottom panels), two opposite manifest responses are highly (and potentially equally) likely (this is further discussed below). The estimated item parameters are reported in Table 4 for a subset of 15 items, approximately evenly spaced along the dimension. Asymptotic standard errors are also reported, and they appear of the order of magnitude of what was observed in the simulation study (for N=500). As is common in IRT estimation, the standard errors slightly increase for extreme item locations, which might be related to the fact that fewer subjects lie in this region of the latent dimension, and so less information is available. For information, classical item INFIT and OUTFIT measures (Wright & Masters 1982) are also reported in the table, but more work is needed to explore the properties and behavior of these goodness-of-fit measures with the BUM.

Table 4. Item parameter estimates for the abortion data.

Items are clearly scaled from positive (“43. Restrictions should never be placed on a woman’s right to an abortion.”) to negative (“13. Abortion is unacceptable under any circumstances.”) attitudes toward abortion, ambivalent items lying in the middle range (“34. Abortion should be a woman’s choice, but should never be used simply due to its convenience.”). Observed data (as dots), response density contours (as thin plain lines), and expectation functions (thick gray lines) for these items are plotted on Figure 10. In interpreting these plots, one should keep in mind that the BUM expected response function in (18) is not supposed to go “through the middle” of the data: Because the beta is not symmetric in general, the distribution expectation always lies at some distance from the mode (see Figure 6, thick lines).

Figure 10.
figure 10

Observed data, response density contours, and expectation function for the abortion data. The thick plain lines display the expected rating curve, the thin lines the response density contours as functions of attitude. Figures in the contour lines are the density values at that particular level.

The response density contours (thin lines) help appreciate how response density varies both with response value and latent attitude. For example, a subject whose attitude θ on the latent dimension is close to the location of item 47 (“Abortion is a reasonable alternative if a woman feels that having a baby might ruin her life”, δ 47=−0.934) is more likely to provide a response near the boundaries of the response scale (0 or 1), where the response density is above 1, than in the middle range, where the response density is lower than 1. Another view of this phenomenon is provided by the response density surface plots in Figure 11 for items 48 (“Abortion should be an accepted mechanism for family planning.”), 29 (“Sometimes I am in favor of a woman’s right to abortion, but at other times I am not.”), 30 (“I cannot whole-heartedly support either side of the abortion debate.”), and 6 (“Abortion is a threat to our society.”). Along the response axis, for attitudes in the neighborhood of the item location (and especially for θ=δ), the density displays a characteristic “saddle” profile, the surface going upwards for extreme responses. This bimodality feature was in fact also present in Noel and Dauvier’s (2007) BRM but judged unrealistic or nonpsychologically plausible by these authors (and not apparent on their mood data). The results reported above show that it may indeed appear in attitude data.

Figure 11.
figure 11

Response density surfaces.

7 Discussion

The use of continuous bounded responses (CBR) in psychological measurement may have a number of advantages. A first aspect is parsimony. The number of parameters to be estimated under the BUM favorably compares to the requirements of categorical models in general, which depend upon the chosen number of response categories. At a maximum, 3p−1 item parameters are to be estimated under the BUM, which is comparable to a Graded Unfolding Model with fixed slopes and four response categories (Roberts et al. 2000a). Item and test information levels in categorical response models are known to increase with the number of response categories available. But the advantages of arbitrarily increasing this number are rapidly neutralized by the need of larger sample sizes to get acceptable levels of estimation quality. This is no more an issue when the set of possible responses is potentially infinite and statistically modeled with a continuous density.

The Beta Unfolding Model for CBR proposed in this paper is a direct extension of Noel and Dauvier’s (2007) cumulative BRM. The model is derived both from a hypothetical interpolation response mechanism and from the hypothesis of two opposite sources of item refusal being collapsed. These two sources of refusal have been made explicit in a three-component Dirichlet model and then collapsed to obtain a (two-component) beta response model, taking advantage of the aggregation property of the Gamma. Because its parameters are readily interpretable in terms of acceptance and refusal, the beta is well suited to the modeling of Agree–Disagree responses on a continuous response scale. The reexpression of these distribution parameters in terms of structural parameters (person attitude and item location, dispersion, and acceptability) is also straightforward, through exponential transforms. The resulting expected response function has the same shape as the ones already proposed for binary data by Verhelst and Verstralen (1993) and Andrich and Luo (1993), and this model may be viewed as a generalization of these earlier proposals to the continuous case.

In theory, other beta unfolding models could be constructed in a similar way by defining a suitable U-shaped function of the person-item distance in the n ij refusal parameter. The square (Andrich 1989) or any power (Hoijtink 1993) of the (absolute) person-item distance could be used, and the resulting expected response function would have a ratio form quite similar to the one proposed by these authors. However, the choice made in the present paper was to define separate and explicit components for both refusal sources. Besides the fact that it is easily done through a Dirichlet model, given its additivity property, this also leaves open the possibility to define potentially different submodels on the two refusal terms (for instance, one source of refusal may have a stronger impact than the other). This possibility is still to be explored.

The most salient and intriguing aspect of the BUM is the ability to model bimodal response distributions, given θ. More research is needed to see how useful this feature can be in applied studies. But this did happen on all items in the analysis of abortion data reported in this paper: Persons located near the item position tended, for the same attitude value, to provide opposite responses. From the properties of the model, we interpret this result as indicating that subjects in the neighborhood of an item location did not find so strong reasons to either agree or disagree with it (although agreement dominates). For those person and item parameter values, the expected response appears as the least likely. Although counterintuitive, this model feature allows a new distinction between firm and unstable acceptance for the same attitude and observed response. This is potentially measured by the response conditional variance, given the attitude (Equation (15)), and is made possible (by contrast with Bernoulli models) by the fact that the beta has two parameters. We believe these features to be especially meaningful in the field of attitude measurement and, in particular, in the modeling of sudden bifurcation in the choice of a response among ambivalent subjects.

It could be a matter of debate in general whether a bimodal response distribution reveals some kind of chaotic response process, or simply an extreme (dichotomous) response process, the middle range of the response scale simply not being used. In the present case, examination of the abortion items plots in Figure 10 shows that, for the same item, subjects did use the whole range of the response scale at some points of the attitudinal continuum (see Item 47 for example, around θ=0), and not at others (around θ=−1), so that this phenomenon is attitude-dependent, as predicted by the model. Whatever the interpretation, would this binary response process appear, possibly for some items and not for others, the model would be flexible enough to accommodate both cases simultaneously.

It would probably be easier to illustrate bifurcative response processes in the field of behavior change, where sudden changes are sometimes expected to occur, and explicitly modeled as such (van der Maas & Molenaar, 1992). Preliminary results in the analysis of smoking cessation data (Noel 2009) strongly suggest that bimodality may be given a substantial interpretation, at least in some contexts. In this research, bimodality appeared specifically for those smokers that were about to quit. This is a strong incentive to explore the response bimodality issue further, in particular, in the domains of behavior change and conflictual attitude measurement.