Keywords

1 Vagueness and Individual Differences

Whether a woman of 1m75 is tall or chess is a sport are questions without single, matter-of-fact answers. tall and sport are vague words, meaning that there is no established demarcation of the instances to which they apply and the instances they do not apply to. Individuals can use these words in different ways without committing an error (Kölbel 2004; Raffman 2014; Wright 1995). This “permissible variation” becomes readily apparent in categorization tasks in which the participants diverge widely regarding the instances they feel these words apply to (Black 1937; McCloskey and Glucksberg 1978). Among psychology undergraduates, the odds in favour of calling a woman of 1m75 tall are 65:35 (Verheyen et al. 2018) and the odds in favour of considering chess a sport are about 50:50 (Verheyen et al. 2010), for instance.

The idea that some of these individual differences in categorization are systematic and can be brought back to properties of the participants, has often been entertained, but seldom demonstrated. Both Barsalou (1993) and Smith and Samuelson (1997) have suggested that in addition to the context language users find themselves in, their individual learning histories, aptitudes, and dispositions influence their categorization behavior. The idea also finds support in the work of Gardner (1953) and Verheyen et al. (2010), who showed that participants display relatively stable categorization patterns across tasks. Which properties of the individuals are responsible for the observed stability? And to what extent can individual differences in categorization be attributed to different participant properties?

Questions like these naturally fit an experimental perspective on vagueness. A comparison of the categorization behaviour of two groups of participants who differ in a property of interest, allows one to determine if the property under investigation systematically affects the participants’ word use. The work described in this chapter is an illustration of how one might effectively do that. Inspired by a quote from Chomsky: “Word meaning is intimately bound up with matters of knowledge and belief” (1980: 225), we chose to study the effect of education on vaguenessFootnote 1.

The effect of education on semantic tasks has rarely been explicitly investigated (see Rosenzweig 1964, for a notable exception), unless educational level is used as a proxy for verbal skill (Kuperman et al. 2013). Participants’ level of education is regularly included as a control variable in semantic norming studies, though (e.g., De Witte et al. 2015; Loonstra et al. 2001). In these studies the transition from compulsory to higher education regularly yields pronounced effects (Charchat Fichman et al. 2009; Keuleers et al. 2015). For illustrative purposes, we too therefore decided to focus on the categorization differences between participants who went on to complete higher education after compulsory education and those who did not.

In what follows we describe how a property such as the level of education of participants can be brought into a formal account of vagueness. This account takes the form of a statistical model that makes a number of assumptions regarding the manner participants arrive at a categorization decision and the kind of differences one might expect to see herein. Indeed, while the vagueness of words comes with variation in their application by different individuals, it is not the case that anything goes. The latter would render communication among language users impossible. An explanation of the permissible individual variation should thus be an intrinsic part of any formal account of vagueness (Black 1937). The proposed statistical model works by characterizing categorization differences between groups as vagueness in criteria or vagueness in degree.

2 Vagueness in Criteria and Degree

Devos (1995, 2003) distinguishes vagueness in criteria and vagueness in degree (see also Alston 1964; Kennedy 2013; Machina 1976). He defines vagueness in criteria as the indeterminacy with respect to (the combination of) the conditions for application of a term. Individuals might employ different criteria for establishing whether an activity is a sport or not. While some might emphasize that sports require physical activity, others might require an element of competition. According to the first criterion hiking, but not chess, is likely to be considered a sport, while the reverse holds when the second criterion is employed. Devos (1995, 2003) defines vagueness in degree as the extent to which a term can be applied given that the conditions have been determined. Even when individuals agree on which criterion to employ to establish whether an activity is a sport or not, they might still disagree as to whether a particular activity sufficiently meets that criterion. That is, while some might deem both hiking and running sufficiently demanding, others might feel that only running requires sufficient physical activity to be considered a sport.

Devos (1995, 2003) argues that vagueness in criteria coincides primarily with nouns and vagueness in degree with adjectives. The rationale behind this is that while for many adjectives there exists a unique criterion that determines application (e.g., height for tall, price for expensive), many criteria can be considered for applying a noun like sport (competitiveness, physical activity, …). This line of reasoning ignores the fact, however, that (a) many adjectives too are multifaceted (Kamp 1975; Klein 1980; Sassoon 2012) and (b) the application of many multifaceted nouns is effectively governed by one-dimensional constructs such as typicality or similarity to the target category (Hampton 1998, 2007; McCloskey and Glucksberg 1978). We therefore see no reason to expect a principled relationship between vagueness in criteria and nouns on the one hand, and vagueness in degree and adjectives on the other.

What Devos (1995, 2003) terms vagueness in degree is addressed in the so-called threshold theory (Hampton 1995, 1998, 2007; Raffman 1994, 1996). The threshold theory provides a psychological account of individual differences in categorization. It does so by positing that categorizers position a threshold that separates members from non-members on a dimension along which the candidate items are organized. The threshold theory thus assumes that categorization is governed by a single latent dimension, which can be composed of one or more (weighted) substantial criteria (see Égré 2017; Keefe 2000, for similar assumptions)Footnote 2. In the work of Raffman (1994, 1996), who is mainly interested in adjectives such as rich and red, the dimension reflects a single substantial criterion such as dollar amounts or wavelengths. In the work of Hampton (1995, 1998, 2007), who is primarily concerned with noun categories such as fruits, vehicles, and sports, the dimension is thought to reflect the items’ typicality or similarity to the target category, which can be shown to reflect a weighted combination of several substantial criteria (Dry and Storms 2010; Hampton 1979; Rosch and Mervis 1975; see De Deyne et al. 2014, for evidence that the same holds for multifaceted adjectives). Crucially, however, in both accounts all categorizers are assumed to rely on the same dimension for their categorization and only to differ with respect to the threshold they employ. When categorizers use different thresholds on the same dimension, they are essentially diverging on the degree to which the target term applies, given fixed conditions (be it wavelength for red, or a weighted combination of competitiveness and activity level for sports).

Although vagueness in criteria lay outside its original scope, the threshold theory can easily be extended to encompass it, by allowing not only the threshold, but also the dimension that is relied upon for categorization, to be subject to individual variation. Hampton (2006) already alluded to this possibility when he showed that the weighting of substantial criteria may differ from person to person (see Hampton and Passanisi, 2016; Verheyen and Storms 2013; Zee et al. 2014, for additional empirical support). The resulting dimensions of categorization could differ rather subtly when the same criteria are merely accentuated differently, or more profoundly when they reflect the use of distinct criteria as in the sports example at the beginning of this section. Both are potential instantiations of vagueness in criteria; the latter being a special case which amounts to setting to zero the weights of all potential criteria, except the one that is relied on.

2.1 A Formalization of Threshold Theory

Verheyen et al. (2010) provided a formalization of the threshold theory that can be applied to binary categorization tasks, in which categorizers go through a set of candidate items, indicating whether (1) or not (0) these belong to the target category. Like the threshold theory, the model assumes a single dimension along which the candidate items and the categorizers’ thresholds are positioned and it is their relative position that determines the answers. Unlike the original theory, the thresholds in the model do not deterministically separate members from non-members.

The model encompasses a free parameter β for each one of the items i and a free parameter θ for each one of the categorizers c. The βi values reflect a rank ordering of the items according to the propensity with which they are endorsed (the more often i is endorsed as a category member, the higher βi is). The θc values reflect a rank ordering of the categorizers according to the number of items they endorsed (the fewer items c endorses, the higher θc is). As such, the values of βi and θc can respectively be interpreted as the extent to which item i meets the categorization criteria and categorizer c’s threshold (Verheyen et al. 2010). The better an item meets the categorization criterion, the further it is positioned on the dimension. The higher the requirements for category membership categorizers’ impose, the further their thresholds θc are positioned along the dimension.

Both the item positions (βi) and the thresholds (θc) are estimated from the categorization data. As such, the model does not presume any a priori knowledge about the underlying dimension, although it can be subsequently interpreted by inspecting the relative positions of the items indicated by βi. Items are positioned further along the dimension the better they meet the categorization criteria. If physical activity thus governed the categorization decisions for sport, hiking would be found to the right of chess. If competition governed the decisions, the relative positions of hiking and chess would be reversed.

According to the model formula in Eq. (1), the more the position of the item (βi) surpasses the position of the categorizer’s threshold (θc) along the dimension, the higher the probability that categorizer c considers item i a category member will be (and vice versa):

$$ { \Pr }\left( {Y_{ci} = 1} \right) = \frac{{{\text{e}}^{{ \beta_{i} - \theta_{c} }} }}{{1 + {\text{e}}^{{ \beta_{i} - \theta_{c} }} }}. $$
(1)

The membership function thus starts of at 0 (clear non-member) for items that fall short of the categorizer’s threshold (βi ⋘ θc) and steadily increases until 1 (clear member) for items that clearly surpass it (βi ⋙ θc). The threshold θc thus does not rigorously separate members from non-members into, but rather reflects the point at which the categorizer is indifferent with respect to the category membership decision. The probability that the model assigns to the categorization answer for an item that coincides with the threshold θc is 0.5, making the decision effectively a coin toss. By casting them as stochastic variables, the model leaves room for some statistical variation in the categorization answers, which is particularly interesting for explaining intra-individual differences in categorization (see Verheyen et al. 2010, for details); but this does not mean that categorizers are completely free to fill in the meaning of a word. The categorization patterns are constrained by the positions of the items (βi), which reflect how the items score on the criteria that are relevant for category membership. Like the threshold theory, the model assumes that the grounds for making the categorization decisions are shared by all categorizers. In Eq. (1) the agreement on which items score high/low on the relevant criteria is reflected in a single set of βi estimates. That is, there is one dimension that governs categorization. As such, the model is only concerned with degree differences in categorization. It can be naturally extended to encompass criteria differences, though.

2.2 A Formalization of Criteria and Degree Differences

The model in Eq. (1) makes establishing whether a participant’s group membership (e.g., compulsory vs. higher education) affects vagueness in criteria and degree tangible. Vagueness in degree would show in a meaningful difference in the threshold positioning θc of the members of the two groups. This would indicate that one group applies the term under investigation more conservatively than the other group does. Vagueness in criteria would show in a meaningful difference in the item positioning βi of the two groups. This would indicate that different dimensions govern categorization in the groups. In order to establish whether two groups employ different criteria for categorization, the modeling framework again requires no intuitions about potential criteria. It suffices that the estimated item positions are different for the two groups, as this constitutes evidence that the used criteria are not the same. The nature of these criteria can later be investigated through interpretation of the items’ relative positions in the manner described above.

To make statistical inferences regarding the existence of both types of vagueness feasible, the model in Eq. (1) needs to be extended as neither the threshold positions, nor the item positions are group dependent. In addition to indices c and i to indicate individual categorizers and items, respectively, an index g is introduced to make a distinction between groups. In order to establish whether group membership is a source of vagueness in degree, we assume that the thresholds follow a normal distribution with a group-specific mean and variance: \( \theta_{cg} \sim N\left( {\mu_{{\theta_{g} }} , \sigma_{{\theta_{g} }}^{2} } \right). \) By constraining \( \mu_{{\theta_{1} }} \) to equal 0, \( \mu_{{\theta_{2} }} \) can be thought of as the mean threshold difference between the groups (compulsory vs. higher education). If \( \mu_{{\theta_{2} }} \) is reliably different from zero, this constitutes evidence for vagueness in degree resulting from education.

To establish whether group membership is a source of vagueness in criteria, we introduce binary latent indicators Di that signal whether individuals who have the same threshold but are from different groups have a different probability of categorizing item i as a category member and therefore require a separate βi estimate. If Di equals 0 the model reads:

$$ { \Pr }\left( {{\text{Y}}_{cig} = 1 \left| { {\text{D}}_{i} } \right. = 0} \right) = \frac{{{\text{e}}^{{ \beta_{i} - \theta_{cg} }} }}{{1 + {\text{e}}^{{ \beta_{i} - \theta_{cg} }} }}. $$
(2)

βi has no index g here, signaling that the position of item i is the same in both groups.

If Di equals 1 the model reads:

$$ { \Pr }\left( {{\text{Y}}_{cig} = 1 \left| { {\text{D}}_{i} } \right. = 1} \right) = \frac{{{\text{e}}^{{ \beta_{ig} - \theta_{cg} }} }}{{1 + {\text{e}}^{{ \beta_{ig} - \theta_{cg} }} }}. $$
(3)

βi does receive an additional index g here, signaling that item i is positioned differently in the two groups. When the modeling procedure uncovers items with different positions in the two groups, this constitutes evidence for vagueness in criteria resulting from education.

3 Method

In order to demonstrate the modeling framework, we will re-analyze part of the data from Verheyen et al. (2018) in which adult participants who were recruited online completed a categorization task. The re-analysis is restricted to the data from female participants, who make up the majority of the participants sample (55%), since previous research has shown that gender produces both vagueness in degree and vagueness in criteria (Stukken et al. 2013). We want to avoid mistaking gender differences for education differences.

3.1 Participants

The selection of participants is comprised of 1036 adult female participants aged 18—92 (M = 56.82, SD = 16.10) from Flanders (Belgium). Since not all participants completed all categories, the actual number of participants per category ranges between 1004 and 1011. Sixty-four percent of the participants indicated to have obtained a diploma beyond secondary education, which is the compulsory level in Flanders. They make up the higher education group.Footnote 3

3.2 Materials

The materials were Dutch translations of the categories and items in Hampton et al. (2006). In what follows we will use the original English terms to refer to them. The materials included eight categories with 24 items each. The majority of these items were borderline items for the target categories, but clear members and non-members were included as well to make the task more natural. The categories will be printed in small capitals (fruits, vegetables, fish, insects, sports, sciences, tools, and furniture) and the items in italic (avocado, garlic, shrimp, maggot, chess, economics, funnel, and piano, are examples of borderline items for the respective categories). A list of the original materials along with their Dutch translation can be found in the Appendix.

3.3 Procedure

The data were gathered through a web survey. The participants completed a categorization task in which they were asked to indicate for the eight categories whether the 24 candidate items belonged to the category or not. In addition to “yes” and “no” participants could also answer “I don’t know the item”. It was emphasized that we were interested in participants’ personal opinions rather than the answers considered appropriate by the general public or official authorities. The categories were presented on separate pages in a random order. The corresponding items were presented in randomized lists. Participants could proceed at their own pace. The majority of participants completed the survey in less than ten minutes.

3.4 Model Analysis

The models that have been discussed in this paper are all existing models that have been developed in the Item Response Theory (IRT) literature. IRT models tend to be used to infer latent traits from individuals’ manifest responses to the items in a questionnaire or test (Hambleton et al. 1991). Verheyen et al. (2010) recognized the potential of IRT models for the study of vagueness when they introduced the model in Eq. (1) as a formalization of the threshold theory. In the IRT literature this model is known as the Rasch model (Rasch 1960). The model we describe in this chapter was introduced in the IRT literature by Frederickx et al. (2010) to investigate group bias in high stakes testing situations.

We analyzed each category’s categorization data separately using the extended model. This was done using WinBUGS (Lunn et al. 2000) following the procedures for the Bayesian estimation of the model outlined in Frederickx et al. (2010). These include the specification of the priors for the model parameters. For every analysis five chains were run of 10,000 iterations each, with a burn-in sample of 1,000. To determine whether compulsory education (group 1) and higher education (group 2) participants employ different criteria for categorization, we investigate whether there are items for which the posterior probability of indicator Di exceeds .5, indicating that the item is positioned differently in the two groups. To determine whether the two education groups differ regarding the degree they feel the target categories apply, we investigate the mean threshold difference between them. We deem a difference in degree reliable if the 95% credibility interval for \( \mu_{{\theta_{2} }} \) does not include 0.

3.5 Predictions

This is an exploratory study that is first and foremost intended to be an illustration of a modeling framework to characterize group differences in categorization as degree or criteria differences. The study was not designed with the intent of investigating the effect of education level on categorization. The choice for education level as the group variable was a matter of convenience, as we had this information available from a large scale study where this was not the variable of interest. We did not entertain any a priori hypotheses as to the extent to which we would observe criteria and degree differences resulting from education differences, as to our knowledge, the effect of education level on semantic categorization has not been investigated yet. We did deem education level a promising group variable for an illustration, though, as it seemed probable that individual differences in knowledge would affect how terms are applied.

4 Results

4.1 Vagueness in Criteria

We hardly found any evidence for vagueness in criteria. For fruits, vegetables, insects, sports, and tools there were no items for which the indicator Di exceeded .5. That is, in the higher education group the candidate items were positioned the same as in the compulsory education group. For fish and furniture one item was positioned differently. Higher educated participants considered sea horses more representative fish than compulsory education participants did (higher βi in the higher education group). They also considered shelves more representative furniture than compulsory education participants did.

For sciences the picture looked completely different. There were twelve items (50%) for which Di exceeded 0.5. Astrology, philosophy, palm reading, literature, and psychology were considered less representative of sciences in the higher education group than in the compulsory education group (lower βi in the higher education group). The items geography, geometry, meteorology, mineralogy, chemistry, dentistry, and nutrition were considered more representative by higher education participants than by compulsory education participants (higher βi in the higher education group).

4.2 Vagueness in Degree

For fruits and sports the mean threshold difference could not reliably be discerned from zero, indicating that there is no reliable difference in the number of items endorsed by the two groups. A reliable positive threshold difference was found for vegetables, insects, furniture, fish, and tools. Higher education participants used a higher threshold than the compulsory education participants did, resulting in the endorsement of fewer items as category members by the former group. For the category of sciences, we found the opposite pattern: the mean threshold difference was reliably negative, indicating that higher education participants tended to endorse more items as sciences than compulsory education participants did.

4.3 Criteria-Degree Interplay

The findings regarding group differences in degree should be interpreted in light of the findings regarding vagueness in criteria. To aid this interpretation we have included a figure that depicts categorization patterns for the three combinations of vagueness in criteria and degree we identified through the model analyses: (i) absence of vagueness in criteria and degree, (ii) vagueness in degree but not criteria, (iii) vagueness in both criteria and degree. The three combinations are exemplified by the results for fruits, insects, and sciences, respectively.

All three panels in Fig. 1 show for 24 candidate items the proportion of participants endorsing the items as a category member. Each panel contains two graphs, one for the compulsory education participants (black circles) and one for the higher education participants (gray squares). The items are organized along the horizontal axis in increasing order of endorsement according to the compulsory education group. The item on the far left is thus the one least endorsed by the compulsory education participants, while the item on the right is the one most endorsed by these participants.

Fig. 1
figure 1

Categorization proportions for fruits (left), insects (middle), and sciences (right) of the compulsory education group (black circles) and the higher education group (gray squares). Items are ordered along the horizontal axis according to their categorization rank in the compulsory education group

For fruits (left panel) the analyses yielded no reliable differences between the two education groups. The absence of vagueness in degree and criteria clearly shows in the categorization proportions as well: the black and gray curves almost completely overlap. This pattern of categorization is exemplary for sports as well.

For insects (middle panel) the analyses yielded a reliable threshold difference, but no differently positioned items. The absence of vagueness in criteria shows in that the categorization curves of the education groups take the same shape: the items are ordered in the same manner in both groups. The vagueness in degree shows in the displacement of the two curves: the gray squares are systematically lower than the black circles, indicating that the higher educated participants were more strict in categorizing items as category members. This pattern of categorization is exemplary for the majority of the studied categories (vegetables, furniture, fish, tools).

For sciences (right panel), we observe that the categorization proportions of the higher education group tend to be higher (instead of lower) with respect to those of the compulsory education group. This is an indication of the vagueness in degree the analyses established. The additional vagueness in criteria shows in that the two curves no longer have the same shape: The order of the categorization proportions in the two groups is not the same. It is not just the case that the curve of one group is shifted with respect to the other group, as we saw for insects. The nature of the group difference is different for individual items (those for which Di exceeded .5) with the compulsory – higher education categorization divide being greater for some than for others and in some instances even showing the reverse pattern. This was the case for items 2 (palm reading), 5 (literature), 11 (philosophy), and 19 (astrology). They were considered less representative category members by the higher education group, along with psychology (item 12) for which the categorization proportion difference is smaller than one would expect in light of the group threshold difference. For other items such as item 7 (nutrition), 10 (dentistry), and 16 (mineralogy) the categorization difference is larger than expected based on the threshold difference solely.

4.4 Discussion

We observed pronounced differences in semantic categorization between the group of compulsory education participants and the group of higher education participants. The model analyses qualified the categorization differences more often as degree differences (with higher education participants endorsing fewer items than compulsory education participants), than as criteria differences. From this observation one should conclude that education can give rise to both these types of vagueness, but not that degree differences are more prevalent than criteria differences. Which combination of degree and criteria differences emerges, appears to be dependent upon the stimulus materials, and the employed materials were not selected with the education difference in mind. The current study does offer a number of interesting hypotheses regarding the origin of the degree and criteria differences between the education groups, which can be tested in future research with materials tailored to these questions (see Conclusions section for details).

5 Conclusions

The purpose of this chapter was to introduce a procedure that allows group differences in categorization to be identified as criteria and/or degree differences. As such, the procedure yields a number of insights regarding the nature and the sources of vagueness. We established that for noun categories, both vagueness in criteria and vagueness in degree are likely in play at any given moment. From the early work by McCloskey and Glucksberg (1978) and the subsequent replication of their work by Verheyen et al. (2010) we already knew that individuals diverge in the degree to which they consider items members of noun categories. Verheyen and Storms (2013), on the other hand, identified latent groups of participants who use different criteria for categorizing items in categories such as fish, sports, and sciences. The novelty of the current procedure lies in its focus on vagueness in degree and vagueness in criteria simultaneously, and in its ability to relate them to external information about the participants, such as their education. In the current application it allowed us to show that higher education participants tend to apply common terms like vegetable, furniture, and tool more conservatively than compulsory education participants do (vagueness in degree) and that for terms they are arguably more familiar with, like sciences, they employ different conditions for application (vagueness in criteria).

The procedure opens up the possibility to investigate which other properties of the participants systematically affect vagueness. In other work along these lines, we have so far identified a number of factors with ties to degree and criteria differences. We found degree differences in the application of adjectives that one can also apply to oneself to be related to one’s own standing on the relevant dimension. Subjects’ height and weight requirements for applying tall and heavy, for instance, correlate positively with their personal measurements (Verheyen et al. 2018). Gender results in categorization patterns that are opposite to those we observed for education. Using a selection of categories that were expected to yield gender differences in categorization (e.g., clothing, professions, sports, toys) few degree differences were observed, while most categories yielded criteria differences (Stukken et al. 2013). Age gives rise to very intricate patterns of categorization. Both young children (<13 years old) and older adults (>62 years old) are found to overextend common noun categories compared to young adults (degree difference; Verheyen et al. 2011a; Verheyen et al. 2018). Older and young adults also differ in the criteria they use for categorization (White et al. 2018). For the categorization of storage containers, for instance, older adults rely more on “classic” materials such as glass or cardboard, whereas younger adults emphasize relatively “new” materials such as plastics. Using the proposed modeling framework, it is straightforward to extend this line of study to the comparison of different contexts, cultures, language groups, and even regional varieties, and/or to look into categorization differences at the level of individual items instead of entire categories, as we have done.

Ultimately, this research program will allow us to determine to what extent vagueness, as shown in inter-individual application differences, can be accounted for in terms of a limited number of external participant properties. As such, the program could be characterized as researching vagueness from a sociological or psychological perspective, depending on the nature of the property of interest that is investigated. The complete resolution of vagueness is not within reach of this program, however, as intra-individual application differences (Hampton and Passanisi 2016; McCloskey and Glucksberg 1978) cannot be explained in these terms. Our findings do raise the fundamental question of whether sociolinguistic variation of the kind uncovered in this study should be part of a theory of vagueness proper, or whether such a theory should only address the variation that is left after these external influences have been partialled outFootnote 4. A choice for the latter might lead to inter-individual application differences being struck as hallmarks of vagueness, as it is impossible to ever ascertain whether all relevant participant properties have been taken into account.

The current project falls short when it comes to explaining where the effects of education on categorization come from. The purpose of this project was to establish whether education gives rise to particular categorization differences. Establishing how education gives rise to these differences, is more of an endeavor for sociolinguists and differential psychologists and is out of the scope of this chapter. Below we will nevertheless offer some suggestions as to the origins of the education effects we established, with the primary purpose of indicating how they could be formally tested within the framework we have proposed.

One explanation for the observation that compulsory education participants have broader categories than higher education participants do, might be the latter group’s higher lexical familiarity (familiarity with printed words). Individuals that score high on lexical familiarity, have been found to display a higher rejection rate of non-category members that are semantically related to the category (Lewellen et al. 1993), reminiscent of the difference in degree we established. According to this explanation, educational level would be a proxy for verbal skill (see Kuperman and Van Dyke 2013, for support of this argument). An alternative explanation for the difference in degree could also be attempted in terms of personality characteristics that correlate with education. Education level correlates positively with both openness and conscientiousness (Denissen et al. 2008), but academic performance is more strongly related to conscientiousness than to openness (Rocklin 1994; see also Paunonen and Ashton 2001). The observation that higher educated participants are more conscientious than compulsory education participants are, could be used to explain the degree differences observed in the majority of categories. According to this reasoning, higher education participants would ultimately reject more semantic foils because their deliberations are more thorough and deliberate than those of the compulsory education participants are. However, based on the relationship between education and openness, one could have predicted the opposite pattern as well. Higher education participants would then include more items in their categories than compulsory education participants do because they are more imaginative and creative and can therefore more easily come up with conditions in which an item fulfills the requirements for category membership.

For the pronounced differences in criteria for the category of sciences, one can similarly come up with several explanations: (i) higher education participants differ in familiarity with the different disciplines on the basis of information they acquired directly in higher education courses or indirectly through contact with fellow students, (ii) higher education participants have been explicitly instructed about the nature of sciences, (iii) higher education participants have been exposed to the manner in which higher education institutes are organized. Evidence for the second explanation can be found in our data in that higher education participants make a stronger distinction between pseudosciences like astrology and palm reading and prototypical natural sciences like chemistry, meteorology, and mineralogy. Evidence for the third explanation can be found in the clustering of dentistry and nutrition with medicine among higher education participants. All three explanations offered for the vagueness in criteria with respect to the sciences category are reminiscent of categorization differences between experts and novices (e.g., Chi et al. 1981).

We did not entertain any a priori predictions as to how education might give rise to categorization differences. For demonstrative purposes we re-analyzed an existing data set, which was not gathered with the purpose of explaining education differences. If one were interested in the effects of lexical familiarity, conscientiousness, or openness on degree differences in categorization, it would be straightforward to collect this information from the participants and apply the procedure outlined in this chapter after dichotomizing these external variables (as is commonly done in experimental approaches). Alternatively, if one would like to honor the continuous nature of these variables, one could extend the model hierarchically to evaluate their relationship with the estimated thresholds (for a demonstration see Verheyen et al. 2011a). To interpret the criteria differences, one could regress the item positions onto the criteria under consideration and see which of these impact the item positions in the two groups differently. This is the approach taken by Verheyen and Storms (2013) and Verheyen et al. (2015). The regression could also be made part of the modeling by extending the model hierarchically, as in Verheyen et al. (2011b). Ideally, if one were to have specific hypotheses about potential criteria, one would compile the item set in a way that optimally allows one to disentangle the various criteria under consideration. Manipulations like these would make the proposed procedure an even more valuable experimental perspective on vagueness.