In France, physical disability is the most prevalent disability among university students. Many efforts have been made in the universities to include students with disabilities (Ebersold 2003). However, people with physical disabilities remain numerically underrepresented in the universities compared to their numbers in the general population (Ponticelli and Russ-Eft 2009; Quick et al. 2003; Rohmer and Louvet 2013). Possible consequences of this outnumbered status are visibility and role encapsulation (Kanter 1977) meaning that people with disabilities feel highly visible and are expected to be representative of all the other members of their category.

Individuals with physical disabilities feel stereotyped and a wide range of negative characteristics are attributed to them (Louvet and Rohmer 2010; Louvet et al. 2009). Typical attributes assigned to the individual with a disability are incompetence (Louvet and Rohmer 2010; Louvet et al. 2009), unproductivity or dependency (Popovich et al. 2003; Rohmer and Louvet 2006, 2009; Weinberg 1976), warmth (Fiske et al. 2002; Rohmer and Louvet 2011), and courage (Louvet and Rohmer 2010; Rohmer and Louvet 2011, 2013) independent of the nature of the disability. This lack of competence is not only assigned to people with intellectual disabilities: Individuals with a physical disability are also stereotyped as (intellectually) incompetent (Rohmer and Louvet 2011). This stereotype could help to explain why, in an educational setting, teachers can have less positive attitudes and, in accordance, lower expectations for students with disabilities, and therefore treat them differently than students without disabilities concretizing in this way their (negative) attitudes. For example, teachers’ attitudes toward students with disabilities may also be negative toward students with dyslexia (Hornstra et al. 2010) or toward students with special educational needs (Markova et al. 2016; Thaver and Lim 2014). In the same way, professional caregivers may have implicit negative attitudes toward children with T21 (Enéa-Drapeau et al. 2012).

Students with physical disabilities are outnumbered and stigmatized in higher education (Cunnah 2015). According to the distinctiveness theory (Brewer 2003; Brewer and Roccas 2001; Leonardelli et al. 2010), minority status forefronts individuals’ social identities. For example, Cota and Dion (1986) found that participants in a sex-minority condition reported their sex (that is, their minority social identity) more often than participants under same-sex conditions. In this study, participants were placed in three-person groups. The composition of the groups was based on several conditions: all men, all women, lone man (one man with two women), and lone woman (one woman with two men). Participants were then asked to answer to two open-ended probes: “Tell me about yourself” and “Tell me what you are not.” In the minority sex condition, in which gender was distinctive, participants mentioned more often their own gender in response to the affirmation self-concept probe than participants in the other conditions, in which gender was not distinctive.

Being numerically underrepresented in a group makes individuals who are negatively stereotyped vulnerable to stereotype threat (Beaton et al. 2007; Inzlicht and Ben-Zeev 2000; Lee and Nass 2012). In other words, social identity distinctiveness increases stereotype threat. According to Steele (1997), stereotype threat

is a situational threat—a threat in the air—that, in general form, can affect the members of any group about whom a negative stereotype exists (e.g., skateboarders, older adults, White men, gang members). Where bad stereotypes about these groups apply, members of these groups can fear being reduced to that stereotype (p. 614).

As a consequence, individuals perform worse when a relevant stereotype or stigmatized social identity is made salient in the performance situation (Schmader and Johns 2003; Schmader et al. 2008).

The seminal work of Steele and Aronson (1995) examined stereotype threat on a group whose abilities are negatively stereotyped in intellectual domains: African Americans. Black and white college students were asked to complete a difficult verbal test. In the stereotype threat condition, participants were informed that this test could reveal their intellectual aptitudes in the fields of reading and verbal reasoning. In the non-stereotype threat condition, participants were told that the test was a laboratory exercise unrelated to ability. The authors predicted and showed that the black participants performed worse than the white participants when the test was presented as a measure of their intellectual ability; however, when it was presented as non-diagnostic, the performances of black and white participants did not differ. According to Steele and Aronson, the observed decrease in the performances is due to motivational factors such as anxiety (Osborne 2001), fear of confirming the stereotype that is associated with the group membership (Ben-Zeev et al. 2005), and/or cognitive factors such as allocation of attention and resources to processes that are generally automatic (Beilock et al. 2006; Beilock et al. 2007; Schmader and Johns 2003).

Stereotype threat has been demonstrated for several groups and in various fields and domains. For example, this threat appears for girls and math abilities (Désert and Leyens 2002; Shih et al. 1999; Spencer et al. 1999), students from low socioeconomic backgrounds and intellectual ability (Croizet and Claire 1998), French-Arab students and verbal intellectual performance (Chateignier et al. 2009), men and affective information processing (Leyens et al. 2000), German preservice teachers and cognitive ability (Ihme and Moller 2015), and French children attending adapted courses and memory task performance (Berjot et al. 2015). In this last study, after having checked that a negative reputation is associated with students attending adapted classes from elementary schools, Berjot and her colleagues tested the effects of stereotype threat on the performance of students from regular or adapted classes (from 2nd and 4th grades). The results showed that students from adapted classes performed worse than students from regular classes in the threatening condition. In the non-threatening condition, no differences in the performance emerged. Students attending adapted courses are generally oriented on the basis of their learning difficulties: This study showed that “the problem is not so much students’ abilities level, but rather the way the task is presented” (Berjot et al. 2015, p. 153). Although stereotype threat is generally viewed as a situational threat, numerous individual factors moderate its effects such as group identification (Aronson et al. 1999; Bergeron et al. 2006; Kiefer and Sekaquaptewa 2007), domain identification (Aronson et al. 1999, study 2; Leyens et al. 2000), stigma consciousness (Brown and Pinel 2003; Sekaquaptewa and Thomson, 2003), or stereotype internalization (Bonnot and Croizet 2007; Schmader et al. 2004, study 2).

Some authors argued that there is a misrepresentation of the strength and robustness of the stereotype threat effect (Stoet and Geary 2012). Nevertheless, many reviews and meta-analysis confirmed the robust mean stereotype threat but failed to demonstrate the moderating effect of different variables (Doyle and Voyer 2016; Flore and Wicherts 2015; Lamont et al. 2015; Nguyen and Ryan 2008; Picho et al. 2013; Régner et al. 2015). For example, none of the variables (difficulty, presence of boys, type of control group, and cross-cultural gender equality) explored by Flore and Wicherts (2015) in their meta-analysis significantly moderated the effect of stereotype threat for schoolgirls (under 18 years) on math, science, and spatial skills (MSSS) tests. In the same way, Picho and her colleagues (2013) failed to show a significant moderating effect of stereotype priming (either explicit or implicit), group composition (single sex, mixed sex), and regional localization of study on mathematics performance. This lack of moderation may be due to the diversity of the means used to manipulate stereotype threat and situational factors (Nguyen and Ryan 2008; Picho et al. 2013; Shapiro and Neuberg 2007). According to Shapiro and Neuberg and the multi-threat framework, distinct stereotype threats may arise from the intersection of two dimensions: the target of the threat (the self or one’s group) and the source of the threat (the self, outgroup others, or ingroup others). Shapiro and Neuberg (2007) argued that these threats may be elicited by different conditions and moderators and are mediated by different processes (see also Pennington et al. 2016). Thus, distinct forms of stereotype threats operate on particular individuals in particular contexts.

Contextual factors or other situational elements are therefore of high importance for the emergence of a stereotype threat effect (Logel et al. 2009; Picho et al. 2013). Indeed, stereotype threat depends on both the specific circumstances and the social interaction dynamics. For example, the minority/majority sex conditions have implications on the cognitive performance (Psaltis 2012; Psaltis and Duveen 2006): under certain conditions, girls perform worse in opposite-sex dyads than in same-sex dyads. This impact of the context of interaction on the girls is probably linked with status differences (Psaltis and Duveen 2006). Several studies (Huguet and Régner 2009; Inzlicht and Ben-Zeev 2000; Logel et al. 2009; Lord and Saenz 1985; Sekaquaptewa and Thompson 2002, 2003) examined how cues in the social environment trigger stereotype threat, for example, men’s sexism during the interaction (Logel et al. 2009), coaction contexts (Lee and Nass 2012), or numerical representation (Beaton et al. 2007; Huguet and Régner 2009; Inzlicht and Ben-Zeev 2000; Lord and Saenz 1985; Sekaquaptewa and Thompson 2002, 2003). Beaton and collaborators (2007, study 1) found that the sex composition of a team can affect the mathematical performance of women. They showed that women in a solo condition performed worse than women in a non-solo condition. In the same way, Inzlicht and Ben-Zeev (2000) invited participants to complete a difficult math test in triads comprising two individuals of the same sex or two individuals of the opposite sex. Women who were outnumbered by men showed performance decrements in the math test compared to women who were in a gender-homogeneous environment. Men performed equally well in both conditions. The performance drop for women is linked to a stereotype about their group’s poor math abilities and poor math competence.

Stereotype threat and associated effects on performance were therefore shown with different groups. The present study is an attempt to extend these results to students with disabilities and to deepen the understanding of the stereotype threat phenomenon, in particular the impact of the context on performances. This is a societal question as, in most academic or professional evaluations, it is quite implicit that the evaluator does not suffer from a physical disability. The poor performance of students or job candidates with disabilities could reflect a stereotype effect. The objective of this study is to investigate the effect of the composition of a group of evaluators on students’ performances. We hypothesized that students with physical disabilities would be threatened when they believe that their evaluators do not have physical disabilities. Thus, we predicted that students with physical disabilities in outgroup condition (i.e., thinking that they would be evaluated by students without disabilities) would underperform compared to students with physical disabilities in the ingroup condition (i.e., when they think they will be evaluated by students with disabilities). We expected that students without disabilities would be unaffected by the kind of evaluators (ingroup or outgroup conditions).

Method

Participants

The study comprised 48 French university students aged from 19 to 35 years (M = 23.00, SD = 3.59) who volunteered to participate without compensation. All participants studied humanities and social sciences except psychology (40 undergraduate students, seven graduate students, and one PhD student). Twenty-four had a physical disability (11 women, 13 men, M age = 24.33, SD = 3.33) and 24 did not (13 women, 11 men, M age = 22.25, SD = 3.54). All signed a consent form and all were included in the sample. The disabilities were of the lower extremities without intellectual disability (for example, people in a wheelchair after an accident, one student with spina bifida without cognitive consequence, or those who had had a prenatal infection). Thus, a pencil and paper task was not problematic, and potential participants were reassured about that.

Material and procedure

The participants were tested individually in the lab. We informed them that they would fill out a questionnaire and then complete a test customarily used in universities. This test was presented as a “measure of logic skills and general school performance.” The participants were chronologically informed that “while you will complete these tasks, you will be filmed,” “the video recording as well as your answers will be evaluated by five students,” “when your results have been evaluated, within a week, you will have the possibility to meet the evaluators who will present their evaluations and the results of the test.” Behind the camera, the experimenter said “good, one sees you from head to toe.”

The outgroup or ingroup conditions were then randomly introduced using the identity of the imaginary panel of future evaluators. This panel was presented as coming from various student associations. Participants with disabilities in the outgroup condition were told that they would be evaluated by five members of the “Association of Students in Psychology” (i.e., students assumed to be without disabilities). Those in the ingroup condition were told that they would be evaluated by five members of the “Association of Students with Disabilities” (i.e., students with disabilities). For participants without disabilities, the associations establishing ingroup and outgroup conditions were inverted. They then filled out a 4-item questionnaire on their personal characteristics (size, eye color, hair, and potential specificity especially disabilities), being told that the panel would read their answers before grading the test. The evaluators would therefore presumptively be informed of the disability via the 4-item questionnaire and the videotape.

Participants first filled out the Twenty Statements Test (Kuhn and McPartland 1954), a measure of self-image, in which participants are asked for 20 answers to the simple open-ended question “Who am I?” The participants had 3 min to write their answers, which were used here to rate the potential effects of evaluators’ membership on social categorization. More precisely, this measure was a check of the fact that social identity of “students with physical disabilities” was more salient in the outgroup condition than in the ingroup condition.

As a second task, the participants completed the D-series of the Progressive Matrices (Raven et al. 1998/2003). The Progressive Matrices are a series of multiple-choice items of abstract reasoning (Brouwers et al. 2009). Each item depicts an abstract pattern in a three by three matrices. All cells contain a figure except for the cell in the lower right corner. Participants are asked to identify the missing segment that would best complete the pattern. Correct and incorrect answers were respectively coded 1 and 0. The performance was the sum of correct answers (max score = 12). After completing this test, participants were debriefed about the real purpose of the study and thanked for their participation.

This experiment took the form of a 2 × 2 factorial design. The factors were status of participants, students with disabilities or not, and condition of evaluation, that is, the test was presented as either being evaluated by ingroup members or by outgroup members. The primary dependent variable was the performance in a written “intellectual ability” task. Spontaneous mentions of identity were also recorded.

Results

Preliminary analysis

First, a t test showed the absence of gender differences on performance (M men = 9.79, SD = 2.13, M women = 10.08, SD = 2.00), t(46) = 0.49, p = .63.

Grade and time spent to complete the matrices were regressed on performance. The grade generally tended to influence the performance of the participants, β = −.27, p = .06. We thus compared the grade of the participants (after a numerical coding from 1 (L1) to 6 (PhD student)) between conditions. We conducted an analysis of variance (ANOVA) with grade as the dependent variable and status and condition as the independent variables. The analysis revealed no main effects for status, F(1,44) = 1.45, p = .23, or condition, F(1,44) = 0.16, p = .69. However, a significant status × condition interaction occurred, F(1,44) = 5.19, p = .03, η 2 p  = .11. A Fisher’s LSD post hoc test indicated that in the outgroup condition, students with disabilities (M = 3.25, SD = 1.29) were more often graduates than students without disabilities (M = 2.17, SD = 0.94), t (44) = 2.41, p = .02. There was no difference between students with (M = 2.42, SD = 1.00) and without disabilities (M = 2.75, SD = 1.06) in the ingroup condition, t(44) = .94, p = .45. Grade was thus used as a covariate in all test performance analysis.

Time spent to complete the matrices was not a significant predictor of test performance, β = .20, p = .17.

Manipulation check

The participants gave an average of 8.50 answers (SD = 2.65) to the question: “Who am I?” Two judges (who were unaware of the conditions) rated the answers of the participants. These judges were instructed to code each participant for the presence or absence of three categories in the answers: identity in terms of “disabled people group” membership (only for students with physical disabilities), other social identities (gender, student, etc.), and self-identity (size, hair color, etc.). The inter-rater reliability was .98. All disagreements were resolved by the first rater.

Unexpectedly, students with physical disabilities did not significantly mention their disability more often in the outgroup condition (75%) than in the ingroup condition (41.67%, phi-coefficient, ɸ = .34), χ 2 (1, N = 24) = 2.74, p < .10. Although this difference could be considered as marginal, it is not sufficient to be sure that our manipulation was successful. This point will be further addressed in the “Discussion” section.Footnote 1

Participants’ performance

The performance scores were analyzed by performing a 2 (status) × 2 (condition) analysis of covariance (ANCOVA) on the number of correct answers, with status and condition as between-participants factors and self-reported grade as a covariate. The analysis showed no significant effect of the covariate, F(1,43) = 0.17, p = .68.

Our main prediction was that students with physical disabilities would experience stereotype threat that undermines their performance only in situations in which evaluators are outgroup members (i.e., students without disabilities). This prediction is strongly supported by the data. Even if the covariate has no significant effect, the adjusted means were used in the analysis and are presented in Fig. 1.

Fig. 1
figure 1

Mean number of correct answers adjusted by grade (max = 12) to the D-series for the participants with or without physical disabilities in the outgroup or ingroup condition (error bars represent ± 1 standard deviation) * Significant difference between Ingroup and Outgroup conditions Note. The analysis showed no significant effect of the covariate, F(1, 43) = 0.17, p = .68

The analysis revealed indeed a significant status × conditions interaction, F(1,43) = 15.89, p < .001, η 2 p  = .27. In order to test our specific hypotheses, two planned contrasts were conducted between the ingroup and outgroup conditions for students with disabilities and for students without disabilities. Consistent with the hypothesis, students with disabilities in the outgroup condition (M = 7.04, SD = 1.65) correctly completed fewer matrices than students with disabilities in the ingroup condition (M = 10.40, SD = 1.00), F(1,43) = 52.53, p < .0001, η 2 p  = .55. Students without disabilities were unaffected by the condition and performed similarly in the ingroup condition (M = 11.51, SD = 0.67) and in the outgroup condition (M = 10.80, SD = 0.72), F(1,43) = 2.39, p = .13. It should also be noted that students with disabilities performed to a lower level than students without disabilities in both the ingroup condition, F(1,43) = 6.06, p = .02, and the outgroup condition, F(1,43) = 62.54, p < .01.

The ANCOVA also revealed a significant condition main effect, F(1,43) = 41.39, p < .001, η 2 p  = .49, with participants in outgroup condition (M = 8.92, SD = 2.32) performing worse than participants in ingroup condition (M = 10.96, SD = 1.00) and a significant status main effect, F(1,43) = 57.62, p = .0001, η 2 p  = .57 with students with disabilities (M = 8.71, SD = 2.20) performing worse than students without disabilities (M = 11.17, SD = 0.76).

Discussion

We predicted that placing students with physical disabilities in an environment in which they anticipate to be being confronted with non-disabled evaluators is sufficient to create a threatening intellectual environment that causes decrements in their intellectual performances. The results strongly support this hypothesis and are consistent with the theoretical frameworks that guided the work (Shih et al. 1999; Steele and Aronson 1995). The main effect for condition (outgroup or ingroup) could lead into thinking that the mere—anticipated—presence or contact with outgroup members is in itself sufficient to decrease cognitive performances. In the same way, the main effect for status (with or without physical disabilities) could lead to seeing people’s physical disabilities as an intrinsic factor of decrement in their cognitive performances. However, these effects are qualified by the interaction between both the experimental manipulation and participants’ status. This interaction showed not only that students with disabilities perform less than students without disabilities in both contexts but it also showed that the context, as expected, only influenced the performance of students with disabilities. Indeed, students with disabilities in the ingroup context performed less than those in the outgroup context, a difference that did not appear for students without disabilities. However, it is worth noting that the higher scores of the students without disabilities may reflect a ceiling effect that could have masked an effect of the context. Future studies may help to solve this issue even though it is not a core aspect of the present research question.

Regardless of these promising results, it should be acknowledged that the experimental manipulation used here had only a marginal effect on the salience of students’ “disabled” identity, implying that results should be read with caution. Nevertheless, the fact that the experimental manipulation also leads to identity threat gives us confidence toward the reliability of the procedure. The marginal result obtained here may more translate a problem in the measure itself (i.e., the “Who am I?” test). As a result and as previously used in the literature, future research may rather use an implicit measure instead of an explicit measure (e.g., lexical decision test, for an example see Thomas and Dubois 2011).

This study shows thus that, as for women or other social groups, stereotype threat on students with physical disabilities can be activated by the mere anticipation of an evaluation by students without disabilities. Students with physical disabilities may be highly concerned about the possibility of confirming negative stereotypes about their group assumed to be held by the students without disabilities. This concern could effectively decrease their cognitive performance (see Beilock et al. 2006; Carver et al. 1979; Duval et al. 1992). In the outgroup condition, the participants with disabilities could be afraid that their performance confirmed in the minds of the evaluators that the negative stereotypes were true and therefore that these evaluators could judge them badly. In sum, the context—in particular the real or anticipated presence of outgroup members—can trigger stereotype threat. According to Shapiro and Neuberg (2007), this threat corresponds to a “threat from outgroup members.” These results were obtained from students with physical disabilities. It can be expected that other disabilities such as mental or learning disabilities may be affected by this phenomenon. Furthermore, one can hypothesize that different disabilities (e.g., deafness or blindness) could lead to different effects of a stereotype threat on the individuals’ performances because the stereotypes attached to them differ, their visibility differs, and the real or supposed weakness or impairment of the abilities (e.g., attentional, intellectual, emotional, social, and/or motor) also differs.

Furthermore, students with disabilities obtained lower scores than students without physical disabilities even in the absence of a time constraint for completing the matrices. However, the use of a time constraint—as it is often the case for standardized tests or scholastic tasks—could probably reinforce this discrepancy between the groups because it could constitute an additional pressure for the students with disabilities: doing something well and doing it fast. That second counter-stereotypic element could increase the associative strengths with the category of “disability” (see Dijksterhuis et al. 2000). The effect of time constraint, in itself (versus its absence) on people with disabilities should be tested and could possibly lead to adjustments in the way they are generally evaluated (indeed, for example, French students with disabilities benefit from an added third of time to take their exams).

In several studies (e.g., Wout et al. 2008; Zhang et al. 2013), the authors observed different decrements in performances as a function of the type of threat that is activated. For example, Zhang et al. (2013)’s studies are devoted to two types of threats. In a first pilot study investigating the women’s threat concerns in math, they showed that women reported a greater concern about how gender stereotypes threatened their self-reputation rather than their group-reputation. In a second study, the performance was public in all conditions (the source of the stereotype threat is therefore others in all conditions) but the target of the threat—self or group—was manipulated. In an own name condition, participants were asked to write the date, their real name, and their gender on the cover sheet of a math test booklet. These instructions allowed linkage of the self to the performance. In an identity-masked condition, they were instructed, namely for coding purposes and due to confidentiality concerns, to indicate a fictitious name on each page of the math test. The fictitious name was assigned and could either be male or female. In this condition, participants were thus primed with the idea that the self would not be publicly linked to their performance. The results showed that women in the identity-masked condition had a significantly higher math performance and reported less self-threat and distraction than those in the own name condition. Men were unaffected by this manipulation.

Future studies could thus explore (1) the way a threat could differently affect students with disabilities depending on both the source and the target of this threat and (2) the effect of the types of threats on their performances. These analyses could enable, we think, promising specific programs aiming at preventing stereotype threat effects in students with disabilities in higher education. As Shapiro and Neuberg (2007) pointed out, the threats are elicited by different circumstances, are mediated by different mechanisms, and are moderated by different personal and situational factors. They therefore require qualitatively different interventions. For example, preventing the development of personal negative attitudes and the internalization of the stereotypes will protect against self-as-source stereotype threats (Shapiro and Williams 2012). On the other hand, reducing stigma consciousness will protect against other-as-source stereotype threats. The identification of sources and targets for disabled students in educational setting is important to prevent the effects of stereotype threat and to promote the social inclusion of these students. Programs for inclusive education could be addressed to students with disabilities but not uniquely for them. Other persons, potential sources of threats like students without disabilities, teachers, etc. could also be concerned.

The results of the present study also have important implications for both the classrooms and the workplaces. The proportion of people with a physical disability is low in the general population. The drop in performance in a threatening situation may confirm stereotypes and thus justify their position in our society. It could for example reinforce the idea that individuals with a physical impairment are less competent and less productive than others (Louvet and Rohmer 2010). This study pushes forward the understanding of psychosocial problems related to disability and in particular the academic visible lack of success of individuals with disabilities (see Ponticelli and Russ-Eft 2009; Quick et al. 2003): they are still less represented than others in prestigious education domains and are generally numerically underrepresented in the universities (few achieve a doctoral thesis or even a master’s degree).

We suggest that future studies test the hypothesis that the mere presence of students with physical disabilities and students without is sufficient in itself to decrease the performance of those with disabilities. The right to education, promoted by the United Nations’ Convention on the Rights of Disabled Persons, was accepted by most OECD countries. In France, the law n° 2005-102 (2005) that promotes general education for students with disabilities provides the foundations for inclusive schooling. As in other European countries, the proportion of students with disabilities in the academic system continues to gradually increase. This can contribute to gradually reduce the effects of stereotype threat. The recruitment of teachers with disabilities could also, at least partly, mitigate the effects of stereotype threat.