Introduction

Choice of major is an important career decision. First, it affects one’s odds of finding employment, since career outlook varies greatly by a disciplinary area. According to the Occupational Outlook Handbook of the Bureau of Labor Statistics (2015), employment of petroleum engineers is projected to grow 26 percent; while employment of reporters, correspondents, and broadcast news analysts is projected to decline 13 percent from 2012 to 2022. Secondly, it influences future earnings to a greater extent than the type of college one chooses to attend—more or less selective (James et al. 1989; Ma and Savas 2014). The 2012 median pay of petroleum engineers with Bachelor’s degree—$130,280 a year—is about 3.5 times the median pay of reporters, correspondents, and broadcast news analysts with bachelor’s degree—$37,090 a year (Bureau of Labor Statistics 2015). Additionally, choice of major is consequential for the required level of training and education. A successful career in science might require more years of study than a successful career in engineering. Lastly, choice of discipline affects the type of work one is likely to engage in.

Choice of discipline by females is puzzling as they seem to frequently overlook such important factors as future employment prospects or wages by discipline. While females outperform males in college enrollment, persistence, and degree attainment (Snyder and Dillow 2012; cited in DiPrete and Buchmann 2013), they tend to choose less lucrative fields with less favorable career outlooks. Based on National Science Foundation (2014), in 2010 female representation was the highest in social science occupations (58.1 %); followed by biological, agricultural, and environmental life science (48.2 %); physical science (30.0 %); computer and mathematical science (25.1 %); and engineering (12.7 %). Thus, gender segregation is most significant in engineering, where participation of females is roughly half of what it is in computer and mathematical science. Underrepresentation of women in certain disciplines translates into their lower earnings and poses questions about choices they make when they enter college.

Multiple empirical studies analyzed the determinants of college major choices. With respect to differences in choice of major of females and males, researchers (e.g., Davies and Guppy 1997; Song and Glick 2004) suggest that females pick majors with lower earning potentials. Ma (2009) indicates that females place greater emphasis on “intrinsic, altruistic, and social job rewards,” while males place greater emphasis on “extrinsic rewards,” such as money and prestige. Some researchers (e.g., Paglin and Rufolo 1990; Oakes 1990; cited in Ma and Savas 2014) explain female underrepresentation in certain fields of study by differences in abilities or academic preparation in math and science. However, others (e.g., Adelman 2003; cited in Ma and Savas 2014) found that gender differences in high school academic preparation are declining. According to Jacobs (1986), Lackland (2001), and Solnick (1995; cited in Porter and Umbach 2006), women tend to choose disciplines like education or nursing because of their gender role orientation.

Several researchers explored the association between socio-economic status (SES) and choice of major. However, evidence from these studies is not conclusive. According to Davies and Guppy (1997), SES does not affect choice of fields with higher economic returns. Contrary to this finding, Goyette and Mullen (2006) suggest that low SES students are more likely to choose vocational majors, while high-SES students choose arts and sciences. Similarly, Ma (2009) found that family SES has a significant influence on choice of major. “[L]ower SES families tend to choose technical, life/health science, and business majors—those higher paying fields upon graduation—over humanities and social science/education majors” (ibid, p. 277). Ma (2009) also suggests that the effect of family SES on college major choice might be asymmetric for men and women. “Men from a high SES family may still be expected to choose a lucrative career, whereas women from comparable backgrounds may not” (ibid, p. 215). For example, males from wealthy families become business majors more often than females from wealthy families (Green 1992; cited by Ma 2009).

Eide and Waehrer (1998) suggest that choices of students who intend to complete graduate studies might be different from choices of students who intend to enter the workforce after completing a bachelor’s degree. Furthermore, the effect of the highest intended degree might differ by gender. For women the intent to complete a graduate degree is correlated with greater odds of choosing liberal arts and science majors. For men the intent to complete a graduate degree is associated with greater chances of choice of liberal arts and science majors as well as other majors except computer science/engineering.

Several studies focused on the association between self-rated abilities and self-efficacy on choice of major. Betz and Hackett (1983) suggest that mathematics self-efficacy is significantly related to the extent to which students selected science-based college majors. At that, males tend to have stronger mathematics self-efficacy than females. Smart et al. (2000) provided support for the association between student self-rated abilities and academic fields they choose. “[S]tudents who intended to major in a particular academic environment generally perceived themselves as having decidedly stronger abilities and interests commensurate with those that their chosen environment tended to reinforce and reward” (ibid, p. 116). Interestingly, the association between a choice of discipline and self-ratings of one’s abilities and interests is similar for males and females across all disciplines with the exception of social disciplines. The magnitude of differences in self-rated abilities and interests of students who chose social and non-social disciplines was greater among males than among females (ibid), thus indicating that factors other than self-rates abilities and interests might lead females to choose social disciplines.

While previous studies contributed greatly to our understanding of major choices, most of these studies did not analyze the choices of males and females separately to account for possible gender differences in effects. For example, based on a previous study (Betz and Hackett 1983), mathematics self-efficacy is significantly related to the choice of a college major and is stronger for males. But does the effect of mathematics self-efficacy on the choice of a college major differ for males and females?

Our study is aimed at exploring the effects of different characteristics—i.e., socio-economic status, prior academic achievement, high school curriculum, extra-curricular involvement, and self-efficacy for academic achievement—on choice of discipline by females and males. Apart from substantive contribution, our study seeks to overcome the independence of irrelevant alternatives (IIA) assumption inherent in the multinomial logit model used in many existing studies of major choice. Since the Hausman–McFadden (HM) test rejects the IIA assumption in our study, we followed the multinomial logit analysis with a nested model and a mixed model. We also compared the consistency of average expected probabilities based on multinomial logit and mixed logit models.

Theoretical Framework

Holland’s Theory of Vocational Choices

Our classification of academic disciplines is based on the Holland’s (1966, 1973, 1987, 1997) theory of vocational choices—artistic, social, enterprising, conventional, realistic, and investigative. According to Holland, congruency between personality and career leads to greater occupational satisfaction and success. Hence, individuals have a tendency of choosing an occupation that fits their skills, abilities, attitudes and values. Thus, realistic occupations attract individuals who enjoy concrete and practical activities. Investigative professions are chosen by those who are fond of analytical and intellectual work. Artistic fields are for those who possess imagination, unconventional ideas, and aesthetic values. People in social occupations tend to be tactful, understanding, empathetic, responsible, and helpful. Enterprising jobs reward confidence, social skills, and ambition. Individuals in conventional occupations are good at following plans and paying attention to detail.

Nauta (2010) provides an overview of empirical studies that validate the existence of personality and occupational types suggested by Holland. While Holland (1997; cited in Nauta 2010) recognized that having six categories is a simplification of reality, he still abided by his model, since having a greater complexity would be less practical. For instance, in career counseling having fewer categories “help provide clients with families of occupational titles” (Nauta 2010, p. 17) and a greater number of possibilities to explore as opposed to having a single occupation to discuss.

Holland (1973, 1997; cited in Nauta 2010) suggests that occupational and personality types—artistic, social, enterprising, conventional, realistic, and investigative—are broadly applicable, yet acknowledged that gender impacts career development. Empirical studies support this idea. For example, Fouad (2002) indicates that males scored higher on realistic measures, while females scored higher on artistic and social measures. According to Betz and Gwilliam (2002), males report higher confidence on realistic, investigative, enterprising, and conventional inventories, while females report higher confidence on social inventory. Research also suggests that Holland’s model fits data across gender (see Nauta 2010).

Porter and Umbach (2006) used Holland’s theory to study the effects of political orientations and student personalities on choice of an academic major. They found that political orientations and student personalities have a significant effect on choice of a discipline, while effects of test scores disappear after control for student personalities. Porter and Umbach (2006) conclude that “Holland categories provide an excellent framework for the study of student major choice” (p. 445).

Smart et al. (2000) used Holland’s classification in their study of characteristics of students entering academic fields. However, they excluded two out of Holland’s six categories—conventional and realistic—because their focus was on artistic, social, enterprising, and investigative fields. Furthermore, very few college students and faculty fit into these categories (ibid, cited in Jones 2011). Following Smart et al. (2000), we also excluded the conventional category from the analysis as the data set for the study does not include college majors that fit into this category. (Future accounting majors enter a study university as pre-business majors.) However, we did include the realistic category, since our dataset includes a significant number of students in engineering and architecture who represent the realistic class (see, for example, Holland and Lutz 1967). See Table 1 for lists of disciplines and shares of female and male students in each of the five categories included in the analysis.

Table 1 Variable descriptions and descriptive statistics

Gottfredson (1981; cited in Trusty et al. 2000) suggests an alignment between socio-economic status and Holland’s occupational types. “I[nvestigative] occupations are the most prestigious. E[nterprising], A[rtistic], and S[ocial] occupations have roughly average levels of prestige, and R[ealistic] and C[onventional] occupations have the lowest level of prestige” (ibid, p. 464). Hence we expect to find an association between first generation status and a measure of financial stress, on the one hand, and choice of academic discipline, on the other hand.

The influences of demographics and academic performance on career choices have been accepted by career theorists (Trusty et al. 2000). With respect to ethnicity, some researchers (e.g., Swanson 1992; cited in Ryan et al. 1996) suggest that Holland’s structure fits white students better. Yet Ryan et al. (1996) indicate that this lack of fit might be associated to socio-economic status rather than ethnicity. While we include ethnicity in our study, the vast majority of students in our sample are white and the number of students from other ethnic groups is rather small. Hence, our study does not analyze each ethnic group separately and compares white and non-white students.

According to Schneider and Overton (1983), educational achievement, measured by GPA and SAT scores, can be linked to Holland’s personality types. The association between achievement and personality types also varies for males and females (ibid). Hence we include high school GPA and test scores as predictors in our study.

Skills and interests change as a result of learning experiences (Krumboltz 1996; cited in Trusty et al. 2000). Spending more time on certain subjects will result in improved achievement and might lead to change in interests (ibid; see also Görlitz and Gravert 2015). Therefore, we hypothesize that years of English/Literature, Math, Science, History/Social Sciences, and Foreign Language in high school affect choice of a discipline in college.

“The way students may validate congruency between themselves and their college major is through their involvement in high school and college extracurricular activities” (Patrick et al. 1993, p. 28). Participation in extracurricular activities—performing or visual arts, athletic teams, student government, publications, academic honor societies, academic clubs, vocational clubs, and religious youth groups—helps students acquire self-understanding and confidence for effective career decisions (ibid). We, therefore, hypothesize that high school extracurricular involvement affects a choice of a college major.

Prior research supports relations between interests, skills, and abilities within the same Holland type, but these associations tend to be rather small (Ackerman and Heggestad 1997; Randahl 1991; Swanson 1993, cited in Nauta 2010). At the same time associations between Holland type and self-efficacy tend to be more substantial (Betz et al. 1996; cited in Nauta 2010). Hence we proceed to discuss self-efficacy and social cognitive theory.

Social Cognitive Theory

According to social cognitive theory, self-efficacy is a person’s belief in his or her ability to succeed in a particular situation (Bandura 1977, 1982, 1986, 1995, 1997, 2006). This belief in one’s ability to succeed affects one’s behavior. Efficacy varies by domain or “distinct realms of functioning” (Bandura 2006). Choice of academic major is expected to correlate with a belief in one’s ability to perform different academic tasks—such as effective writing and speaking, critical thinking, analysis of quantitative problems, using computing and information technology, working independently, or working in team environment.

Huss et al. (2002) suggest that students’ self-rated preparedness can be viewed from the perspective of self-efficacy or beliefs about one’s ability to succeed at a given task. Bubany and Hansen (2010) compare self-efficacy with ability self-estimate scores and suggest that empirical differences may be due to measurement error or scale content, rather than due to meaningful reasons. Given these prior findings, we use self-rated preparation, abilities, and self-efficacy interchangeably. We expect that, controlling for other characteristics, a student’s self-rated preparedness to analyze math and quantitative problems will have a positive effect on her odds of choosing an investigative or realistic field. A student’s self-rated preparedness to write or speak clearly and effectively and to think critically and analytically is presumed to have a positive effect on her odds of choosing an artistic field. Self-rated preparedness to use computing or information technology might be associated with greater odds of choosing a realistic field. Students who are certain in their ability to work effectively with others might be more comfortable with social or enterprising fields, while those who feel prepared to learn effectively on their own might be inclined to choose investigative fields.

Given the varying expected returns by academic field and by degree level (Arcidiacono 2004; Berger 1988; Eide and Waehrer 1998), we also included expected earnings and intended degree in our study.

The dataset is based on student records and responses to the Beginning College Survey of Student Engagement. Variable descriptions as well as descriptive statistics of our study are presented in Table 1.

Data and Method

The institution studied is a Research University (high research activity) with about 4000 first-time freshmen enrolled each fall. Five fall cohorts of first-time freshmen (starting from fall of 2008) are included in this study. Each year, about 95 % of first-time freshmen at a study institution complete the Beginning College Survey of Student Engagement (BCSSE). Only those students who completed BCSSE are included. The data contain information on choices of one of five disciplinary categories—artistic, social, enterprising, realistic, and investigative—by 9918 females and 8939 males (see Table 1).

Consistent with Davies and Guppy (1997), Song and Glick (2004) and Ma (2009), females at a study institution are more likely to self-select into artistic, social, and investigative fields; while males tend to self-select into more lucrative careers—business, engineering, and architecture. One should note here that careers in fields selected by females might require more years of study; which corresponds to a greater mean time to intended degree for females. Females have, on average, higher high school grade point averages and ACT verbal scores, but lower ACT quantitative scores. The percentage of those who have passing grades in Calculus is also lower for females, compared to males. With respect to involvement in extra-curricular activities, females report greater involvement in performing or visual arts, student government, publications, academic honor societies, academic and vocational clubs, religious youth groups, and community service and volunteer work. Males report greater involvement in athletic teams. Females self-report greater preparation to write, speak clearly and effectively, work effectively with others, and learn effectively on their own. Males, on the other hand, feel more prepared to analyze math and quantitative problems, use computing and information technology, and think critically and analytically.

Note that one variable included in the analysis—financial stress—is a latent variable that was measured using the principal component analysis. Following Cole (2012), we included reliance on student loans and self to pay college expenses, number of hours a student plans to spend in a typical week working on- or off-campus, and perceived difficulty of paying college expenses as indicators of financial stress. Table 2 contains means and standard deviations of observed variables as well as factor loadings and factor scores for the model. The percentage of variance explained is 51.20 %. Cronbach’s Alpha for this set of observed variables is 0.61. The factor loadings range from 0.68 to 0.74. Thus, the greater the index of financial stress, the more difficulty a student expects to encounter paying college expenses.

Table 2 Principal component analysis: financial stress

Selection into majors depends on the monetary returns (e.g., Arcidiacono 2004). An alternative specific variable included in our study—natural logarithm of salary—is based on the Salary Survey of National Association of Colleges and Employers (2014). The salary for enterprising disciplines is based on the average salary for business ($54,234). Most of artistic and social disciplines are assigned salaries for humanities and social sciences ($37,058). Exceptions are: communications with the average salary of $43,145; education with the average salary of $40,480; and health with the average salary of $49,713. Aside from computer science with the average salary of $59,977, realistic fields were assigned a salary of engineering ($62,535). Investigative disciplines were assigned a salary of math and sciences ($42,724).

The outcome variable—a discipline chosen by a first-time freshman—is categorical and calls for a multinomial model. The most commonly used model for categorical outcomes is multinomial logit model (MNLM), according to which the probability of observing an outcome i for an individual n is (Train 2009, p. 37):

$$P_{ni} = \frac{{{ \exp }\left( {x_{ni} \beta } \right)}}{{\mathop \sum \nolimits_{j = 1}^{J} { \exp }\left( {x_{nj} \beta } \right)}}$$
(1)

where the dependent variable has J categories numbered from 1 to J, x is a vector of independent variables and β is a vector of regression coefficients.

The multinomial logit model assumes the independence of irrelevant alternatives (IIA). This assumption implies that adding or removing categories of the outcome variable does not change the relative risks of the remaining categories. Let’s consider a scenario in which we could separate accounting majors and add a conventional discipline to our model. If IIA holds, student odds of choosing an enterprising discipline over artistic, investigative, realistic, or social discipline should not depend on adding a conventional discipline to our model. In practice, however, the odds of choosing an enterprising discipline would likely go down, since some of the pre-business majors can be expected to choose accounting if this option was available to them.

To test for the IIA assumption, we use the Hausman–McFadden (HM) test, which involves the following steps (Long and Freeze 2014; pp. 408–409): (1) estimate the full model with all J alternatives included; (2) estimate a model with one or more alternatives excluded; and (3) conduct HM test that compares parameter estimates of the full set with those of the subset. If the parameters of the full set and parameters of the subset are inconsistent, the IIA assumption is rejected.

Because the IIA assumption does not hold for our models, we estimated nested logit and mixed logit models. Given the number of parameters we estimate, the multinomial probit model (MNP) produced a difficult computational problem relative to nested or mixed logit and did not reach the point of saturation. A similar observation was made by Dow and Endersby (2004) who stated that “[t]he MNP presents a difficult maximum likelihood optimization problem that sometimes fails to converge at a global optimum or produces parameter estimates that are sufficiently imprecise as to make statistical inferences suspect” (p. 109). Thus, we will limit our discussion to three models—multinomial logit, nested logit, and mixed logit.

By allowing correlation between some choices, the nested logit model partially relaxes the IIA assumption. While the IIA holds within nests, it does not hold for alternatives in different nests (Train 2009). For example, if we combine realistic and investigative disciplines into one nest, the ratio of probability of choosing realistic over investigative disciplines will remain the same regardless of removing artistic, social, or enterprising disciplines. At the same time, the ratio of probabilities of choosing realistic over social disciplines—i.e., disciplines that do not belong to one nest—might change if one of the remaining alternatives is removed. The nested logit probability is the product of two standard logit probabilities—the probability of choosing nest B k and the probability of choosing an alternative i within the nest B k (ibid, p. 82):

$$P_{ni} = P_{{ni|B_{k} }} P_{{nB_{k} }}$$
(2)

where \(P_{{ni|B_{k} }}\) is the conditional probability of choosing the alternative i given an alternative in nest \(B_{k}\) is chosen and \(P_{{nB_{k} }}\) is the probability of choosing an alternative in nest \(B_{k}\).

“Mixed logit probabilities are the integrals of standard logit probabilities times the density of parameters” (ibid, p. 135):

$$P_{ni} = \mathop \int \nolimits \,\,\left( {\frac{{{ \exp }\left( {x_{ni} \beta } \right)}}{{\mathop \sum \nolimits_{j = 1}^{J} { \exp }\left( {x_{nj} \beta } \right)}}} \right)f\left( \beta \right)d\beta$$
(3)

where the dependent variable has J categories numbered from 1 to J; x is a vector of independent variables; β is a vector of regression coefficients; and \(f\left( \beta \right)\) is a density that provides the weights. Thus, “[t]he mixed logit probability is a weighted average of the logit formula evaluated at different values of \(\beta ,\) with the weights given by the density \(f\left( \beta \right).\)” (ibid, p. 135). The standard logit model is a special case of mixed logit model where \(f\left( \beta \right) = 1\) if \(\beta\) equals fixed parameter b and 0 if \(\beta \ne b\).

Given our research focus, we would ideally like to accompany our comparison of regression coefficients across gender with statistical tests for the significance of differences. In linear regression this could be done either by estimating two separate models for males and females and using Wald Chi square statistic to test the difference between coefficients or by estimating a single model that interacts gender with other independent variables in the model. Unfortunately, use of these methods in logit and probit models has been questioned in several studies. Using an example of logit regressions predicting the probability of promotion to associate professor for males and females, Allison (1999; cited in Hoetker 2004; Long 2009; Williams 2009; Mood 2010) illustrates that the difference in the two coefficients for article counts is an artifact of differences in the degree of residual variation for men and women. Allison (1999) concludes that unless we are willing to assume that the residual variation is constant across groups, both standard tests—Wald Chi square test for coefficients in two separate models or a single model that interacts group variable with other variables—tell us nothing about actual differences in the effects of our variables. With respect to single equation approach, Hoetker (2004) states: “Econometric theory and simulation results suggest that tests interacting coefficients with a dummy variable for group membership in a single equation are particularly misleading. Forcing observations from both groups to have the same residual variation yields coefficients that tell us nothing about how a covariate’s impact varies across groups” (p. 17). The problem is “very similar to the well-known problems with comparing standardized ordinary least square (OLS) coefficients across groups” (Williams 2009, p. 534). But while the solution for comparing coefficients in OLS—i.e., use of unstandardized coefficients as opposed to standardized coefficients—is clear, approaches suggested for addressing residual variation in logit and probit models have limitations.

Allison (1999) proposes a test that accounts for residual variation, but adds an assumption of identical regression coefficients for certain variables across groups. This latter assumption is difficult to justify. Williams (2009) suggests that in some cases “Allison’s procedure could make things worse rather than better” (p. 547). Other proposed solutions to the problem include use of predicted probabilities (Long 2009), use of ratios of coefficients (as opposed to coefficients themselves) or abandoning direct comparisons and analyzing the pattern of coefficient significance between the two models (Hoetker 2004).Footnote 1 Use of ratio of coefficients is straightforward and appealing, but it requires strong theoretical justification (ibid). Use of expected probabilities, as suggested by Long (2009), is another attractive option, but comparison of expected probabilities is complicated by nonlinear and non-additive associations between independent and dependent variables in logit models. The effects of the same magnitude would translate into different effects on the probabilities depending on the level of the initial probabilities.Footnote 2 Therefore, supplementing the coefficient estimates with changes in expected probabilities is informative, but comparisons in effects between groups with different initial probabilities should be done with caution. To summarize, at the time of writing “[t]here are no simple all-purpose solutions to the problems of interpretability and comparison of effect estimates from logistic regression” (Mood 2010, p. 80).

Hoetker (2004) indicates that one way to compare effects for two groups is to explore the direction and statistical significance of the coefficients. “If we model the two groups separately, the coefficients and standard errors are consistent within each group. The pattern of coefficient significance between the two models may provide some information.” (ibid, p. 16). While this information is not sufficient to be able to compare the magnitude of differences in effects, this solution is better than spurious results. We combine this approach with exploring changes in probabilities when we manipulate one independent variable at a time. Because probabilities are unaffected by residual variation (Long 2009), we can use them to better understand effects for females and males and compare findings from multinomial logit and mixed logit models.

Models were estimated using the R mlogit package (Croissant 2015; Train and Croissant 2015).

Model Selection

Our analysis was based on three models—multinomial logit, nested logit, and mixed logit. Following estimation of multinomial logit full models, we built models that excluded investigative and enterprising categories to test for IIA assumption. The IIA hypotheses were rejected for both models (i.e., the model for female students and the model for male students). Comparison of models for female students yields χ2 (31) = 5147.89, significant at the 0.01 alpha level. Comparison of models for male students yields χ2 (31) = 316.26, significant at the 0.01 alpha level. These results indicate the need to explore models that relax IIA assumption. Two approaches that relax IIA assumption—nested logit and mixed logit—were explored.

Our nested logit models (not presented here, but available from authors upon request) were based on two nests—a nest for investigative and realistic fields and a nest for social, artistic and enterprising fields. Based on the likelihood ratio test, the nested logit model for female students fits data better than the multinomial logit model for female students. The likelihood ratio test is \(\chi^{2} \left( 1 \right) = 63.00\), significant at the 0.01 % alpha level. At the same time, the nested logit for male students does not fit data better than the multinomial logit model. The likelihood ratio test \(\chi^{2} \left( 1 \right) = 0.19\), not significant at the 5 % alpha level. The concerning aspect for both nested models is that the log-sum coefficients are greater than 1—1.74 and 1.03 for females and males respectively. This means there is more substitution across nests than within nests. We concur with Train and Croissant (2015) that having a greater substitution across nests than within nests is not reasonable. With two nests—a nest for investigative and realistic fields and a nest for social, artistic and enterprising fields—we would expect that, if we excluded an investigative field from the choice set, the probability of choosing a realistic field would rise proportionately more than the probability of social, artistic, or enterprising fields. Similarly, if we excluded an artistic field from the choice set, the probability of choosing social and enterprising fields would increase proportionately more than the probability of realistic or investigative fields (Train 2009). Yet, based on the log-sum coefficients, the opposite is true. We also attempted other nest combinations, which led to improved model fit as evidenced by likelihood ratio tests. Yet none of our models led to the solution with greater substitution patterns within nests than between nests. Thus we failed to find nested models with reasonable substitution patterns for our choice of an academic discipline problem.

The mixed logit model in Table 3 extends the standard conditional logit model by allowing intercepts to be randomly distributed. Based on the likelihood ratio test, the mixed models fit data better than the multinomial logit model. For the models for male students the likelihood ratio test \(\chi^{2} \left( 4 \right) = 467.31\), significant at the 0.01 % alpha level. For the models for female students the likelihood ratio test \(\chi^{2} \left( 4 \right) = 156.4\), significant at the 0.01 % alpha level. One can observe that some of the random intercepts in our mixed logit model are negative, which intuitively does not seem to be right. However, as indicated by Hole (2007), the sign of estimated random effects in mixed logit models is irrelevant. Although in practice random effects in mixed logit might be negative, one can interpret them as being positive. Another observation is related to the fact that the random effects for only one category of the dependent variable—realistic—are statistically significant both for males and females.

Table 3 Mixed logit models of choice of academic majors with social disciplines as a reference group

In our calculations of expected probabilities, we followed Train and Croissant’s (2015) illustration of calculation of expected probabilities in R’s mlogit package. First we estimated the models with the actual values of all independent variables. The results of these models—referred to as fitted probabilities—are presented in Table 4 for female students and Table 5 for male students. After estimating models with actual values of independent variables, we change one variable at a time and calculate probabilities with the new values of this variable. Changes in expected probabilities between the fitted model and new models provide information on the magnitude of the effect of each independent variable. We present expected probabilities for multinomial logit models and mixed logit to gauge the consistency across these models when IIA assumption is violated. (The coefficients, standard errors, and indices of model fit for multinomial logit models are not presented here, but available from authors upon request).

Table 4 Effects of independent variables on choices of females: multinomial logit versus mixed logit models of choice of academic majors
Table 5 Effects of independent variables on choices of males: multinomial logit versus mixed logit models of choice of academic majors

Consistent across models, the fitted or average expected probabilities for females (see models in Table 4) are 0.33 for social disciplines, 0.13 for artistic disciplines, 0.10 for enterprising disciplines, 0.32 for investigative disciplines, and 0.11 for realistic disciplines. The fitted probabilities for males (see models in Table 5) are 0.08 for social disciplines, 0.06 for artistic disciplines, 0.19 for enterprising disciplines, 0.22 for investigative disciplines, and 0.45 for realistic disciplines. Fitted probabilities also match the observed shares of students who chose each of these disciplines, see Table 1. Next, we will discuss our findings and consistency of effects based on multinomial logit and mixed logit models.

Findings

Based on the models in Table 3, salary expectation has a significant positive effect on choice of a discipline by females and males. To translate these effects into expected probabilities, we estimated the effects of $1000 increase in expected earnings on the probabilities of choosing each of the disciplines. The effects of expected salary increases for females and males are presented in Tables 4 and 5. For example, the increase in expected salary for social disciplines leads to the 0.29 and 0.13 increases in expected probabilities of choosing a social discipline by females and males, see mixed logit models in Tables 4 and 5. The effects of salary expectations are generally greater based on mixed logit compared to multinomial logit models. Overall, substantial effects of salary expectations on choice of a discipline by females challenge our initial speculation that females overlook wages in their career choice.

Models in Table 3 indicate statistically significant associations between ethnicity and choice of realistic disciplines over social disciplines for both males and females. The expected probabilities in Tables 4 and 5 indicate that Caucasian students are less likely to choose a realistic discipline. The effect of ethnicity is consistent across models and holds for both males and females. As illustrated in Fig. 1, non-Caucasian female students are 1.70 (0.17/0.10) times as likely to choose a realistic discipline as Caucasian female students. Compared to Caucasian males, non-Caucasian male students have 1.16 (0.51/0.44) times the likelihood of choosing a realistic field. Due to a small number of racial and ethnic minority students at a study institution, we did not analyze different racial and ethnic groups separately. Further studies are needed to describe these differences in choices by Caucasian and non-Caucasian students.

Fig. 1
figure 1

Expected probabilities of choosing academic disciplines by Caucasian and non-Caucasian students. Note Expected probabilities in Fig. 1 are based on multinomial logit models

Two variables—financial stress and first generation status—were used to measure the family socio-economic status. Based models in Table 3, the effect of first generation status on choice of realistic disciplines over social disciplines by males is significant at the 1 % alpha level. Consistently across multinomial logit and mixed logit models, first generation status is associated with greater likelihood of choosing realistic disciplines by males. The expected probabilities by first generation status are also provided in Fig. 2. For males the expected probability of choosing a realistic discipline is 0.45 if their parents completed a four-year degree and 0.48 if their parents did not complete a four-year degree. Yet the expected probability of choosing a realistic discipline is the same for first-generation and non-first-generation females.

Fig. 2
figure 2

Expected probabilities of choosing academic disciplines by first generation students and students whose parent completed a Bachelor’s degree. Note Expected probabilities in Fig. 2 are based on multinomial logit models

Financial stress is associated with greater odds of choosing investigative and realistic disciplines over social disciplines by males and females and greater odds of choosing social disciplines over enterprising disciplines by females. In terms of predicted probabilities (see mixed logit models in Tables 4, 5), with one standard deviation increase in financial stress, the likelihood of choosing an investigative field goes up from 0.32 to 0.34 for females and from 0.22 to 0.23 for males, while the likelihood of choosing a realistic field goes up from 0.11 to 0.12 for females and from 0.45 to 0.47 for males. The effect of financial stress on the odds of choosing investigative and realistic fields is consistent with several prior studies (Goyette and Mullen 2006; Ma 2009): students who expect greater financial stress are more likely to choose technical and life sciences fields or “safe bet” majors with more favorable career outlooks and greater salary expectations. Unlike Davies and Guppy (1997), we did not use expected earnings as the dependent variable. The latter approach might be more appropriate for a study of the association between choice of a lucrative career and socio-economic status.

Consistent with Eide and Waehrer (1998), student choice of major is strongly associated with the degree they intend to complete. In our study degree expectations are measured on a scale from 1-Associate’s to 4-Doctoral. And, a one-point increase in degree expectations leads to an increase in the probability of choosing an investigative field from 0.22 to 0.42 for males and from 0.32 to 0.52 for females, see results for mixed logit models in Tables 4 and 5. Changes in expected probabilities of choosing investigative fields with one-point increase in degree expectations based on multinomial logit models are somewhat greater, but consistent with changes based on mixed logit. The higher the level of degree expected, the greater the odds of picking investigative disciplines by females and males.

Models in Table 3 indicate a statistically significant association between ACT quantitative (ACTQ) and having a passing grade in calculus, on the one hand, and choice of a discipline, on the other hand. Consistently across multinomial logit and mixed logit models, passing grades in calculus are positively associated with the likelihood of choosing realistic and investigative fields by females and realistic fields by males (see expected probabilities in Tables 4, 5). Females who pass calculus have 0.04 greater likelihood of selecting a realistic discipline, see Fig. 3. Passing calculus in high school by a female also leads to a 0.02 increase in the likelihood of going into an investigative field. Males who pass calculus have 0.09 greater likelihood of choosing realistic fields, see Fig. 3.

Fig. 3
figure 3

Expected probabilities of choosing academic disciplines by students who did and did not earn a passing grade in Calculus in high school. Note Expected probabilities in Fig. 3 are based on multinomial logit models

Greater ACTQ scores are also associated with greater probabilities of choosing realistic disciplines by males and females. This effect holds for both multinomial logit and mixed logit models, see Table 4. According to Fig. 4, females with ACTQ 30 are 1.40 (0.14/0.10) times as likely to choose realistic fields as females with ACTQ 25. Males with ACTQ 30 are 1.19 (0.50/0.42) times as likely to choose realistic fields as males with ACTQ 25. Interestingly, for females the probability of choosing investigative fields does not change as the ACTQ goes up. At the same time, the probability of choosing an investigative field is 0.04 lower for males with ACTQ 30 than for males with ACTQ 25, see Fig. 4.

Fig. 4
figure 4

Expected probabilities of choosing academic disciplines by ACT Scores. Note Expected probabilities in Fig. 4 are based on multinomial logit models

Effects of ACT verbal (ACTV) on choice of enterprising disciplines over social disciplines are negative and statistically significant for both females and males (see Table 3). Consistently across models, these effects translate into 0.02 decrease in the probabilities of selecting enterprising fields as ACTV increases from 25 to 30 (see Fig. 4 and Tables 4, 5 to compare consistency across models).

Based on models in Table 3, high school GPA is positively associated with the choices of investigative and realistic fields over social fields and negatively associated with the choices of enterprising and artistic fields over social fields. Some of these effects are not statistically significant for females, however. A point increase in high school GPA is associated with 0.04 and 0.05 increases in expected probabilities of choosing investigative fields by females and males (see expected probabilities for mixed logit models in Tables 4, 5). Probabilities of choosing realistic fields by females and males increase by 0.01 and 0.03. Changes in expected probabilities are consistent across multinomial logit and mixed logit models.

Years of mathematics in high school are positively correlated with choosing realistic fields, see Table 3. Adding one year of mathematics in high school translates into increases in the probabilities of choosing realistic fields from 0.11 to 0.14 for females and from 0.45 to 0.49 for males. Not surprisingly, years of science are associated with greater odds of choosing investigative and realistic fields. With one additional year of science in high school, the expected probability of choosing investigative fields changes from 0.32 to 0.43 for females and from 0.22 to 0.28 for males, while the expected probability of choosing realistic field changes from 0.11 to 0.12 for females and from 0.45 to 0.48 for males. The expected probabilities of choosing social, artistic, and enterprising fields go down as the number of years of science in high school goes up. Based on models in Table 3, taking an additional year of history or social science in high school is associated with a higher likelihood of choosing social fields over enterprising, investigative, and realistic disciplines by males and a higher likelihood of choosing social fields over investigative fields for females. An additional year of history or social science translates into 0.02 increase in the probability of choosing a social field by females and males (see mixed logit models in Tables 4, 5). Students who took more years of foreign language are less likely to enter investigative and realistic fields (see Table 3). Each additional year of foreign language translates into 0.02 and 0.01 decreases in probabilities of selecting investigative disciplines and in 0.01 and 0.03 decreases in probabilities of selecting realistic disciplines by females and males. These findings suggest that high school curriculum is an important factor in choice of a discipline both by males and females. Next, we will explore the effects of involvement in different extra-curricular activities on choice of an academic major.

On a scale from 1-not involved to 6-highly involved, students were asked to evaluate their involvement in extra-curricular activities during high school years. Based on models in Table 3, involvement in performing or visual arts is associated with greater chances of selecting artistic and realistic fields by females and males. This translates into 0.01 increases in expected probabilities with a point increase of involvement in performing or visual arts (see Tables 4, 5). The model in Table 3 indicates that females who report greater involvement in athletic teams are more likely to choose a social discipline over an artistic discipline, but this effect does not translate into a visible change in expected probability of choosing artistic fields with a point change in female involvement in athletic teams (see Table 4). For males a point increase in involvement in athletic teams is associated with 0.02 greater likelihood of choosing enterprising fields (see Table 5). A point increase in involvement in student government is associated with 0.01 increases in the probabilities of selecting enterprising fields by females and males. A point increase in involvement in publications does not lead to visible changes in expected probabilities for females (see mixed logit results in Table 4), but is associated with 0.01 increase in expected probabilities of choosing artistic and enterprising fields by males (see mixed logit results in Table 5). One-point increase in involvement in both academic honor societies and academic clubs leads to a 0.01 increase in expected probabilities of selecting realistic fields by males and females. A point increase in involvement in vocational clubs leads to 0.01 increases in probabilities of choosing enterprising disciplines by females and males. Involvement in religious youth groups and community service or volunteer work is positively associated with greater odds of picking social disciplines by female students. This finding does not hold for male students. It is remarkable that females also report greater involvement in religious youth groups, community service, and volunteer work (see Table 1). While prior involvements in extra-curricular activities have statistically significant effects on choice of discipline by females and males, these effects are rather small in terms of changes in expected probabilities. Some types of involvement—i.e., religious youth groups or community service—have visible effects on choices of females but not on choices of males.

Student self-ratings of their preparation to perform different academic tasks are measured on a scale from 1-not at all prepared to 6-very prepared. Models in Table 3 indicate that student self-ratings of their preparation to write clearly and effectively have significant positive effects on expected probabilities of choosing artistic fields over social fields for females and males and a negative effect on the expected probability of selecting a realistic field by males. A point increase in self-ratings of preparation to write is associated with the 0.01 increase in expected probability of choosing artistic disciplines by females (see Table 4) and 0.01 decrease in expected probability of choosing realistic disciplines by males (see Table 5). Students who are confident in their ability to speak clearly and effectively prefer social and enterprising disciplines over investigative and realistic disciplines. A point increase in self-rating of preparation to think critically and analytically is associated with slight increases in probabilities of choosing social, artistic and realistic fields for females and increase in the probability of choosing social disciplines by males. The effect of self-rating of preparation to analyze math and quantitative problems is stronger than the effects of self-ratings of ability to write, speak or think critically. For instance, a one-point increase is associated with 0.03 and 0.08 increases in the expected probability of selecting realistic fields by females and males. Confidence in one’s ability to use computing or information technology is associated with greater odds of choosing realistic over social fields (see models in Table 3). Finally, one’s greater confidence in ability to work effectively with others and learn effectively on one’s own is associated with lower odds of choosing a realistic discipline (see Table 3). To summarize, controlling for high school performance, test scores, high school curriculum and prior involvement in extra-curricular activities, confidence in one’s abilities and preparation has a significant effect on choice of major. Self-ratings of preparation to analyze math and use computing are most consequential for the choice of major. In most instances, the direction and significance of effects of self-ratings on selection of disciplines are consistent for males and females. At the same time, females tend to rate themselves lower in areas that have a strong positive effect on the choice of enterprising, realistic and investigative fields—e.g., ability to analyze math and quantitative problems—and higher in areas that have a positive effect on the choice of artistic fields—writing—and social fields—speaking.

Limitations

Our study has several important limitations. First, the analysis is based on data from a single moderately large research institution. Major choices and determinants of these choices might vary across institutions; and findings presented here might not apply to other institutions and institution types.

Our comparisons across different models—multinomial logit, nested logit, and mixed logit—and across models for different groups—males and females—is limited to stating significant positive and negative coefficients and analysis of changes in expected probabilities. Ideally, we would compare regression coefficients across genders and accompany these comparisons with statistical tests for the significance of differences. However, while this could be done in linear regression, similar comparisons for logit and probit models have been found misleading. While we took a rather conservative approach, this approach is superior to reporting spurious results.

Next, we did not go beyond random intercepts in our mixed model. Exploring the hypotheses of random effects of each of our independent variables might lead to an improved model fit.

Finally, we included a discipline-specific salary expectation variable—natural logarithm of expected salary—in our models. Based on this approach, the magnitude of the effect of salary expectations is very substantial and warrants additional exploration of different representations of salary expectations on choice of majors.

Conclusions and Implications

Our analysis of college major choices of entering first-time freshmen suggests that, compared to males, female students are more likely to self-select into social (Education, Psychology, Social Sciences, Health Sciences, and Nursing), artistic (Arts, Communications, English, Foreign Language, Music, and Theatre) and investigative (Agriculture, Biosystems Engineering, Sciences, and Mathematics) disciplines. Males are more likely to choose realistic (Architecture, Building Science, Engineering, and Computer Science) and enterprising (Business, Industrial and Systems Engineering, and Economics) disciplines. With one exception of a higher proportion of females in investigative disciplines, our findings align with prior studies (Fouad 2002; Betz and Gwilliam 2002).

Consistent with several prior studies (e.g., Goyette and Mullen 2006; Ma 2009), lower socio-economic status is associated with a slightly greater probability of choice of more lucrative careers and careers with more favorable outlooks. With respect to gender differences, we find that first-generation status leads to a greater likelihood of choosing realistic fields for males, but not for females. Financial stress has a greater effect on selecting investigative disciplines than realistic disciplines by females. The opposite is true for males. Overall, the association between lower socio-economic status and choice of sciences and engineering is a “glimpse of hope that students and their families take advantage of college major choice to ameliorate the constraining effects of low family SES” (Ma 2009, p. 227). However, further studies are needed to review low SES student persistence in these majors and their actual job placements, since better-off families might be able to help their children locate a more prosperous career.

“[M]any students choose to major in fields never intending to terminate their education with an undergraduate degree, but rather they intend to enroll in professional or academic graduate programs” (Eide and Waehrer 1998, p. 73). Consistent with findings of Eide and Waehrer (1998), highest degree intentions are strongly associated with the choice of investigative fields over other fields. However, in contrast to findings of Eide and Waehrer (1998), the effects of the highest degree intensions are not associated with greater expected probabilities of choosing liberal arts (social or artistic) disciplines. The expected probability of choosing an artistic or social discipline by females goes down as the degree expectation goes up. For males the expected probability of choosing an artistic or social discipline remains about the same as the highest degree expectation goes up.

Not surprisingly, student high school academic performance and curriculum are strongly associated with their choice of major. Consistent with Paglin and Rufolo (1990), quantitative ability is one of the most important factors in choice of major. For example, higher ACT quantitative scores are associated with greater likelihood of selecting realistic fields by both males and females. Interestingly, female students who had passing grades in calculus are more likely to enter either investigative or realistic fields, while males who passed calculus are more likely to enter realistic fields. One should also note here that fewer females report a passing grade in calculus; while males have higher ACT quantitative scores.

Turner and Bowen (1999) suggested that differences in choice of major between men and women reflect the effects of pre-collegiate preparation. Our study aligns with their observation. Students with more years of science are more likely to choose investigative and realistic fields. Years of history and social sciences are positively associated with the choice of social fields. Students with more years of mathematics are more likely to choose realistic fields. Years of foreign language are associated with greater probability of choosing an enterprising field. While the effects of years of mathematics and foreign language are consistent across gender, females report fewer years of mathematics and more years of foreign language completed in high school.

Involvement in religious youth groups and community service has a significant positive association with the choice of a social field over all other fields for female students. No such association was found for male students. Furthermore, our descriptive analysis reveals that females report greater involvement in these activities. Females’ selflessness appears to be a significant factor in their choice of social fields, which aligns with Ma’s (2009) observation of females’ intrinsic and altruistic motivation and males’ extrinsic motivation. Overall, the effect of involvement in extra-curricular activities on choice of major is statistically significant yet not always substantial in terms of expected probabilities.

Controlling for other characteristics, student confidence in their preparation to perform different academic tasks—writing, speaking, critical thinking, analyzing math, using computing, working effectively with others and on one’s own—has a significant association with choice of major. The magnitude of these effects is the greatest for the ability to analyze math and use computing thus supporting prior findings (Betz and Hackett 1983). Our study also supports the claim that the mathematics self-efficacy is stronger for males than for females (ibid). Overall, consistent with Smart et al. (2000), our study provides empirical evidence of association between student self-rated abilities and chosen academic fields. Because of weaker ratings of one’s ability to analyze quantitative problems among female students, college counselors need to take into account that encouragement and reassurance might be more important to females than to males.

Given the alignment between prior involvement in extra-curricular activities, self-rated abilities, and a major choice, we concur with Smart et al. (2000) suggestion of an alternative curricular that—unlike the typical curricular pattern of general-education courses in the first year of studies—allows students to take courses in subjects that are more congruent with their interests and abilities. Since high school experiences and expectations of females are different from those of males, further studies are needed to better understand the reasons behind less interest in mathematics and greater interest in volunteering and religious groups among females. Meanwhile, it is not clear whether counselors should encourage female participation in engineering or business careers as these careers might not always fit their interests and aspirations. Further studies are needed to understand the association between interests and aspirations, on the one hand, and retention in major or satisfaction with the chosen career path, on the other hand.

While our study sheds light on some factors that affect choices of academic disciplines by males and females, it left many questions unanswered. “[T]he career development of women … is demonstrably more complex due to a socialization process that has emphasized the dichotomy of work and family” (Fitzgerald and Weitzman 1992, p. 125, cited by Patton and McMahon 1999, p. 91). Further studies are needed to explore the gender role socialization, occupational stereotypes, or gender-biased counseling (Betz 1994) on choice of an academic discipline by female students. For example, is women’s choice of teaching or nursing related to their desire to marry upward and secure status as suggested by Psathas (1968)?

From the methodological point of view, our study explores the effect of violation of independence of irrelevant alternatives (IIA) assumption on the estimates of multinomial logit model and draws attention to the possible problem—a greater substitution across nests than within nests—that can be encountered while using a nested model. Despite the violation of IIA assumption, the findings from multinomial logit and mixed logit are generally consistent. Our study also demonstrates the importance of exploring changes in expected probabilities in addition to statistically significant effects. In some instances, manipulating independent variables that have statistically significant effects do not lead to visible changes in expected probabilities.