Introduction

Peers, loosely defined as fellow students, friends, neighbors, and roommates, have long been viewed as having an important influence on one’s behavior and learning. The socially optimal method for distributing educational opportunity, as well as the benefits of education policies such as affirmative action, school vouchers, ability tracking, and school choice, hinge to a large extent on the existence, size, and direction of peer effects (Goethals et al. 1999; Hoxby 2002; Winston and Zimmerman 2003; Zimmerman 2003; Griffith and Rask 2014).

Yet the empirical testing of peer effects has proved to be a rather daunting challenge. Manski (1993) proposed that there are at least three competing hypotheses that can explain observed peer effects: “(a) endogenous effects wherein the propensity of an individual to behave in some ways varies with the behavior of the group; (b) exogenous (contextual) effects wherein the propensity of an individual to behave in some way varies with the exogenous characteristics of the group and (c) correlated effects wherein individuals in the same group tend to behave similarly because they have similar individual characteristics or face a similar institutional environment.” The effects of these three hypotheses are captured in the following equation.

$$Y = \alpha + \beta E\left( {Y|X} \right) + E\left( {Z|X} \right)\gamma + Z\eta + \mu ,\quad E\left( {\mu |X,Z} \right) = X\delta$$
(1)

where Y is the outcome variable, (Z, µ) are the student’s own attributes that directly affect Y (e.g., socioeconomic status and ability), and X are attributes characterizing the peer group (e.g., a youth’s school or ethnic group). Therefore, coefficient γ indicates the exogenous effects when individual behavior varies with the pre-treatment group characteristics. Coefficient β corresponds to the endogenous effects occur when individual behavior varies with the during-treatment behavior of the group. Finally, correlated effects (coefficient δ) are those driven by the self-selection of individuals into groups.

Since the first proposal of Goethals et al. (1999), studies using (conditional) random assignment of students into housing units to evaluate peer effects in higher education have mushroomed.Footnote 1 Because the assignments are by and large random, the correlated effects (potential self-selection into rooms) are eliminated (δ = 0) and Eq. (1) will be reduced to Eq. (2). However, the endogenous effects are still present. As Sacerdote (2011) summarizes, “most papers have one source of exogeneity and do not separately identify the exogenous and endogenous peer effects.”

$$Y = \alpha + \beta E\left( {Y|X} \right) + E\left( {Z|X} \right)\gamma + Z\eta + \mu ,$$
(2)

These studies have so far provided mixed evidence of academic peer effects. Few find consistent evidence that roommates’ admission test scores or previous academic ability is linearly related to their own GPA (Sacerdote 2001; McEvan and Soderberg 2006; Carrell et al. 2009; Griffith and Rask 2014). A number of authors find that peer effects, when they do exist, are most pronounced among certain subpopulation groups. For example, Zimmerman (2003) and Winston and Zimmerman (2003) find that students in the middle of the SAT distribution are more susceptible to the influence of poor quality peers. Other studies have found that male or female students are more likely to be influenced by their peers (Duncan et al. 2005; Zimmerman 2003; Han and Li 2009). Another interesting strand examined the peer effects on students’ major or job selection (Sacerdote 2011). Marmaros and Sacerdote (2005) found that students not only are influenced by their roommates’ career choices but also lean on their peers in the job search process. De Giorgi et al. (2007) used repeated randomization of students to classes at Bocconi University in Italy and found that peers have a significant impact on students’ major choices.

In contrast, studies on the effects of roommates on social behaviors (especially those that are considered to be risky behaviors such as binge drinking, marijuana use, unprotected sex, and academic cheating) generally show consistent and strong results (Marmaros and Sacerdote 2006; Duncan et al. 2005; Stinebrickner and Stinebrickner 2008; Carrell et al. 2008; Tsai 2012). Concurrence is also found in other social issues such as sympathy toward the social groups to which their roommates belong and participation in civic activities in which their roommates were active (Duncan et al. 2006; Klofstad 2005).

In Chinese universities, students from the same department/faculty are not only randomly assigned to rooms but also live in much closer proximity to each other, and they live with the same roommates for 4 years with few exceptions.Footnote 2 The close proximity and anticipation of long-term relationships should provide them strong incentives to interact. On the other hand, qualitative researchers have provided some counterarguments. Cramped living space may force students to conduct most of their studying in library and lecture halls as opposed to dorms (Wang 2007; Xiao et al. 2015; Zheng 2015). The overcrowding condition of dorms and diverse background and habits of students also breed opportunities for disagreement and personal clashes. The increasing number of single children in the recent cohorts of college students in China may further exacerbate the interpersonal relationship at the dorms (Huang 2007; Chen 2013; Zhao and Su 2015). Last but not the least, competition among them for academic scholarships, which are awarded largely by based on their academic performance, serves as a countervailing factor for the peer effects to take place. Students may not want to share their insights on learning even with their roommates because it may reduce their comparative advantage. This effect should be more prominent in elite universities. Han and Li (2009) were the first to take advantage of the context of an elite Chinese university in Eastern China to examine the residential peer effects among roommates. They found weak roommate effects for females on academic performance and quite robust effects on social outcome such as students’ party affiliation. Zhang et al. (2011) used five cohorts of students at the School of Economics and Management of Tsinghua University to examine the peer effects. They found positive but insignificant effects of roommates but strong and significant effects of classmates. Nonetheless, this was based on a rather small sample of one school in Tsinghua University. Lu (2014) used variation in the English proficiency of students’ peers caused by the changes in the admission policy of a university in Jiangsu Province in China that brought a large number of specially admitted low-score students into many academic departments. She found that these low-score students significantly reduced the performance of the regular students in a difference-in-difference setup. However, the allocation of low-score students into different departments was unlikely to be exogenous.

The paucity of evidence regarding peer effect of higher education in the Chinese setting is in stark contrast with the policy intention by universities to use room assignment to promote national cohesion and exposure. In addition, China now boasts the world’s largest higher education system. Leveraging the peer effects of roommates can be a useful way to improve human capital investment. In this paper, I take advantage of the random assignment of students to rooms at an elite Chinese university where peer quality is better measured compared to Han and Li (2009) and Zhang et al. (2011). My findings suggest that students are likely to influence each other in course and major selection, but they do not influence each other in academic performance. Specifically, students are likely to be influenced by a roommate’s decision when it comes to the number of course credits taken and, to a lesser extent, the choice of major. Living with roommates that on average take one SD more elective credits is found to boost students’ elective course credits by one-fourth of a SD. However, students who are randomly assigned to live with roommates with higher college admission test scores and English proficiency do not outperform other students in terms of overall GPA and grades for English courses.

The remainder of the paper is structured as follows: “Institutional background of Chinese higher education” section explains the college admissions system in China and how it relates to the room assignment process. “Data description” section describes the data used in this study and confirms the conditional randomness of room assignment. “Empirical results” section describes the empirical strategy to examine the peer effects and provides the empirical results. “Sensitivity and heterogeneity analysis” section presents results from robustness check and sensitivity analysis, while “Conclusions and discussion” section concludes and briefly discusses the implications.

Institutional background of Chinese higher education

China’s higher education institutions have a highly centralized and structured enrollment system in which admissions committees at the provincial level operate under the supervision of the Ministry of Education.Footnote 3 As a general rule, admission is granted primarily on the basis of students’ test scores on the College Admission Test (CAT). The CAT, which is administered by the Ministry of Education in early June each year, has two tracks: arts and sciences. While candidates may sit for only one of the two, they may list up to four universities and departments they wish to enter in order of preference among each of the four tiers of the higher education system. Admissions decisions to a university are primarily based on students’ CAT score and their preference ordering of their desired departments.Footnote 4 Since the late 1990s, an increasing number of provinces and municipalities have set up their own province-wide examination in lieu of the CAT. In 2004, there were 15 different exams across China. Therefore, CAT scores were not comparable for students from different provinces. Students are usually admitted to a specific department in a university and in turn the departments are in charge of students’ study for the next 4 years. Decisions such as financial aid, scholarships, and the opportunity to go directly to graduate school are all made at the department level. In other words, students will compete with their peers in the same department for these honors. This may have implications for how peer effects work in this setting.

The university that this study focuses on is one of the most selective colleges and one of the best research universities in China, with approximately 3000 students in each class. The university sets quotas for each province roughly proportional to their population share.Footnote 5 This university also has an early admission track for students with truly outstanding high school academic performance, special athletic talents, and artistic merits.Footnote 6

Students are registered in their home department and are required to take 140 academic credits in order to graduate. There are basically two kinds of courses: required and elective.Footnote 7 Both types are comprised of university-wide and department-wide courses. University-wide required courses together total 30–34 of these credits, university-wide elective courses total 16 credits, and department-wide required courses account for the largest part, or 50–54 credits. Department-wide elective courses comprise another 24 credits. Students are required to take core courses in their freshman year, which are primarily required courses. Students need to take at least 14 academic credits in a given semester but cannot go beyond 25 academic credits.Footnote 8 Courses count for one to four credits. Therefore, the possible number of courses taken within this set of constraints is very wide.

Final course grades are calculated in light of students’ performance on problem sets, a mid-term, a final exam, and class participation. Official GPA is calculated using a universal formula that first translates the original grade into a grade point and then weights them by their course credits. This formula suffers from losing useful variation in students’ academic performance. Instead, I calculate the weighted average course grade, which is average course grade weighted by their course credit. This weighted average course grade ranging from 0 to 100 will be referred to as GPA thereafter.

Starting in the spring term of the first academic year, students can apply to switch departments, and they also have a choice to enroll in other departments’ double major or second-degree programs provided that their academic performance or qualifications meet the requirements of the target departments.Footnote 9

Students will select their majors after the freshman year. Major is a more-refined subfield within the department. For example, students in the physics department have three choices for their major: physics, atmospheric science, and astronomy. The exact timing of this process varies from department to department. Four departments have completed this process by the fall semester of the sophomore year.Footnote 10

College English

College English is a series of required courses for all students except students majoring in English, French, German, and Spanish literature. Depending on their initial English proficiency gauged by the English Placement Exam (EPE) conducted each fall right after their matriculation, students are assigned into four different starting levels of College English: Level 0, Level 1, Level 2, and Level 3. Level 3 is the highest level that students could start in the first term, and it requires a total score of 85 or higher on the EPE. The cutoffs for Level 1 and Level 2 are 59 and 76, respectively. Level 0 is for students whose second language is not English.Footnote 11 Students are expected to move up the ladder progressively term by term up to Level 3.Footnote 12 After finishing Level 3, students need to choose other elective English courses if their total English credits fall short of 6 credits (equivalent of three courses). All students (excluding those students whose initial second languages are not English) are required to pass the College English Level 4 exam by the time of graduation though no course is provided at Level 4.Footnote 13

Roommate assignment

Unlike the US setting where universities usually ask students to fill out a survey on their housing preferences and assignment is conditional on those factors, no students have any say in choosing their roommates at this elite Chinese university. Rooms are assigned to students by their departments in late August right before their matriculation at the university and students from the same department are assigned together whenever possible. To understand the exact room assignment process, I interviewed each department individually about its assignment mechanism. The majority of the departments deliberately mix students from different provinces with the intention of increasing national cohesion by exposing students to different subcultures in China. However, there are three departments and schools, namely the Department of Economics, the School of Environment, and the School of Journalism, which stratify students by their province of residence. Switching rooms after assignment is very rare because departments strongly discourage students from doing so. Administrative records show only 13 such cases or 0.5% for the first year. This gives us confidence about the randomization of students into rooms. Nonetheless, we offer a formal test later in “Random Assignment Checks” section.

Data description

The data on the 2008 class who entered the university in the fall of 2004 were gathered from the university registrar’s office and housing office and include students’ basic demographics, housing information, and detailed transcript records for their first 2 years in college.

Outcome variables include students’ grades on College English courses and year-by-year GPA as well as cumulative 2-year GPA, number of course credits taken, and major of choice.Footnote 14 The academic quality of a student was measured by their CAT scores and their EPE score. Other pre-treatment demographics characteristics include their age, gender, province of origin, party membership, and type of household registration (rural or urban residents). Because of the previously mentioned decentralization of CAT examinations in China since the mid-1990s, the CAT scores are not comparable across provinces for the 2008 class. I created students’ percentile rank within the university by assuming that the relative distribution of students’ academic ability measured by CAT scores across provinces is stable over time (from the 1999 class to the 2008 class). In other words, if a student ranking in the first percentile from province A was near to the fifth percentile in the science track in the 1999 class in the university, the first percentile student from province A in the 2008 class will also receive a percentile rank of fifth in that university. Students who are admitted on the basis of merit (meaning they do not have to take the CAT in the first place) and therefore missing their CAT score are assigned the highest percentile rank of that province in 1999 class. This is a major improvement in measuring the quality of one’s peers with relative precision compared to other studies on peer effects in Chinese universities.

Data from different sources are merged together by person-specific student IDs. I start with a sample of 3215 students. Of these, 170 students are dropped because they are enrolled in a dual-degree program or they live with such students, 22 students are dropped because they belong to earlier classes, along with two students who live alone, and 57 students whose housing data are missing are also dropped. Another 275 are dropped because they are Medical School students and do not report any CAT grade or household registration information. This leaves me with a base sample of 2689 students. This sample will be used to check the randomness of room assignment.

Depending on the outcome variables of interest, I use different samples for statistical analysis. When I focus on the peer effects for students’ College English course grades, I use the English Placement Exam (EPE) sample, which includes the College English courses in the first two semesters. One hundred eleven students who major in Western Language and therefore did not take the College English course are dropped. Forty-five students who are missing their EPE score are also dropped. Eleven students are dropped because they are missing their College English course grades. One hundred seventy-six students whose second languages are not English are also dropped. A variable is generated to indicate the number of roommates missing EPE scores. The sample size is reduced to 2346. When the effect of roommates on students’ GPA is examined, I use the College Admission Test (CAT) sample, which covers students with comparable CAT scores. The CAT sample has a smaller sample size because the CAT scores are not comparable for students from five provinces even though I go back to 1999 class. Therefore, 455 students are dropped along with 18 students who live primarily with students from those five provinces and therefore have no measure of peer quality, plus 18 students who had no record for courses taken. A variable is generated to indicate the number of roommates missing CAT scores. The final CAT sample size is 2198.

Tables 1 and 2 contain summary statistics for the EPE and CAT samples, respectively. On average, students take 86.7 academic credits in the first 2 years. Students in their sophomore year take slightly fewer credits than in the freshman year. Mean GPA on required courses in the first 2 years is 81.2 with a SD of 6.26. The sophomore year GPA is significantly lower than the freshman year’s GPA with a larger SD. For College English courses, 986, 1025, and 335 students are assigned to College English Level 1, Level 2, and Level 3, respectively, for the first term. Approximately, 50% of students enroll as science majors, 30% as social science majors, and 20% as humanities majors.

Table 1 Summary statistics for the 2008 class (EPE sample)
Table 2 Summary statistics for the 2008 class (CAT sample)

Given the fact that I am using data from an elite university, there are legitimate concerns about the lack of variation in roommates’ academic qualifications in the first place. First, initial English proficiency at this university is fairly dispersed with a mean EPE score of 76.7 and a SD of 7.93 (Table 1). I argue that the province-based quota in the admission system at the university and the huge gap in education quality across China’s provinces provide me with more variation than traditionally available from a homogeneous student body.Footnote 15

Random assignment checks

Examination of the housing records shows that the housing data are in line with the departments’ policy. Only 0.07% of students live in a two-person room, 5.9% of students live in triples, and 94%—the majority of the student body—share their rooms with three other students. About 86% of students live with other students from the same department, and another 12% of students live in two-department dorms. The mean number of provinces represented in a room is 3.51. It is not surprising since the majority of the departments intend to mix students from different provinces in a room.

However, it is still possible that departments intentionally or subconsciously assigned students of certain characteristics into rooms. Following Sacerdote (2001), students’ own characteristics were regressed on roommates’ average characteristics conditional on a three-way interaction term between province, department, and gender to examine the randomness of room assignment at this university. The inclusion of a three-way interaction term ensures that identification only comes from the within-department–province–gender cell to reflect the fact that randomization occurs inside each unique combination.

$$X_{ijkl} = \alpha + \beta X_{ijkl}^{{\overline{RM} }} + \pi_{jkl} + \varepsilon_{ijkl}$$
(3)

where X ijkl represents the pre-treatment characteristics (age, ethnicity, residential status, party membership, EPE score, and CAT percentile rank) of student i in department j from province k with gender l. \(X_{ijkl}^{{\overline{RM} }}\) represents the roommates’ average characteristics. π jkl is a set of interaction terms between students’ department, province, and gender. β is the variable of interest. If β is not significantly different from zero, it means that the roommate assignment process does not generate correlation among observed students’ characteristics. Therefore, it is unlikely that the same process will generate correlation among unobserved students’ characteristics.

In practice, the number of interaction terms amounts to over a thousand, which renders identification very difficult due to fewer observations within each cell. In other words, there are too many possible combinations of department–gender–province that there may not be enough observations to efficiently estimate the coefficient.

An immediate caveat is that using the three-way interaction terms above is likely to induce negative correlation for students in the departments that stratified students by their province (Kremer and Levy 2003). To see this, think of a case where there are only two male students (AB) from province X who are admitted to department Y, and they are put into a room of two because their department likes to lump students together. If student A has characteristics above the average, student B must have below-average characteristics. The same logic applies to situation of three or four roommates.

Following Kane et al. (2006), I use department–gender plus province fixed effects instead of the three-way interaction fixed effects. This specification allows comparisons across department and gender as long as students are from the same provinces, while the three-way interaction focuses only on students from the same province registered in the same department with the same gender.

$$X_{ijkl} = \alpha + \beta X_{ijkl}^{{\overline{RM} }} + \tau_{jl} + \upsilon_{k} + \varepsilon_{ijkl}$$
(4)

Table 3 shows the results for Eq. (4) across a variety of pre-treatment student characteristics. None of them shows a significant relationship between students’ and their roommates’ characteristics at the 5% confidence level using T tests.

Table 3 Own pre-treatment characteristics regressed on roommates’ mean pre-treatment characteristics

It may well be that departments take into account a variety of students’ characteristics in the assignment process. Therefore, following McEvan and Soderberg (2006), I regress students’ own characteristics on a full set of roommate characteristics. Again, results show that roommates’ characteristics have no predictive power on students’ characteristics gauged by the P values from the F tests.Footnote 16 These randomization checks reassure us that the assignment of students to different rooms is likely to be orthogonal to students’ characteristics except those explicitly used (Table 4).

Table 4 Own pre-treatment characteristics regressed on full set of roommates’ mean pre-treatment characteristics

Empirical results

Empirical strategy

In order to quantify the effects of peers on students, I use the “linear in means” approach similar to Eq. (2) where students’ academic performance is interpreted as a function of students’ own academic ability, roommates’ average academic ability, and other demographics.

$$Y_{ijkl} = \alpha + \beta_{1} X_{ijkl} + \beta_{2} X_{ijkl}^{{\overline{RM} }} + Z_{ijkl} \varphi + \pi_{jkl} + \varepsilon_{ijkl}$$
(5)

where Y ijkl is the student’s GPA (grade for College English course) for student i in department j from province k with gender l; X ijkl is the student’s percentile rank in the university (own EPE score); \(X_{ijkl}^{{\overline{RM} }}\) is the roommates’ average percentile rank in the university (own EPE score), Z ijkl is a vector of demographic variables indicating the student’s ethnicity and whether the student comes from rural areas. Three-way interaction fixed effects between gender, department, and home province are included to reflect the fact that randomization occurs inside each unique combination. As long as randomization is confirmed, the β 2 coefficient on \(X_{ijkl}^{{\overline{RM} }}\) will yield an unbiased estimate of peer effects. It is worth keeping in mind the concern of using the three-way interaction terms mentioned earlier. Therefore, the alternative strategy is to include department–gender fixed effects and province fixed effects separately.

$$Y_{ijkl} = \alpha + \beta_{1} X_{ijkl} + \beta_{2} X_{ijkl}^{{\overline{RM} }} + Z_{ijkl} \varphi + \tau_{jl} + \upsilon_{k} + \varepsilon_{ijkl}$$
(6)

In the next section, I show results from these two specifications.Footnote 17

Peer effects on English proficiency

Table 5 shows the regression results using College English course grade as the dependent variable. In panel 1, data from both fall and spring terms are stacked together with term-specific course-level fixed effects. Panels 2 and 3 present results when regressions are run separately for the two terms with course-level fixed effects. All regressions are OLS and standard errors are adjusted for heteroskedasticity and clustering at room level. Across all panels, Column (1) presents results of the College English grade regressed on own EPE and roommates’ average EPE scores. In Column (2) covariates of own characteristics are added to the model. Department fixed effects are added in Column (3) to difference out the potential positive correlation at the department level.Footnote 18 Column (4) further controls for department–gender fixed effects. Column (5) best mimics Eq. (6) while Column (6) presents results for Eq. (5). All regressions starting in Column (2) include controls for the number of roommates and the number of roommates who are missing their EPE score. Results from my preferred specification are shown in Column (5).

Table 5 Peer effects on college English course grade

Across all panels, the effects of roommates’ average EPE scores on own College English grade tend to be positive and modest but completely disappear both in the statistical and economic sense when department fixed effects are added to the model. This reflects the fact that students are not randomly assigned into departments but rather are a self-selected group with similar interests and qualifications (in this case, their initial English proficiency). When two-way or three-way interaction terms are added to the model, the coefficients estimates on roommates’ EPE score become even smaller. In any case, none of them are statistically significant.

In contrast, own EPE scores and other characteristics such as gender, ethnicity, and party membership consistently predict students’ performance in College English courses across all specifications. Specifically, a one SD (ten points) increase in EPE score translates into an advantage of 4.4 points in College English grade, which is as large as two-thirds of a SD. The results from panel 2 and panel 3 reveal a striking pattern of the effects of own EPE score. The size of the effects shrink by almost 20% in the spring semester compared to the fall semester.

The effects of other demographic indicators are fairly large and stable across the two semesters. Male students score 2.5–2.6 points (one-third of a SD) less than female students. Han majority students and students affiliated with the party enjoy a 1.3 point (one-fifth of a SD) and a 0.8 point (one-sixth of a SD) advantage over ethnic minority and non-party-member students.

Peer effects on GPA

Table 6 shows the regression results using an individual student’s GPA from required courses as the dependent variable.Footnote 19 In panel 1, a student’s GPA is calculated from all the required courses he or she has taken in the first 2 years in college. Panels 2 and 3 present results for freshman year GPA and sophomore year GPA separately. All regressions include controls for the number of roommates and the number of roommates who are missing their percentile rank measures. All regressions are OLS and standard errors are adjusted for heteroskedasticity and clustering at room level. Column (1) to Column (6) follow the same specifications as in previous section on EPE sample. Again, results from the preferred specification are shown in Column (5).

Table 6 Peer effects on GPA (first 2 years)

Across all panels, roommates’ average percentile rank seems to be negatively correlated with students’ GPA although these results are not statistically different from zero. However, the coefficient estimates are large (one-seventh) relative to the effects of students’ own percentile rank. The 95% confidence interval for the coefficients ranges from −2.08 to +0.59.Footnote 20 Again, own percentile rank and other student characteristics such as gender, ethnicity, and party membership consistently predict students’ GPA in required courses. Specifically, a one SD (29 percentile rank higher) increase in percentile rank translates into an advantage of 1.51 points in GPA, which is as large as a quarter of a SD. The directions of the effects of other demographic indicators are the same as reported earlier except that the rural students performed worse than urban students.Footnote 21 Male students score 2.2 points (one-third of a SD) less than female students. Han majority students and students affiliated with the party enjoy 2.0 (one-third of a SD) and 1.4 points (one-fifth of a SD) advantage over ethnic minority and non-party-member students.

Peer effects on numbers of course credits taken

The number of academic credits completed can be viewed as another measure of students’ progress in college. Meanwhile, students also need to strike a balance between the speed of progress and the grade received because grades are an important determinant of merit-based scholarships, future job placement, and the opportunity to attend graduate school, etc. I expect that students will consult each other about the process and share information on the courses of high interest. However, this benign description of roommates is not indispensable for peer effects to happen. The influence of one’s roommates on the number of course credits taken by a student may come from information flow from roommates’ course-taking behavior, peer pressure, or simply herding behavior.

Table 7 shows regression results where students’ elective course credits completed in the first 2 years, freshman year, and sophomore year are regressed on roommates’ average elective course credits. Students’ percentile rank and demographic variables are controlled for in Column (2). Department fixed effects are added in Column (3) to control for the different workload or distribution of course credits over 4 years across departments. The usual two-way and three-way interaction terms are added in Columns (5) and (6), respectively.

Table 7 Peer effects on electives credits

Across all specifications, roommates’ average course credits are positively correlated with students’ own course credits. Over a two-year period, having roommates that on average take 8.15 more credits (one SD) in electives will increase a student’s course credits by 0.283 (one-fourth of a SD). When freshman year and sophomore year data are examined separately, the peer effect is found to be slightly stronger in the freshman year than in the sophomore year. The number of elective course credits taken is not determined by students’ own percentile rank. The number of course credits also does not appear to differ across students’ gender, residential status, party membership status, and ethnicity; therefore, their results are not reported here.Footnote 22 Since there is no selection on observables, it is unlikely to have selection on unobservables. This gives me confidence on the causal nature of my estimates.

Peer effects on major selection

The selection of major is likely to determine students’ academic life in college and their professional careers later. This section examines the role of roommates on this process. I take advantage of the largest department at this university, which assigns students to four different majors at the end of their freshman year. There are two very large majors, each consisting of around 110 students (Majors A and B). A smaller major has 70 students (Major C) and the smallest major only has 30 students (Major D). Dummy variables indicating whether students select the major as their choice are created for each student–major pair. This dummy variable is summed at room level to calculate the number of roommates (excluding the student herself) who select a particular major.

Table 8 presents results where dummy variables indicating whether a student selects this major as the major of choice are regressed on a set of dummy variables for the number of roommates who select the same major and students’ own freshman year GPA. Specification (2) controls for other demographics and the number of roommates that come from this department. Province fixed effects are added in specification (3). The number of roommates that select a certain major is a significant determinant of own major of choice. The set of dummy variables for number of roommates in the same major are jointly statistically significant across specifications and majors at the 10% level. Having two roommates in a major will increase a student’s probability of selecting the same major except for the smallest major D. However, the effect of having one or three roommates in a major differs from major to major. Own freshman year GPA consistently predicts students’ major choices. Students who select major C tend to have a higher GPA, whereas majors A and B are more likely to be the choices of low-performing students.

Table 8 Peer effects on major choice (results for linear probability model)

Sensitivity and heterogeneity analysis

Several previous studies suggest that peer effects are heterogeneous and/or nonlinear among different populations. This section discusses the heterogeneity and nonlinearity of peer effects.

First, I include the quadratic terms of roommates’ average percentile rank or EPE score in the regression. The quadratic terms are not jointly significant with a p value of 0.534 and 0.201 for the EPE sample and CAT sample, respectively when department–gender and province fixed effects are controlled for (see column 2 in Table 9). This is true across specifications as long as department fixed effects are included. I also try to use different functional forms of roommates’ average characteristics to account for the fact that maybe only the really “good” or “bad” roommates matter. However, when students’ own GPA is regressed on a dummy variable indicating whether a student has a really good (top 10 percentile) or a really bad (bottom 10 percentile) roommate conditional on the usual covariates, the coefficient is statistically insignificant from zero (results not shown here).

Table 9 Sensitivity analysis of peer effects on college English grades and GPA

I explore the heterogeneous peer effects with respect to students’ gender and department type by running separate regressions for each subpopulation. Results from columns 3–4 in Table 9 suggest that neither male nor female students are affected by their roommates’ academic ability. No peer effects are found for students who are enrolled in science or non-science departments. I also interact students’ own EPE score and CAT percentile rank with their roommates’ average EPE score and CAT percentile rank to examine whether the peer effects differ by students’ initial academic ability. However, the results indicate that there is no such evidence. The t statistic for the interaction term is never larger than 0.5. Also, the interaction terms and the level of roommates’ average EPE score or CAT percentile rank are not jointly significant. As a final robustness check, departments that stratify students are excluded from the sample to see whether the negative correlation among roommates biases downward the estimates for peer effects. No such evidence is found either.

Conclusions and discussion

Since the late 1990s, research on peer effects in higher education in the USA has mushroomed, taking advantage of the randomization of students into dorms or squadrons. Previous literature has provided mixed evidence regarding the effect of roommates on students’ academic outcomes but consistent effects on social outcomes. A couple of earlier studies examining peer effects in the context of elite Chinese universities show no or limited effects of roommates on one’s academic performance (Han and Li 2009; Zhang et al. 2011). However, the quality of peers was not precisely measured in their studies.

This paper takes advantage of the roommate assignment system in a highly selective Chinese university to reexamine peer effects on academic performance among college roommates. In particular, I am able to measure students’ academic ability with more precision and confidence. First of all, students’ English proficiency was gauged by the English Placement Exam (EPE) conducted each fall right after their matriculation for all students, a common practice largely overlooked by earlier studies. Second of all, unlike earlier studies that turn a blind eye to the fact that CAT scores are not comparable across provinces for the 2008 class, I created students’ percentile rank within the university by assuming that the relative distribution of students’ academic ability measured by CAT scores across provinces is stable over time (from the 1999 class to the 2008 class).

I find that randomly assigned roommates’ average pre-treatment academic abilities are not significant determinants of students’ academic performance across a number of different specifications. Particularly, students who are randomly assigned to live with roommates with higher College Admission Test scores and English proficiency do not seem to outperform other students in terms of overall GPA and course grade for English courses.

On the other hand, students are indeed influenced by roommates’ decisions when it comes to the number of course credits taken and, to a lesser extent, major choice. Living with roommates that on average take one SD more credits in elective courses is found to boost students’ elective course credits by one-fourth of a SD.

Yet there are a couple of important caveats with regards to the findings of this paper. First, the focus on one elite Chinese university begs the question of the external validity of the results. One has to be cautious in extrapolating the results to other higher education settings in China. Second, the paper’s findings will have more policy relevance if it is supplemented by qualitative research that decipher the black box in peer effects or the lack of it among college roommates in China. For example, do the findings suggest that students are more likely to be influenced by roommates’ behavior that is easily observed? Is there any heterogeneity in terms of susceptibility of students to peer effects (e.g., certain students are more likely to be influenced by their peers)? Do we know the causal mechanisms behind the observed positive correlation between students’ choice of major and their peers’? Is this due to peer influence or is this decision of the students made simply out of convenience to save time on procuring study materials, sharing information on subject matters, and opportunities for studying interactions? These are all important directions for future research.