Introduction

Sleep is an essential part of daily living and is vital to maintaining normal levels of physical activities and cognitive skills [1, 2]. As recommended, adolescents (10–17 years old) sleep for 8.5–9.5 h per night and those persons ≧18 years of age obtain 7–9 h of sleep per night [3]. Adequate quality sleep is important for human. Lack of sleep or inadequate sleep can lead to brain dysfunction, immunological and metabolic collapse, and if it is sufficiently prolonged, it will ultimately lead to death [4]. What is more, poor sleep quality has been associated with increased medical expenses, poor working performance, and elevated risk for psychiatric disorders [5]. In a word, it has a negative effect on general health and also on quality of life [6].

Because of the unique setting of living condition such as added stress with academic challenge, social interaction within the mixed cultural pool, medical students have high prevalence of mental distress [7]. And it is suggested that medical students experience high rates of emotional problems and suicidal ideation such as anxiety, depression, burnout or even symptoms resembling post-traumatic stress disorder [8, 9]. Beginning as early as the 1960s, a lot of articles have emphasized the need to offer support for medical students to protect and enhance their self-care and well-being [10, 11]. The physical and mental health of medical students has got more and more attention. Due to academics and rigorous clinical training, medical students are very easy to developing sleep impairment and medical students are regarded as a population that is more susceptible to sleep-related problems [12, 13]. Poor sleep quality among medical students not only affects learning, memory and cognition, but also leads to physical and psychological morbidities [14].

The sleep quality of medical students has been most extensively studied and discussed in China, it may be caused by the special background of Chinese medical education. The system of higher medical education is different all over the world [15]. Different from the single medical education system for other countries, there are several medical education systems coexistent in China including the five system levels of 3, 5, 6, 7, 8 [16]. But the system levels of 3 and 5 are the main types, so the total medical education systems are pretty shorter than other countries [16]. In addition, most of the countries strictly limit the enrollment of medical students [17]. But in China after the enrollment expansion policy of colleges and universities beginning in the end of 1990s, the amount of medical students has become more numerous which cause high employment pressure to medical students [18]. And most of them do not want to work in grassroots communities, central and western parts of China [19]. In this context, the employment pressure of medical students is rising, and it is higher than other countries [19]. To control the influence of different cultural backgrounds, races and related factors on sleep quality, only Chinese medical students were chosen in this meta-analysis.

As we know, the gold standard for the assessment of sleep quality is polysomnography (PSG) and/or electroencephalographic (EEG) spectral content analysis, but both kinds of them have practical limits in large sample studies and in field studies [20]. To overcome these limitations, self-report retrospective and prospective questionnaires have been developed. The most widely used standardized questionnaire is the Pittsburgh sleep quality index (PSQI), which is designed to measure sleep quality over the previous month [21].

The PSQI questionnaire measures subjective sleep quality, sleep habits and sleep disturbances over a 1-month period, and discriminate between normal and poor sleepers [22]. It contains 19 self-rated items with 7 components (subjective sleep quality, sleep latency, sleep duration, habitual sleep efficiency, sleep disturbances, use of sleeping medications, and daytime dysfunction). Each component is scored from 0 to 3, the seven components of the PSQI scores were then added to produce a global PSQI score ranging from 0 to 21. The PSQI has developed together with an effort to adapt it to different cultures and languages [23, 24]. It was revealed that PSQI had a good internal consistency, test–retest reliability, and high validity in Chinese [25].

Though the PSQI questionnaire has been widely used in evaluating the sleep quality of medical students in China, but the result is inconsistent. In order to derive a more precise estimation, we carried out a meta-analysis to summarize the eligible articles by increasing the sample sizes. This research will inform intervention targets and provide direction for further research into the sleep quality of medical students.

Materials and methods

Literature search and selection of studies

A systematic literature search was performed towards the electronic databases including PubMed, Web of Science (SCIE), Chinese Biomedical Database (CBM), the Chinese National Knowledge Infrastructure (CNKI), the Chinese Wanfang Database, the Chinese VIP Database and searched for studies up to March 6, 2017, using the following terms: (‘‘medical students’’ or ‘‘medical college students’’ or “medical undergraduates” or “college students” or “university students” or “undergraduates”) AND (“sleep quality” or “quality of sleep” or “sleeping quality” or “sleep disorder” or “sleep disorders” or “sleep–wake disorder” or “sleep disturbance”) AND (“PSQI” or “the Pittsburgh sleep quality index”) by two researches. On the other hand, we searched the reference to identify other potential articles.

Inclusion and exclusion criteria

To meet the analysis requirements, included studies should fulfill the following criteria: (a) studies of cross-sectional observational design; (b) sleep quality was assessed by Pittsburgh sleep quality index (PSQI); (c) two groups of participants including medical students and non-medical students; (d) the study reported complete information on the mean and standard deviation of seven dimensions of PSQI; (e) studies conducted in Chinese population. Studies were excluded if they met the following criteria: (a) case reports, reviews, lectures, editorials or correspondence letters; (b) only the most recent ones with the most complete dataset were ultimately selected if there were duplicated studies; (c) difference studies presenting the same comparison(s) were excluded to prevent overrepresentation.

Data extraction and quality assessment

For studies included in this meta-analysis, a standard data extraction form was used by two reviewers and the consensus was achieved for all data. First author, year of publication, sampling methods, sampling area, sample size, male to female ratio (M:F), mean and standard deviation of seven dimensions and global PSQI scores in medical students and non-medical students were extracted. The quality of the studies included in this meta-analysis was assessed by an 11-item checklist which was recommended by Agency for Healthcare Research and Quality (AHRQ) [26]. An item would be scored “1” if it was answered “yes”; if it was answered “no” or “unclear”, then the item scored “0”. Article quality was assessed as follows: low quality = 0–3; moderate quality = 4–7; high quality = 8–11.

Statistical analysis

Data management and statistical analyses were completed with the STATA software (version 14.0; Stata Corp, College Station, TX, USA). If a study gave sample sizes, means and standard deviations separately for non-medical students with different majors, this paper combined them into a single group. Pooled weighted mean difference (WMD) and 95% confidence interval (95% CI) were used to assess the sleep quality between medical school students and non-medical school students. The heterogeneity across individual studies was evaluated with Q statistic (P < 0.10 was considered to be of statistical significance) and I 2 statistic (I 2 > 50% was considered to be of statistical significance) [27]. If there was no heterogeneity, the fixed effects model was used; otherwise, the random effects model was used [28, 29]. To identify potential sources of heterogeneity, subgroup analysis were carried out based on male to female ratio and study area. Publication bias was tested by Begg’s and Egger’s test. [30, 31]. Sensitivity analyses were performed by omitting one study each time to reflect the accuracy and stability of the analytic results. Statistical significance was defined as two-sided P value less than 0.05.

Results

Characteristics of studies

The initial search yielded 2232 articles, 2221 of them were rejected after further screening. Finally, a total of 11 full text articles satisfied the inclusion and exclusion criteria, including 7154 medical students and 9073 non-medical students [32,33,34,35,36,37,38,39,40,41,42]. All of them were performed in Chinese population. Figure 1 describes the procedure for study selection. The main characteristics of eligible studies are shown in Table 1.

Fig. 1
figure 1

Flow diagram of the study selection process examining the sleep quality of the medical students

Table 1 Characteristics of eligible studies included in the meta-analysis

Quantitative data synthesis

The main results of the meta-analysis are summarized in Table 2. The sleep quality of medical students was significantly higher in scores of subjective sleep quality (WMD, 0.137; 95% CI 0.049–0.224; P = 0.002), sleep duration (WMD, 0.147; 95% CI 0.069–0.225; P < 0.001), daytime dysfunction (WMD, 0.194; 95% CI 0.077–0.311; P = 0.001) and the total scores (WMD, 0.524; 95% CI 0.152–0.896; P = 0.006). On the contrary, there was no significant difference in sleep latency, habitual sleep efficiency, sleep disturbance and use of sleeping medications. The forest plots are shown in Figs. 2, 3, 4, 5, 6, 7, 8, 9.

Table 2 Combined results in this meta-analysis of sleep quality between medical students and non-medical students
Fig. 2
figure 2

Forest plot of subjective sleep quality between medical students and non-medical students

Fig. 3
figure 3

Forest plot of sleep latency between medical students and non-medical students

Fig. 4
figure 4

Forest plot of sleep duration between medical students and non-medical students

Fig. 5
figure 5

Forest plot of habitual sleep efficiency between medical students and non-medical students

Fig. 6
figure 6

Forest plot of sleep disturbance between medical students and non-medical students

Fig. 7
figure 7

Forest plot of use of sleeping medications between medical students and non-medical students

Fig. 8
figure 8

Forest plot of daytime dysfunction between medical students and non-medical students

Fig. 9
figure 9

Forest plot of the total scores of PSQI between medical students and non-medical students

Table 3 presents the results of subgroup analysis. Because of the limit information provided by the original studies, subgroup analysis based on male to female ratio was performed in our study. It was divided into two groups according to the characteristic of the data. The first one was the group of gender imbalance that the difference between male to female ratio was greater than 0.15, the other one was the group of gender balance which was considered that the difference between male to female ratio was less than or equal to 0.15. It showed statistical significance in subjective sleep quality, sleep duration, sleep disturbance, daytime dysfunction and the total scores in the group of gender imbalance. And there was no significance in sleep latency, habitual sleep efficiency and use of sleeping medications. It also showed statistical significance in subjective sleep quality, sleep duration, daytime dysfunction and the total scores in the group of gender balance. And there was no significance in sleep latency, habitual sleep efficiency, sleep disturbance and use of sleeping medications.

Table 3 Subgroup analysis of the sleep quality between medical students and non-medical students

Publication bias and sensitivity analysis

Publication bias was not detected in Begg’s tests except for sleep duration (Table 2). Egger regression tests did not suggest a significant publication bias for habitual sleep efficiency and use of sleeping medications. However, Egger tests indicated a significant publication bias in other components and the total scores (Table 2). Sensitivity analysis sequentially excluding each individual study did not significantly change the pooled WMDs except for the use of sleeping medications and the total scores (Figs. S1–S8). When the research of Wei [34] was excluded, the result changed to WMD, −0.019; 95% CI −0.036 to −0.002 for the study of use of sleeping medications and WMD, 0.450; 95% CI −0.0004–0.900 for the total scores.

Discussion

Medicine is recognized as one of the most stressful fields of education and medical students are regarded as a population that is more susceptible to sleep-related problems [13, 43]. Purim KS’s study conducted in Brazil showed the total scores of PSQI was 5.90 ± 2.39 in medical students [44]. However, the mean global PSQI score of Chinese medical students was around 6.16 in most of the studies included in this meta-analysis. The components of PSQI and the total scores of Purim KS’s study were better than most Chinese medical students except for use of sleeping medications. The study conducted in Lithuanian showed that the global PSQI scores was 7.3 ± 2.9 for medical students, it was worse than most of the medical students in China [45]. A study conducted by Giri P revealed that the global PSQI scores were 5.28 ± 2.39 in Indian medical undergraduates [46]. Most of the studies included in this meta-analysis were worse than this study. Because of the different cultural backgrounds, the PSQI of medical students make a great difference all over the world.

In this circumstance, we conducted this study to learn the sleep conditions of Chinese medical students comprehensively. As far as we know, this meta-analysis represents the first effort to systematically assess the sleep quality of medical students.

Based on 11 cross-sectional studies, this meta-analysis revealed that the sleep quality of Chinese medical students was significantly worse in respect of total scores, subjective sleep quality, sleep duration and daytime dysfunction domains. Whereas, no significant difference was observed in sleep latency, habitual sleep efficiency, sleep disturbance and use of sleeping medications. However, the sensitivity analysis changed the results of use of sleeping medications and the total scores. When the study of Wei [34] was excluded, the results of them had been changed. It might be caused by the large sample size of Wei’s study. There were only 7154 medical students; the study of Wei occupied 2368. In addition, the authors compared the sleep quality of medical students with engineering students. But the male to female ratio was seriously unbalanced between two groups. In the medical students, the male to female ratio was 590:1778; nevertheless, it was 1038:604 in the engineering students. So, the results revealed by this meta-analysis should be interpreted with caution. Because of the high heterogeneity detected in this meta-analysis, we conducted subgroup analysis to explore the source of heterogeneity. Poor sleep quality is often attributed to a range of modifiable, and non-modifiable factors [47]. Gender is identified as a vital role among non-modifiable factors, as many studies report a higher rate of sleep problems in females [48]. But owing to the insufficient information, we could only carry out subgroup analysis based on the male to female ratio. According to the characteristic of the data, studies were divided into two groups (the group of gender imbalance and the group of gender balance). The group of gender imbalance still had high heterogeneity and results showed that the sleep quality of medical students was significantly worse in terms of subjective sleep quality, sleep duration, sleep disturbance, daytime dysfunction and the total scores domains. But we should make conclusions with caution on account of the unbalance male to female ratio. However, in the group of gender balance, the heterogeneity got much smaller. On this basis, we speculated that the unbalance of male to female ratio might be a source of the heterogeneity. In this group, the results had statistical significance in terms of subjective sleep quality, sleep duration, daytime dysfunction and total scores domains. There was still no significance in sleep latency, habitual sleep efficiency, sleep disturbance and use of sleeping medications. And the results in the group of gender balance were consistent with the overall results.

Meta-analysis is known as a method to increase the sample size and statistical power for a more reliable result through combining comparable articles. In this meta-analysis, it was revealed that the sleep quality of medical students was worse than non-medical students in some domains. University students should deal with stressors, such as long class schedules, overloaded lessons, and academic performance concerns [49]. Medical students are often thought to have more stressors than most of non-medical students like less free time, longer courses, and longer working hours [50]. When they feel difficult to manage the stress, these stressors may lead to sleep disorders in them [51]. Because of the highly demand of professional and academic requirements, medical students have to reduce their sleeping hours to achieve their desired goal [52]. Due to the reduction of sleeping hours, the sleep duration of medical students was worse than non-medical students. Due to the lack of sleep, they feel tired and sleepy; so the daytime dysfunction and subjective sleep quality are worse. Though their sleeping time is short, but the time they spend to fall asleep have no significance compared with non-medical students and there is no other special thing to influence their sleeping. In a word, the results are mainly caused by the insufficient sleep hours. Though, medical students want to achieve their goal through the reduction of sleeping hours. It is also reported that poor sleep quality has been correlated with worse academic achievement before exams [53]. And poor sleep quality frequently co-occurs with mental disorders. So, effective measures are needed to improve the situation in this population. The stress should be reduced in this population. Teachers and parents should tell them the right way to face the pressure. And medical students should correct their attitudes towards various stressful things.

This study provided the evidence of sleep quality of medical students was worse than non-medical students in China. But, results in the present meta-analysis should be interpreted with caution due to the following limitations. Firstly, publication bias was detected in our meta-analysis. Secondly, only eleven Chinese published studies were ultimately included in this meta-analysis. We undoubtedly missed some potential studies in other languages, which may be one reason for the publication bias. Thirdly, this study was based on individual unadjusted estimates without adjusted for potentially confounding factors such as gender, grade, body mass index (BMI), family background and other factors. Fourthly, only cross-sectional studies were adopted which was not sufficiently powerful to demonstrate a causal relationship.

In conclusion, this meta-analysis suggests that the sleep quality of medical students is worse than non-medical students in China. Given the limitations of the studies included in the meta-analysis, further studies with more well-designed investigations are required to validate our results.