Introduction

The prevalence of type 2 diabetes (T2D) is increasing rapidly worldwide. The number of people living with T2D has almost quadrupled from 108 million in 1980 to 422 million in 2014 [1]. Individuals with diabetes may develop serious complications including cardiovascular diseases, nerve damage, kidney damage, and eye problems [2]. To determine the causes of T2D, numerous studies have been conducted to explore the potential linkages between diet and T2D.

Short-term trials have shown that diets high in protein were beneficial for weight loss and glucose homeostasis [3,4,5], which may play an important role in the development of T2D. Therefore, increasing dietary protein intake seems to be a promising strategy for preventing T2D. However, long-term prospective cohort studies suggested that animal-sourced foods with high protein, such as red and/or processed meat, were positively related to diabetes mellitus risk [6, 7], whereas, plant-based high-protein foods, such as nuts and legumes, were associated with lower risk of diabetes mellitus [8, 9]. Based on such evidence, a hypothesis was postulated that whether dietary protein per se is independently associated with risk of T2D given the evidence that protein theoretically could influence the development of T2D through the mechanism of insulin secretion and resistance [3, 4].

Recently, several prospective studies have focused on the effect of dietary protein and risk of T2D [10,11,12,13,14]. Nonetheless, results were inconsistent and not yet fully elucidated. To our knowledge, the dose–response associations of total protein and protein subtypes (animal and plant protein) with T2D risks have not been systematically synthesized, thus, we performed a meta-analysis using prospective studies to quantify the dose–response relationship.

Methods

Search strategy

We conducted and reported the present study according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [15]. Comprehensive literature search was performed by searching databases of PubMed, Embase, Web of Science, and Wiley Online Library on 20th March 2018 using the following keywords: (protein intake or dietary protein or protein consumption) and (diabetes or diabetes mellitus) and (cohort studies or follow-up studies or longitudinal studies or prospective studies or nested case–control studies or case–cohort studies). Detailed search terms on each specific database were presented in online supplementary materials, Table S1. No restrictions on language or year of publication were applied. Articles included had to: (1) assess dietary protein as exposure of interest; (2) report T2D incidence as outcome; (3) use a prospective design in general population and a quality score ≥ 6; (4) provide estimates of relative risks (RR) or hazard ratios (HR) and corresponding 95% confidence intervals (CI) or data necessary to calculate these estimates. We furthermore manually searched the reference lists of included studies, all relevant reviews, and meta-analysis to identify additional articles that might be missed in the primary literature review. If duplicate publications in the same population were found, the most recent or the one with most applicable information was selected.

Data extraction and quality assessment

Two investigators (L.-G.Z and Q.-L.Z) independently reviewed the identified publications and evaluated their relevance to the research topic based on the prespecified inclusion criteria. For eligible studies, both of the two authors carried out the data extraction and quality assessment. Discrepancies were resolved through consensus of the two authors. A standard data extraction table was developed to obtain the following information: first author’s last name, year of publication, country in which the study was conducted, study name, baseline years, years of follow-up, overall number of participants, number of T2D cases, gender, age range of study population at recruitment, protein assessment methods, methods of identification of T2D cases, type of dietary protein, median or mean dietary protein intake in each category and the RR and 95% CI of T2D incidence related to those categories of protein intake, and covariates included in multivariable models. If more than one estimate was provided, priority was given to the one with the most adjusted potential confounding factors. One study [16] provided results with biomarker-calibrated or uncalibrated. To be consistent with other studies, we chose uncalibrated HRs in the current analyses. The Newcastle–Ottawa Scale (NOS) for cohort study based on selection, comparability, and outcome assessment with a full score of 9-star was used to assess the study quality [16].

Statistical analyses

To investigate the association between protein intake and T2D risk, we not only calculated pooled RR for the highest versus the lowest categories of protein intake, but also investigated the dose–response association between protein intake and T2D risk. We used random-effects models proposed by DerSimonian and Laird [17], which incorporated both within- and between-study variability to combine the study-specific risk estimates.

In dose–response analysis, we both estimated the RR and 95% CI for each increment of 5% energy from protein intake and explored potential nonlinear associations. For each study, the trend from the correlated log relative risks across categories of protein intake was calculated using the method proposed by Greenland et al. [18] and Orsini et al. [19]. The method requires more than two exposure categories and the following information for each category should be available: (1) the number of cases and total number of participants or person-years, (2) the RR and corresponding 95% CI, and (3) the mean or median protein consumption. When protein intake was presented in g/day, we transformed it into percent of energy using the energy 1 g of protein provides (4 kcal/g) and the average daily energy intake of the population. We assigned the median or mean protein intake of each category to the corresponding risk estimates of each study. For nonlinear associations, we used a two-stage, random-effect dose–response meta-analysis by modeling protein consumption using restricted cubic splines with three knots at fixed percentiles (10, 50, and 90%) of the distribution [19, 20]. We first fitted a restricted cubic spline model into each set of relative risks within the specific study [19, 20] and then combined the two regression coefficients and the variance/covariance matrices for each study using multivariate random-effects model [21]. A P value for nonlinearity was calculated by testing whether the coefficient of the second spline was equal to zero [22].

Heterogeneity among studies was assessed using the Q test and I2 statistic. A P < 0.10 for Q test or an I2 > 50% for I2 statistic was used to define heterogeneity [23]. We evaluated small study bias, such as potential publication bias by visual inspection of funnel plots and using Egger’s test [24]. In addition, a sensitivity analysis was conducted by reanalyzing the pooled estimates after excluding one study at a time to test the robustness of the result. We also explored whether the gender, specific adjusted covariates, and FFQ types had impacts on the main results. All statistical analyses were conducted using Stata, version 13.0 (Stata Corp, College Station, TX, United States). Two-sided test with P value of less than 0.05 was considered to be statistically significant if not specified.

Results

Literature search

A flowchart (Fig. 1) presents the process of study selection. Briefly, we identified a total of 368 records in PubMed, 436 in Embase, 269 in Web of Science, and 101 in Wiley Online Library, among which 389 records were excluded, because they are duplicates. After a review of title and abstract, we further removed 745 records because of the violation of prescribed inclusion criteria. Four articles were identified through manual search of the reference lists. Among the remaining 44 articles for full-text review, 36 were excluded due to the reasons listed in Table S2. Finally, eight articles with ten studies (one article provided results of three separate studies) including 440,418 participants and 34,221 T2D cases were included in the current meta-analysis.

Fig. 1
figure 1

Systematic identification of the published literature on dietary protein intake and type 2 diabetes

Included study characteristics

Table 1 shows the characteristics of included studies. Of the included ten prospective studies, five were conducted in the United States [14, 25, 26], two in Australia [27, 28], two in Europe [13, 29], and one in Japan [30]. Nine studies analyzed the effect of total protein and eight studies analyzed the effect of protein subtype (animal protein and plant protein). Four studies used interview-based food-frequency questionnaires [25, 26, 28, 31], and one used 4-day food record [32] and another five used self-administrated food-frequency questionnaires [14, 27, 30] to obtain information on the protein intake. T2D case ascertainment was through self-report in six studies, of which five studies with combination of registry information and another one with physician-confirmed diagnoses. All studies controlled age, sex, body mass index (BMI), physical activity, smoking, and alcohol use. Furthermore, most studies additionally provided results that did not adjust BMI. Most studies adjusted energy intake (n = 9) and family history of T2D (n = 7). As displayed in Table S3, all studies included in the current analysis were of high quality (All quality scores ≥ 6).

Table 1 Characteristics of prospective cohort studies included in meta-analysis on associations of dietary protein intake with incidence of type 2 diabetes

Highest versus lowest category

Figure S1 shows the relative risks of T2D related to protein intake by comparing participants in the highest category with those in the lowest category. Total protein and animal protein but not plant protein were significantly associated with higher T2D risk no matter whether BMI was adjusted (Figure S1). Apart from plant protein when BMI was not adjusted, no significant between-study heterogeneity was found for other exposures. We did not find evidence of publication bias in above associations (Fig. S3). In sensitivity analysis, the results were similar to the overall estimate when we excluded one study at a time (Fig. S5).

Dose–response meta-analysis

All included studies provided sufficient information for dose–response analyses. In models that adjusted for BMI, total protein and animal protein consumption showed a significantly positive association, while plant protein presented a borderline inverse correlation with T2D incidence in a linear fashion (Fig. 2a). For a 5% of energy increment from protein, the relative risk of T2D was 1.09 (95% CI 1.04, 1.13) for total protein, 1.12 (95% CI 1.08, 1.17) for animal protein, and 0.86 (95% CI 0.75, 1.00) for plant protein. Variances in heterogeneity between studies were 42.0% (P = 0.078), 14.0% (P = 0.317), and 34.2% (P = 0.144) for total, animal, and plant protein accordingly. No evidence of publication bias was detected (Fig. S2). Sensitivity analysis with one study at a time was excluded which showed that the pooled relative risks were not materially influenced by a single study (Fig. S4). When we excluded Tinker’s study which failed to adjust energy intake, the results remain stable and robust (Pooled RRper 5% of Energy from protein = 1.07, 95% CI 1.02, 1.12; I2 = 29.8%, P = 0.180). When we restricted our analysis in studies with use of semi-FFQ, the results did not change materially. The sex-specific results were also consistent with our main analysis. To determine whether animal or plant protein per se can exert its effect on T2DM, we analyzed studies with models in which animal or plant protein was mutually adjusted. However, the results seem to be in consistence with main analyses. These sensitivity results are provided in Table S4.

Fig. 2
figure 2

Prospective associations of dietary protein intake with incident type 2 diabetes for per 5% of energy increase with (a) or without adjusted for body mass index (b)

By combining studies that provided results before adjustment for BMI (seven studies for total protein; six studies for animal protein and plant protein), an increased intake of 5% of energy was related to a 36% (95% CI 1.23, 1.52) higher risk of T2D for total protein and a 32% (95% CI 1.27, 1.36) higher risk for animal protein (Fig. 2b).

In models failed to adjust BMI, we observed statistically significant nonlinear associations of T2D risk with total and animal protein intake (Fig. 3b, d). However, these nonlinear relationships became insignificant when adjusted for BMI (Fig. 3a, c). In addition, we observed a significant nonlinear association of T2D risk with plant protein intake in both models with BMI adjusted and BMI not adjusted (Fig. 3e, f). With the increase of dietary plant protein intake, a U-shaped relationship was observed for risk of T2D with the maximum reduction occurred at about 6% of energy from plant protein intake. However, with further increase in consumption, the relative risk tended to be closer to the null.

Fig. 3
figure 3

Dose–response analysis of the association between dietary protein intake and incident type 2 diabetes with (a, c, e) or without adjusted for body mass index (b, d, f). Total protein (a, b); animal protein (c, d); plant protein (e, f)

Discussion

To our knowledge, the present study is the first time to quantitatively assess the T2D risks related to total protein and protein type from prospective studies. In models that adjusted for BMI, we found that a 5% of energy increase in consumption of total and animal protein was related to a 9 and 12% higher risk of T2D, respectively. For plant protein, we observed a statistically significant nonlinear association with the largest risk reduction at intake of about 6% of energy from plant protein. These associations attenuated but remained statistically significant compared to the results without BMI adjusted.

Results from the current study were in agreement with the previous studies on T2D risk related to diets high in protein. A meta-analysis showed each 100 g unprocessed red meat per day and each 50 g processed red meat per day increase were associated with 19% (95% CI 1.04, 1.37) and 51% (95% CI 1.25, 1.83) higher risk of T2D, respectively [6]. Summary results including six eligible observational studies on nuts and T2D suggested that nuts were inversely associated with T2D (RR 0.87; 95%CI 0.81, 0.94) [8]. In addition, another meta-analysis on dietary patterns found that the Mediterranean diet characterized by high plant-based food was associated with 23% (95% CI 0.66, 0.89) lower risk of developing T2D by comparing the upper and the lowest available centile [33]. For western diet high in animal food, the relative risk was 1.41 (95% CI 1.32–1.52) for people in the highest category compared with the lowest category [34]. Although we cannot rule out the possibility that these observed associations were related to other components in foods, our results indicated these relationships were attributed at least partly if not all to the protein consumption.

The biological mechanism of dietary protein impact on T2D risk was largely unknown. There are several potential explanations for the observed associations of total protein and protein sources with T2D development. Studies suggested that dietary protein may increase glucagon, which is a contributor to high blood glucose level [35]. In addition, dietary proteins are also known to promote the secretion of insulin, while hyperinsulinemia is a risk factor for insulin resistance [4, 36]. The discrepancy in effect between animal protein and plant protein may be determined by the difference of amino acid composition. In metabolomics studies, branched-chain and aromatic amino acids, such as leucine, tyrosine, and phenylalanine, were found to be positively related to incident T2D [37, 38]. Although these amino acids exist in all high-protein contents food, they mainly present in animal food like meat and dairy [39]. All the above-mentioned biological effect might explain the positive association of total and animal protein with T2D risk as well as the different impact between animal protein and plant protein.

Recommendations of dietary intake of protein for diabetics should not exceed 20% of energy intake in U.S. [40], but the true relationship between protein consumption and T2D remains unknown. The current study improved the evidence that total and animal protein should be considered for T2D prevention in general population and provided the ideal daily intake amount of plant protein for T2D prevention according to its dose–response association with T2D risk. In addition, studies have previously suggested that high intake of total and animal protein was positively related to mortality risk, while plant protein inversely was associated with risk of deaths, which was also partly supported our findings [41, 42]. As all except one studies included in the present meta-analysis were conducted in countries with a Westernized lifestyle and food source and consumption level of protein varied among people in different places, future studies should explore the relationship between dietary protein intake and health outcomes in other geographical locations. Furthermore, studies investigating its associations with other diseases such as cardiovascular diseases and cancer are needed to assess the effect of protein comprehensively.

Strengths of the meta-analysis included the prospective cohort design of original studies, which greatly reduced the possibility of recall bias. A large number of cases and participants provided sufficient statistical power to figure out the association between dietary protein and incident T2D. Though two studies have summarized the associations between dietary proteins and T2D, they just provided results from comparisons of high- and low-protein consumers [28, 43]. In the current analysis, we provided a comprehensive and quantitative analysis and broadened the evidence on this topic. Finally, studies included in this meta-analysis adjusted most of the known potential confounding factors to rule out substantial amount of confounding bias. For total protein intake, all included studies except for Tinker’s [26] adjusted energy and fat intake in the multivariable model, which means that the impact of dietary protein can be interpreted as the effects of substituting dietary protein for dietary carbohydrate. When we only included studies using substitution models, results did not change materially compared with our main analysis.

Some limitations also should be considered. First, residual confounding still might exist though most confounders were taken into consideration in original studies. For example, the observed association between protein and T2D risk might be caused by other factors related to protein intake such as iron and nitrates in protein-rich food. Second, the potential role of obesity as a potential mediator or confounding factor is unclear, so we presented the associations adjusting with and without adjustment for BMI in the current analysis to elucidate the possible difference. However, unfavorable effects of total and animal protein independent of obesity may exist as the direction and significance of the association unchanged with adjustment for BMI. Third, some degree of heterogeneity was detected. Differences in study locations, sex, various sources and types of protein, cohort size, and follow-up time could lead to the heterogeneity. We provided sex-specific results with no significant differences observed. When excluded Tinker’s study [16], we observed the I2 changed from 42.0 to 29.8%. Another limitation is the underlying publication bias. Although, in the primary analysis, we did not detect publication bias using a statistical method, since test power for publication bias was limited, especially when the number of studies was not many.

In conclusion, we found total protein was associated with a higher risk of T2D and this association was largely due to the protein of animal origin. Plant protein has a modest nonlinear dose–response association with T2D risk. Therefore, public health recommendations should consider the protein sources for T2D prevention.