Introduction

Gastroesophageal reflux disease (GERD) is associated with pathological reflux of acidic or non-acidic gastric contents into the esophagus and is characterized by heartburn, regurgitation, and chest pain [1,2,3,4]. Severe GERD may result in significant morbidity, impairment in quality of life, and work absenteeism, and, if left untreated, complications like peptic stricture, Barrett’s esophagus, and esophageal adenocarcinoma [5, 6]. However, the frequency, severity, and complications of GERD can vary according to the geographic and ethnic factors [7, 8]. Though GERD is a common problem in the Western population [5, 9,10,11], it may be less common in the Asian population in general and in India in particular [12,13,14].

Though multiple factors might contribute to the lower frequency and severity of GERD and Barrett’s esophagus in the Indian population, the dietary variation might be one of the most important contributors [3, 8, 15, 16]. A few earlier studies did show that non-vegetarian foods might contribute to the occurrence, symptom exacerbation, and severity of GERD [17,18,19]. Since vegetarianism is widespread in the Indian population, Indian epidemiological studies may uncover the dietary factors contributing to GERD that may have implications beyond India, as diet is a modifiable factor. However, Indian population–based data on prevalence and risk factors of GERD are scant. Moreover, easy over-the-counter availability of the proton pump inhibitors (PPI) and histamine 2 receptor antagonists (H2RA), high frequency of Helicobacter pylori (H. pylori) infection, which is known to reduce the severity of GERD, and cultural variation in the symptom reporting might also reduce the prevalence of GERD [20,21,22,23,24,25,26,27]. Since the studies on the prevalence and risk factors of GERD from India are scanty, we thought it worthwhile to undertake a meta-analysis and meta-regression on the existing population-based studies with aims (i) to evaluate prevalence of GERD in the Indian population, (ii) to evaluate its risk factors including the influence of dietary variation, and (iii) to critically analyze the lacunae in the current literature and directions for the future studies.

Methods

Search methodology, study selection, and data extraction

Two authors independently performed a literature search in PubMed (https://pubmed.ncbi.nlm.nih.gov/), Scopus, and Web of science (from January 2000 through May 2020) for abstracts matching one or more of the following keywords: “gastro-esophageal reflux,” “India,” “prevalence,” “risk factors,” and boolean operators AND/OR: (e.g. gastroesophageal reflux disease OR GERD) AND (India OR prevalence OR risk factors). This was based on the assumption that studies not including the above search terms in the abstract would not have reported the prevalence and risk factors for GERD in India. Two authors independently screened the titles and abstracts of the identified studies and further searched for the full texts of the potentially relevant publications. Studies in the English language were eligible for inclusion in the meta-analysis. The full text of the resulting papers were reviewed, and bibliographies of these were screened to look for other relevant studies. Eligibility criteria were pre-defined. Any discrepancy was resolved by consensus. After a thorough search, a total of 340 abstracts were found. After title and abstract reviews, 332 studies were excluded. The 8 studies that were selected were thoroughly reviewed, including bibliography. At this stage, one more population-based study from our group accepted for publication recently was also added [28]. Hence, nine studies were used in the final analysis (Fig. 1).

Fig. 1
figure 1

Flowchart on screening, exclusion, and inclusion of the studies for the systematic review, meta-analysis, and meta-regression

Definitions of significant GERD varied across the studies. Some studies utilized a pre-validated symptom score to diagnose and grade the severity of GERD. We recorded data for each study regarding the definition and severity criteria. Different risk factors for GERD have been assessed in various studies. All these data were extracted from relevant studies and appropriately inserted into a Microsoft Excel spreadsheet and subjected to meta-regression for the risk factors for GERD.

The sample size of the included studies ranged from 358 to 6174, with a combined total of 20,614 subjects [3, 7, 16, 20, 29,30,31,32,33] (Fig. 1). The following data were extracted from each paper: (1) author and publication year; (2) type of study (population-based or selected subjects); (3) sample size and GERD prevalence; (4) definition of GERD; (5) risk factors (age, body mass index [BMI], predominant diet, tea/coffee intake, tobacco smoking/chewing, alcohol intake) and predominant symptoms. The quality of the study was assessed by Jadad score (Table 1). The row of the table represented the data extracted from each study, and each column described the variables of interest, which were essential to compute effect sizes and plots. The proportions were transformed to make them follow a normal distribution to accurately estimate the summary proportion and increase the validity of the statistical analysis. The results were reported by converting the logits back into the proportions.

Table 1 Characteristics of gastroesophageal reflux disease included in the meta-analysis

Data synthesis and analysis

MedCalc version 14 (Warandeberg 3, 1000 Brussels, Belgium) was used to generate forest plots of pooled prevalence with 95% confidence intervals (CIs). R::metafor package version 1.9-7 of R studio (R development core team, Vienna, Austria) was used for meta-regression. Data were analyzed using both fixed and random-effects models. Heterogeneity was assessed using I2 statistics. A fixed-effect model was used in the absence of heterogeneity (I2 < 50%), whereas if I2 was > 50%, a random-effects model was used.

A meta-analysis statistically synthesizes quantitative findings and measures the average proportion of multiple studies weighted by the inverse of sampling variances using the appropriate model. The forest plot was constructed to represent the degree of variation between effect sizes across studies, and it also helped to detect the outliers. The studies were identified as outliers whose 95% confidence intervals did not overlap with the overall summary effect; if the large studies were outliers, it might result in higher heterogeneity.

Meta-regression is a statistical technique used in meta-analysis to investigate the independent contribution of the changes in the main factors on the prevalence. A univariate model of meta-regression was evaluated before multivariate model to quantify the magnitude of each continuous moderator on the study effects. A restricted maximum likelihood (REML) estimation method with backward elimination regression procedure was used in meta-regression analysis. The scatter plots were constructed to indicate the association between the moderator and effect sizes through the slope of the regression line. The significantly positive or negative slope suggested whether the explanatory variable had a significant moderating effect or not. The I2 statistic quantified the overall heterogeneity of the model that could be accounted for by the real differences between studies; thus, subgroup analysis or meta-regression applied to identify potential moderating factors explained the inconsistencies between effect sizes across the studies.

Results

The search strategy initially revealed 340 studies, of which 324 were excluded after an initial review, and the remaining 16 were reviewed in detail for eligibility. Eight of these studies were excluded due to various reasons, and the remaining eight were eligible for the meta-analysis [3, 7, 16, 20, 29,30,31,32] (Fig. 1). Besides, an original population-based survey from the author’s group accepted for publication recently was also added [28]. Characteristics of all the studies included in the current meta-analysis, including the criteria to diagnose GERD, are presented in Table 1. The study cites are shown in Fig. 2. Four studies were population-based [7, 16, 30], and the other five were on selected populations such as medical students, patients’ attendants, doctors, nursing staff, and hospital employees [3, 20, 29,30,31](Table 1). The nine studies included 20,614 subjects, with the prevalence of GERD ranging from 5% to 28.5%. Different criteria were used for diagnosis of GERD in different studies, most of which are quite standard methods used to diagnose GERD in epidemiological surveys except for two studies, one by Bhalaghuru et al. and the other by Chowdhury et al. (Table 1) [7, 29]. The estimated prevalence of GERD in the Indian population was 14.15% (95% CI 9.9–19.7%) (Fig. 3). Analysis revealed high between-study variability (I2 98.9%). The model automatically selected the random-effects model in the presence of substantial heterogeneity. Thus, we have conducted a moderator analysis to examine how characteristics of studies were related to variation in the effect sizes across studies. Researchers have found different risk factors to be significant in GERD prevalence and severity. The risk factors found to be associated are depicted in Table 2. Briefly, three studies reported that smoking was not a risk factor; two showed that the younger population had GERD less often. A significant increase in GERD score was observed with higher BMI in five studies, non-vegetarian diet in three studies, with living in urban areas in two studies, with infrequent milk consumption in one study, with an increase in age in four studies, with inadequate sleep in one study, with excessive consumption of alcohol in two studies, with an increase in tea/coffee intake in four studies, and with an increase in intake of carbonated drinks in two studies.

Fig. 2
figure 2

The study cites and prevalence of gastroesophageal reflux disease (GERD) in India

Fig. 3
figure 3

a The forest plot shows the prevalence of gastroesophageal reflux disease (GERD) in India. The squares represent individual studies and the size of the square represents the effect size of each study included in the meta-analysis. The pink horizontal line with each square represents confidence limit of each study. Diamond represents pooled prevalence (fixed and random effect). b Funnel plot showing the publication bias of prevalence studies on GERD

Table 2 Meta-regression of the risk factors of gastroesophageal reflux disease

The unadjusted meta-regression model was constructed for the effect of confounders. The estimated prevalence were 17.6 (95% CI = 10.07–29.04), 13.9 (95% CI = 8.60–21.89), and 15.4 (95% CI = 10.71, 21.66) for the subgroups (variable study type with categories; population-based/-selected population) and overall group of studies, respectively. The separate random-effects models were fitted for the subgroups of study population variable, and the difference between the two subgroup summary estimates was not significant (p = 0.522). Thus, the study population is not a significant moderating variable.

In the meta-regression model, sample size (p = 0.005) explained about 50% of the heterogeneity. A greater prevalence of GERD of 28.5% and 22.3% was noted in studies published in southern part of India [29, 30]. The slope of the estimated regression line of the scatter plot is neither completely horizontal nor very steep, suggesting a weak association with age, non-vegetarianism, and observed effects. By visual examination of scatter plots, we observed that nearly half of the studies fell outside of 95% CI bounds, which suggested that there might be more than one factor that could account for the heterogeneity in the effect sizes. The age (p = 0.53) and non-vegetarianism (p = 0.327) were found to be insignificant moderators. Similarly, the effects of other continuous moderators, BMI (kg/m2) (p = 0.771), publication year (p = 0.380), non-vegetarian diet (p = 0.327), tea/coffee intake (p = 0.935), tobacco use (p = 0.337), and alcohol consumption (p = 0.881), were evaluated through model and visually inspected through scatter plots, suggesting insignificant association of moderators with the observed effect sizes (proportion of GERD) (Fig. 4a–f).

Fig. 4
figure 4

Scatter plots of effect size (proportion of gastroesophageal reflux disease [GERD]) vs. a potential risk factor; a sample size, b publication year, c age (in years), d non-vegetarian diet, e tobacco consumption, and f alcohol consumption. The symbol for individual study is sized proportionally to the precision of the study (the larger the circle, the more precise is the estimated effect) and the dotted line represents the regression line for the analysis

Discussion

We conducted a meta-analysis and meta-regression assessing the prevalence and risk factors of GERD in the Indian population. The pooled prevalence of GERD in the Indian population was found to be 15.57, though there was significant heterogeneity in the studies. The risk factors for GERD included age, BMI, non-vegetarian diet, tea/coffee intake, tobacco, and alcohol consumption. To the best of our knowledge, this is perhaps the first Indian population–based meta-analysis and meta-regression on the prevalence and risk factors of GERD.

GERD is a common condition worldwide (Fig. 5) [34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57]. A systematic review and meta-analysis by Nirwan et al. demonstrated substantial variations in the pooled prevalence of GERD in different geographical regions and countries [58]. The prevalence of GERD varied from 8.7% to 33.1% in the Middle East, 11.6% in Australia, 23.0% in South America, 2.5% to 7.8% in East Asia, 8.8% to 25.9% in Europe, and 18.1% to 27.8% in North America (Fig. 5) [59]. The prevalence of GERD has been reported to be considerably higher in North America (19.5%) and Europe (25.9%). A high prevalence of GERD was reported in South America in the study by El-Serag et al., though another study by Nirwan et al. did report a lower pooled prevalence (12.8%) [59]. In a recent study, prevalence (age-standardized) was highest in the USA, Italy, Greece, New Zealand, north Africa and the Middle East, eastern Europe, Latin American, and the Caribbean countires [60]; in contrast, the prevalence was lowest in Asia Pacific, east Asia, Iceland, France, Denmark, and Switzerland [60]. Considering the variation in the population prevalence of several risk factors for GERD (including those revealed in this meta-analysis) in Western population as compared with the Indian population, difference in prevalence of GERD in different countries is not entirely unexpected [61]. These risk factors include intake of fatty and non-vegetarian foods, carbonated drinks, coffee/tea, high level of urbanization, and presence of obesity. In fact, several studies did support the fact that variation in the frequency of GERD in Asian and Western countries might be related to the frequency of different risk factors in different population [30, 5, 8, 18, 23, 38, 43, 54, 58, 59, 62]. Risk factors for GERD reported in the studies from the Western countries included age (35–59 years), use of non-steroidal anti-inflammatory drugs (NSAIDs), lower income, socioeconomic status, urbanization, genetic factors, family history of GERD, obesity, and consumption of carbonated drinks. Variation in prevalence in GERD has even been reported in different regions within the same country. For example, in the current systematic review, several studies from the southern part of India demonstrate a higher prevalence of GERD (22.2–28.5%) [29, 30] than those reported from the northern parts of India (11–18%; Fig. 2) [16, 20]. One of the reasons for such variation in prevalence of GERD in the southern and northern parts of India might be the differences in the diet in the two regions. Southern Indian diet is typically rice-based, with excess spice, tamarind, and non-vegetarian foods; in contrast, the northern Indian diet is wheat-based and less spicy and vegetarianism is common [63].

Fig. 5
figure 5

Prevalence of gastroesophageal reflux disease in different parts of the world and the reported risk factors

There are quite a few other factors that might explain regional differences in prevalence of GERD in different parts of the world. Differences in the definition/criteria used and ethnic and cultural variation on symptom reporting may affect the prevalence of GERD. Only a few studies used translated-validated questionnaire for assessing the symptoms [28]. House-to-house survey is known to show lower prevalence of functional GI disorders than internet survey [64]. H. pylori infection is known to reduce gastric acid secretion that may reduce the frequency and severity of GERD [13, 65]. Since the prevalence of H. pylori infection is high in India, this might contribute to the lower frequency of GERD in Indian population [13].

This is perhaps the most comprehensive meta-analysis and meta-regression study in Indian population on prevalence and risk factors of GERD. We pooled data from four population-based studies and five studies on selected subjects (medical students, patients’ attendants, doctors, nursing staff, and hospital employees) to analyze the prevalence of GERD [3, 7, 8, 16, 20, 29,30,31,32]. We have also identified that several earlier published studies used different survey methods and definitions and online instruments (questionnaires) to collect data from subjects, leading to a wide range in GERD prevalence. Most of the epidemiological studies on GERD in India are house-to-house surveys [64]. Furthermore, we have also investigated how risk factors affect the prevalence of GERD using meta-regression approach.

This meta-analysis, however, has a few limitations. Significant heterogeneity in the included studies is a limitation. The observed heterogeneity in the current study could be because of the different criteria used for diagnosis of GERD, selection criteria, and lack of controls. A number of studies are also small. In fact, meta-regression should ideally be conducted when the number of studies included in the meta-analysis is at least 10 [66]; there were 9 studies in the current meta-analysis. The diagnosis of GERD is based on symptoms alone. The gold standard for diagnosis of GERD is 24-h pH impedance monitoring rather than symptoms alone [13]. Several other conditions may manifest with symptoms of heartburn such as functional heartburn, esophageal hypersensitivity, and esophageal motor disorders [13]. All the reported studies are not population-based. In spite of all these limitations, we believe that the current meta-analysis does provide important information on epidemiology and risk of GERD in Indian population and suggests the future areas of research on this subject.

In conclusion, this meta-analysis showed the pooled prevalence of GERD in Indian population to be 15.57%, though there was significant heterogeneity in the studies included in this meta-analysis. On meta-regression, we found that age, BMI, non-vegetarian diet, tea/coffee intake, tobacco, and alcohol consumption are the risk factors for GERD in Indian population. These results may have several implications in clinical practice and research. The knowledge on risk factors may be used for public education to prevent GERD. Educating patients with GERD may help them to modify their lifestyle to reduce the severity and complications of GERD.