Introduction

Guillain-Barré syndrome (GBS) consists of a wide spectrum of acute immuno-mediated polyneuropathies characterized by acute-subacute development of muscular weakness with hyporeflexia or areflexia [1, 2].

Overall annual incidence of GBS in the world is between 1 and 2 cases for 100,000 people. In the USA, it has been estimated between 1.65 and 1.79 cases for 100,000 people [3]. The incidence of GBS seems to show a seasonal trend over the year [4]. This seasonality may depend on several environmental, climatological and demographic factors [5]. For example, it is known that particular climate conditions favour seasonal gastrointestinal and respiratory tract infections, which are recognized as important antecedent events [6].

Nowadays, technology plays a crucial role in our society. New interest is arising in the use of digital technologies in the field of epidemiology. Google Trends (Alphabet Inc., USA) is a publicly available online tool aimed to analyse Google search keywords popularity over time and space, assessing role of factors such as country, region, related keywords and topics. This tool has a good potential in today’s epidemiology, which has been explored by some recent studies. It has been shown that search peaks for seasonal influenza reflect data deriving from traditional control systems and may even precede them in detecting disease outbreaks [7]. In other studies in the field of neuro-epidemiology, Google Trends (GT) data confirmed the existence both of a latitude gradient and a seasonal trend in the incidence of multiple sclerosis, as previously observed by traditional epidemiology [8, 9].

Aims

The primary aim of this work is to use GT search results to analyse seasonal variation in GBS web search popularity in the USA, overall and in geographic subregions; we are also interested to understand if any relationship exists with the data coming from traditional epidemiology.

Material and methods

We conducted a retrospective web-based research to investigate whether a seasonal trend of GT search volumes for ‘Guillain-Barré Syndrome’ exists in the USA. We included searches from January 2008 to December 2017, analysing the monthly average over 10 years to smooth out any outlier peaks; we further refined the search analysing the ten most populous US states as of April 2010 Census (CA, TX, FL, NY, PA, IL, OH, GA, NC, MI) and grouping them by geographic regions (Pacific, Midwest, South West, South East, North East).

The raw index publicly provided by GT is RSV (relative search volume), with maximum monthly number of queries during the specified timeframe set to 100. GT provides also a regional interest index for the whole timeframe analysed, set to 100 for the region with maximum search frequency per unit of population. To integrate measures from different states, we calculated weighting coefficients based on US state population; then we linearly transformed data with maximum regional monthly peak set to 100.

We performed generalized estimating equation (GEE) analysis with search volume (for US overall and by geographic region) as dependent variable, year and month as independent variables and year as grouping variable. Post hoc Wald tests with adjustment for multiple comparisons were used to test significance of associations. Then we collected mean monthly temperatures for each state in 1981–2010 interval and tested association with RSV. These data were collected from publicly available repository from National Oceanic and Atmospheric Administration; data from the largest city of each state were used as proxy.

Privacy issues

Data coming from GT are completely anonymous, and no kind of personal information which may recall to individual web users or patients is retained by the system. Full privacy policy is available at: http://www.google.com/privacypolicy.html.

Results

Average RSV for GBS during the analysed timeframe (2008 to 2017) was 38.7 ± 8.9 arbitrary units. Analysing the USA overall with a GEE model, we verified the absence of outlier years, since no significant effect of year was observed (p = 0.83) on RSV. Analysing 10-year RSV average for each month, association with month of the year was highly significant for October (+ 13.74 units, 95% CI 5.60 to 21.88, p = 0.002), which was also an extreme outlier (> 3·IQR; Fig. 1a). The preceding and following month displayed the second highest degree of anomaly (September + 4.88 [95% CI − 0.46 to 10.22], p = 0.17; November + 6.02 [CI − 1.75 to 13.80], p = 0.22) compared to all remaining months of the year.

Fig. 1
figure 1

Google Trends relative search volume (RSV) for Guillain-Barré syndrome, assessed grouping by month (a) and by month-to-month mean temperature variation (b)

Analysis by geographic US regions confirmed the observation of October peak (p < 0.03 in all regions). This peak clustered with significant September subpeak in some regions (Pacific, Midwest); in these two regions, a secondary spring peak was observed too (April in Pacific, + 3.45 [95% CI 0.15 to 6.76, p = 0.04]; February in Midwest, + 5.14 [CI 0.97 to 9.31], p = 0.016).

For each state analysed, we correlated the mean monthly temperature variation (ΔT °C) from previous month to the monthly average RSV for GBS over years 2008–2017 (Fig. 1b). We observed a J-shaped trend, with highest RSV for records with the most negative ΔT (mostly October). Excluding the highest ΔT decile (with ∆T ≥ 5.33 °C), which formed the lesser peak of J (with 5/10 observations being April or May), we observed inverse linear correlation of RSV with ∆T. In particular, for every 1 °C month-to-month temperature reduction, mean + 1.08 (± 0.17) unit increase of RSV was observed (p < 0.001, linear model).

Discussion

Several traditional epidemiology studies have investigated the incidence and the seasonality of GBS all over the world. However, the results have often been contrasting. In a systematic review and meta-analysis published by Webb et al. [5], across 9386 patients from 42 studies, an increased risk of GBS in winter versus summer was observed. Shui et al. [4] reported an increased number of visits for GBS in winter and spring in the USA, compared to summer and fall. In other studies, otherwise, no clear seasonal trend was found [10, 11].

In our analysis we found a significant peak of GBS web searches in the month of October overall in the USA. This observation was also confirmed assessing seasonality in the ten most populous states, grouped in five macro-regions. We observed a trend of increased searches also in the months surrounding the October peak, mainly September, and a secondary spring peak in some regions. Month-to-month average temperature variation seems to be an interesting predictor of GBS search peaks. One possible explanation for this phenomenon could be that sudden large variations of the temperatures accompany incidence peaks for seasonal infectious antecedents of GBS (e.g., upper respiratory tract infections in cold season, Campylobacter jejuni gastroenteritis in warm season). The interval (some weeks on average) [12] from the infectious antecedent to GBS onset and the subacute course of the disease could result in a delay from disease onset to Google search for the disease. Therefore, we hypothesize that the observed autumn peak may relate to a summer C. jejuni-related incidence peak [12], while the observed spring peak may be linked to a winter influenza virus-related peak [13].

Comparison of our findings to traditional epidemiology data is made difficult by the absence of analogous population-based incidence data for external validation; hence, our work constitutes only a seminal, proof-of-concept work in the field. Another limit of web search-based approach may arise from a boost in web search popularity caused by mass-media or extensive social-media coverage of a particular topic. We minimized this bias by analysing only the 10-year-averaged monthly data. In addition, we sought proof of internal consistency by comparing data from different geographical regions in the same timeframe, thereby overcoming the effect of geographically limited hype.

One main issue regarding the previous studies of incidence and epidemiology of GBS is that they have been conducted mainly on limited size populations. Conversely, a distinctive feature of this work is that, although we used a surrogate measure of incidence, search engine data constitute a very large and geographically distributed sample.

In previous works assessing disease dynamics with web-based tools, data deriving from GT showed impressive correlation with actual incidence trends [7, 14] and reflected accurately traditional epidemiologic features of neurologic and non-neurologic diseases [8, 9]. This data may offer timely information that, in the influenza field, was observed to possibly even precede traditional public health control systems [7, 14].

Conclusions

This study is the first search engine-based assessment to date of Guillain-Barré syndrome epidemiology. We observed repeatable seasonality patterns emerging from Google Trends searches in USA, with autumn and, at a lesser extent, spring showing the highest peaks for GBS. We correlated these peaks with the largest periods of average temperature change along the year, possibly reflecting increased susceptibility to infectious antecedents of GBS. However, at the present time, it is not possible to conclude if the data deriving from our analysis may indicate that digital epidemiology may precede traditional systems in identifying GBS outbreaks; further studies are needed to assess this topic.