Abstract
Web search data are associated with disease incidence, population interest, and seasonal variations. This study aimed to investigate seasonal and geographical variations of web search data for sarcoidosis and to explore its association with external factors and sarcoidosis incidence in Sweden. Therefore, sarcoidosis-related data from Google Ads Keyword Planer (2017–2020) were generated for Sweden according to its 21 counties. The relationship between search volume and season, region, population demographics, environmental factors, and the sarcoidosis incidence listed in the National Patient Register was assessed. Analyses revealed seasonal variations for Sweden with an overall peak in the spring and autumn. Geographical differences were observed, with a higher search volume for north-western counties and the lowest search volume for Stockholm County. At the country level, the search volume was positively associated with the sarcoidosis incidence. Higher male proportion and older mean age were associated with a higher search volume, while a higher proportion of foreign-born residents, humidity, and mean temperature were associated with a lower search volume. Our analyses detected correlations between web search data, sarcoidosis incidence, and external factors. Analyses of sarcoidosis web search data therefore appear to be a valuable approach to disease surveillance to address medical needs and public interest.
Similar content being viewed by others
Introduction
Sarcoidosis is a chronic, inflammatory disease characterized by non-caseating granulomas. Sarcoidosis is very heterogeneous in terms of onset, and it can manifest in almost any organ of the body including the skin, heart, joints, lymph nodes, and gastrointestinal tract, although the lungs are affected in more than 90% of affected individuals1,2. The aetiology and pathogenesis of sarcoidosis are largely unknown, and the prognosis depends on the type of disease, therapy response, and the affected individual’s general conditions2,3. The burden of sarcoidosis for individuals and society is substantial2,4. In addition to its economic and psychological impact, sarcoidosis is also characterized by its non-specific symptoms that complicate the diagnostic process2,4.
Incidence and prevalence estimates vary substantially depending on ethnicity and geography, with the highest sarcoidosis prevalence reported for African Americans and in Nordic countries like Sweden5,6,7. Higher rates of sarcoidosis in certain regions may be due to a common contagion, genetic predisposition, and environmental exposure. Local weather (e.g., temperature, sunshine hours, and humidity) is believed to be a key environmental factor influencing the incidence of sarcoidosis in a specific geographic area. Moreover, population density, occupational activities, household, and lifestyle determinants are discussed as environmental factors associated with the development of sarcoidosis8,9,10. While some studies did not show any gender differences11,12, others reported that sarcoidosis was more common in women13,14. A nationwide register-based study in Sweden, however, reported a higher incidence and prevalence in men compared to women (16.0 vs. 13.5/100,000 per year and 179 vs. 141/100,000, respectively)6. The observed peak in age of onset in Europe and the United States is between 20 and 60 years, occurring earlier in men than in women2,6,9,13,14.
Nowadays, internet search engines are widely used to seek health information or to evaluate symptoms. Moreover, the internet may play an essential role in rare diseases like sarcoidosis in information seeking15. Previous studies have revealed that search analysis—one of the methods of infodemiology and infoveillence16—can be a powerful tool to reflect a population’s interest in specific topics and thus can be associated with conventional data17,18. Numerous studies have also shown that web search data are a valuable resource for studying noncommunicable diseases and may be able to predict diseases19,20,21. Recent analyses of web search queries for sarcoidosis found seasonal and geographical variations and indicated an overall unmet medical need22,23. However, no study to date has assessed factors associated with sarcoidosis-related search behaviour to monitor sarcoidosis and related medical needs. In the present study, we analysed web search data on sarcoidosis in Sweden from January 2017 to December 2020 to examine seasonal and geographic trends. Furthermore, we examined the relationship between search volume and environmental and population-related factors as well as the sarcoidosis incidence listed in the National Patient Register (NPR).
Materials and methods
Data collection
Internet search approach
For this retrospective longitudinal study, we used Google Ads Keyword Planner (Google Ads) to generate web search data of the Swedish term for sarcoidosis “sarkoidos” and related keywords for the 21 Swedish counties.
Google Ads is primarily used to optimize marketing campaigns. As it can reflect a population’s interest in certain topics, it is also increasingly used to answer scientific questions18,22. Google Ads provides the most relevant keywords with their average monthly search volume (SV) for different regions and timeframes. Data are available from the four years before the day of the query. We assessed the SV across the Swedish counties from January 1, 2017, to December 31, 2020. Data were limited solely to users in Sweden whose language preference was Swedish. We then normalized the results per 100,000 inhabitants to account for different population sizes between counties. Numbers of inhabitants in the Swedish counties are provided in Additional file 1: Table S1 (www.statistikdatabasen.scb.se).
Sarcoidosis incidence from administrative health data
Data on date and ICD code for sarcoidosis diagnoses in Sweden were obtained from the NPR. The NPR includes data on both inpatient hospitalizations since 1964 (nationwide since 1987; coverage of nearly 100%) and outpatient visits since 2001 (coverage of approximately 87%). Visits have been coded using the Swedish version of the ICD coding system (code for sarcoidosis: ICD-8 135, ICD-9 135, or ICD-10 D86). We identified individuals receiving their first-ever visit listing ICD code for sarcoidosis occurring from January 1, 2010, to December 31, 2013.
Population statistics and environmental factors
Yearly data on inhabitant demographics (mean age, proportion of male inhabitants, foreign-born residents per 100,000 inhabitants, and population density (sq. km)) were taken from Statistics Sweden (Statistiska centralbyrån (SCB); www.statistikdatabasen.scb.se) for the years 2017 to 2020. Weather data (monthly mean humidity (%), sunshine duration (hours), and temperature (°C)) for all counties were taken from the Swedish Meteorological and Hydrological Institute (SMHI; www.smhi.se/data) for the years 2017 to 2020.
Institutional review board approval and informed consent were not applicable for the internet data. Ethical approval for the Swedish register data on the incidence of sarcoidosis was obtained from the Swedish Ethical Review Authority DNR 2014-230-31, amendment 2020-00437.
Statistical analysis
Descriptive data were generated for the SV of sarcoidosis for Sweden as a whole and for its counties (absolute SV per 100,000 and mean per 100,000, standard deviation (SD)). Keywords identified by Google Ads were qualitatively analysed. Differences in SV per 100,000 inhabitants between Sweden’s counties and European seasons (meteorological seasons; winter: December–February, spring: March–May, summer: June–August, and autumn: September–November), were tested with Kruskal–Wallis tests and Friedman tests, respectively, as SV in counties and seasons was not normally distributed. Dunn-Bonferroni corrector was used for post hoc analysis. Spearman’s correlation coefficient was used to assess the relationship between the monthly SV and (1) sarcoidosis incidence and (2) the aforementioned external factors. The comparison between sarcoidosis SV 2017–2020 and incidence 2010–2013 was justifiable, as studies have demonstrated that the incidence of sarcoidosis in Sweden has not increased in recent years 6 and that there were no significant differences over time in the number of new diagnoses in the available NPR dataset since January 2007 (p = 0.34). All factors were included in a multivariable linear regression model to further assess the relationship with SV. To avoid multicollinearity, correlations of all variables were calculated before inclusion in the multivariable model. Standardized regression coefficient (beta) and 95% confidential intervals (95%-CIs) were estimated. P values < 0.05 were considered statistically significant. Analyses were performed using IBM SPSS Statistics for Windows, version 28.0 (IBM Corp). Geodata from the European Commission—Eurostat/GISCO were used to determine administrative boundaries using a geographic information system, QGIS, version 3.30.3 (QGIS Development Team).
Ethics statement
Institutional review board approval and informed consent were not applicable for the internet data. Ethical approval for the Swedish register data on the incidence of sarcoidosis was obtained from the Swedish Ethical Review Authority DNR 2014-230-31, amendment 2020-00437.
Results
The total SV across the 21 Sweden counties was 123,555.74 per 100,000 inhabitants (mean: 122.58, (SD 41.18)). The greatest monthly SV for Sweden was observed in March 2020 with 175.83 searches per 100,000 inhabitants (Fig. 1a), while the highest monthly sarcoidosis incidence was registered in June 2010 (1.22/100,000 inhabitants; (Fig. 1b)). The highest mean monthly SV per 100,000 inhabitants was observed for “sarcoidosis” (71.75 (SD 27.52)). In total, 30 keywords related to the Swedish word for sarcoidosis were identified. Most of them assigned to “manifestations and forms” of sarcoidosis (e.g., pulmonary sarcoidosis, chronic sarcoidosis; 20/30, 66.67%), followed by treatment-related queries (e.g., treatment sarcoidosis; 4/30, 13.33%; [Additional file 2: Table S2]).
Seasonal variation
Overall, the SV per 100,000 inhabitants showed seasonal variations (x2 = 55.25, p < 0.001), with significantly more searches in spring (125.94 (SD 37.58)) and autumn (123.84 (SD 48.91)) than in the summer (116.69 (SD 39.88)) and winter months (118.83 (SD 36.18), 0.001 ≤ p ≤ 0.006). No differences were identified between summer and winter (p = 0.099) and autumn and spring (p = 1.000, (Fig. 1a)).
County variation in monthly SV
The mean monthly SV per 100,000 inhabitants for Swedish counties ranged from 50.07 (SD 7.98) in Stockholm County to 182.97 (SD 56.77) in Västerbotten County and showed significant differences between counties (x2 = 504.09, p < 0.001). Post hoc analyses showed a significantly lower number of monthly search queries per 100,000 for Stockholm County (50.07 (SD 7.98)) compared to all other counties (0.001 < p ≤ 0.009) except for Västra Götaland (83.57 (SD 14.93), p = 1.000). Västerbotten (182.97 (SD 56.77)) had a significantly higher number of monthly search queries than all the other counties (0.001 ≤ p ≤ 0.009) except for Skåne (96.39 (SD 15.73)), Södermanland (98.49 (SD 20.43)), and Halland (103.59 (SD 18.73)).
Population statistics
The SV per 100,000 inhabitants showed a moderate and weak positive correlation with the inhabitants’ mean age (r = 0.44, p < 0.001, (Table 1)) and the proportion of male inhabitants (r = 0.23, p < 0.001). In contrast, the number of foreign-born persons per 100,000 inhabitants and population density showed a good negative correlation with the overall SV (r = − 0.52, p < 0.001 and r = − 0.52, < 0.001, respectively).
Environmental factors
No correlation was observed between SV and the monthly sunshine hours (r = − 0.03, p = 0.417) as well as between the SV and the monthly relative humidity (r = 0.05, p = 0.135; (Table 1)). Furthermore, the monthly mean temperature showed a weak negative correlation with the overall SV (r = − 0.12, p < 0.001). Correlations between the different environmental factors and SV on the county level were weak to moderate (− 0.34 ≤ r ≤ 0.28, 0.02 ≤ p ≤ 0.94; (Table 2)).
Association between monthly SV and incidence
Across Sweden, the monthly incidence showed a low positive correlation with the SV (r = 0.16, p = 0.003). Västerbotten, Jämtland, and Västernorrland, the counties with the highest sarcoidosis incidence per 100,000 inhabitants, exhibited elevated search frequencies compared to other counties (Fig. 2). On the county level, the SV showed a moderate correlation in Stockholm (r = 0.38, p = 0.008), Kronoberg (r = 0.32, p = 0.020), and Västernorrland (r = 0.43, p = 0.002; (Table 2)), while the other counties showed no to weak correlation (− 0.29 ≤ r ≤ 0.20, 0.14 ≤ p ≤ 0.98).
Prediction of the sarcoidosis SV
Test for multicollinearity showed a correlation between a higher proportion of foreign-born residents and higher population density (r = 0.70, p < 0.001; [Additional file 3: Table S3]). Overall, the proportion of male inhabitants had the greatest effect on the SV per 100,000 inhabitants, with a higher proportion resulting in a higher SV (beta 8.87, 95%-CI [2.13; 14.42], p = 0.005, (Table 1)). A higher monthly sarcoidosis incidence, a higher mean age, and more sunshine hours were associated with a higher SV (beta 5.86, 95%-CI [2.57; 9.16], p < 0.001, beta 0.19, 95%-CI [− 0.28; 0.75], p = 0.881, and beta 0.01, 95%-CI [− 0.05; 0.06], p = 0.887, respectively). In contrast, a higher temperature, higher humidity, and a higher number of foreign-born residents per 100,000 inhabitants resulted in a lower SV (beta − 0.60, 95%-CI [− 1.06; − 0.13], p = 0.012, beta − 0.25, 95%-CI [− 0.67; 0.18], p = 0.258, and beta − 0.01, 95%-CI [− 0.006; − 0.004], p < 0.001, respectively). Adjusted R2 of model summary was 0.316.
Discussion
Our analysis of sarcoidosis-related web search data showed that in Swedish counties, the highest number of search queries was observed in the spring and autumn. Västerbotten, Jämtland, and Västernorrland the counties with the highest number of new sarcoidosis cases per 100,000 inhabitants, had more searches per 100,000 inhabitants compared to the other counties. A positive correlation was found between web search queries and monthly incidence on country level showing that web search queries can serve as a proxy for the incidence of sarcoidosis. Further, we found that counties with a higher proportion of male inhabitants, lower average temperature, and fewer foreign-born residents have significantly more sarcoidosis-related searches.
Previous studies using web search data have also demonstrated positive correlations with various disease registry data (e.g., for conjunctivitis24, coronary heart disease19, and cancer incidence and mortality20,21). However, the correlations observed in our study were weaker compared to those of previous studies using web search data. This may be because sarcoidosis is a rare disease with not as readily available high-quality information on the internet25 and low exposure in the media compared to the previously studied diseases26. Additionally, it should be noted that the date of the ICD-coded visit listing sarcoidosis may be different than when the patient begins searching for information on the internet if they are suspected of having sarcoidosis, but it is not confirmed until the visit. This aspect should be taken into account in further investigations. Nevertheless, web search data seem to play an important role in gathering information about rare diseases for affected individuals15. Analysing web search data for sarcoidosis therefore remains a feasible and inexpensive solution to disease surveillance.
Our study showed a relationship between the geographical variation in the number of searches. Geographical differences in the prevalence of sarcoidosis between different areas of one country is a phenomenon observed worldwide27,28,29. For Sweden, the highest sarcoidosis prevalence was reported for the less densely populated north-western regions, and it was discussed that these differences may be caused by the population genetics of the Swedish counties, with higher ethnic diversity in the southern urbanized counties6. This theory is in agreement with our study, as a higher population density and number of foreign-born residents were associated with fewer searches for sarcoidosis. However, foreign-born residents can also search online in another language, which should be taken into account when interpreting the results.
The higher prevalence of sarcoidosis observed in northern counties could be due to decreased exposure to sunlight and thereby a vitamin D deficiency may be associated with sarcoidosis8. Moreover, exposure to humidity and lower temperature also appear to be associated with sarcoidosis, which could explain the observed geographical variations30. Such an association was only observed to a small degree in our study, with a lower temperature being associated with more search queries. Thus, as only 30% of the variation in search behaviour could be explained by the assessed variables including age and sex, there must be additional variables that influence sarcoidosis-related search behaviour.
A recent study indicated differences in sarcoidosis diagnoses and treatment among different Swedish healthcare regions, which may suggest differences in awareness of the disease31. Differences in diagnosis and treatment do not appear to match the differences in prevalence in Sweden31. However, it was observed that a better supply of general practitioners and outpatient specialists was associated with a higher number of searches in Germany17. Regional differences in Sweden’s healthcare system and utilization32 may therefore explain further variations in the search behaviour.
We identified a seasonal pattern that showed a higher number of sarcoidosis-related search queries during the spring. This is in accordance with prior studies assessing sarcoidosis-related web search data22,23 and various epidemiological studies that demonstrated sarcoidosis occurring more commonly during the spring8,10. One explanation for this seasonality is that sarcoidosis may be triggered by organic bioaerosols, predominately pollen, that are more common during springtime8. It is also possible that there is a connection between public holidays in Sweden (July/August and December/January) and the associated shifts in diagnosis. Signs and symptoms tend to be overlooked and left unaddressed until after the vacation period33,34. Other epidemiological studies also reported that the prevalence of sarcoidosis is associated with certain industrial organic and inorganic dust8,35; agents that are found in abundance in the work environment of miners, firefighters, and agricultural workers9,35,36,37, occupational groups which show a higher risk for sarcoidosis, a factor that should be considered in future analyses.
Despite the absence of identified COVID-19-related keywords, the elevated SV observed in 2020 can be attributed in part to the pandemic’s influence. This phenomenon may stem from factors such as decreased access to medical appointments for symptom clarification, medical inquiries, and support provision for affected individuals38.
The reliability of the internet search data for sarcoidosis in Sweden may be limited, as it only identified 30 keywords with an overall SV of 123,556 searches per 100,000 inhabitants, while for Germany 433 keywords associated with sarcoidosis with a SV of 3,068,200 per 100,000 inhabitants22 were identified despite a similar market share of Google (> 90%39). Moreover, the accuracy of Swedish websites compared to English or other language websites may differ, potentially affecting the study's results. In addition, internet search data and disease incidence did not match in time due to limited data availability. Analyses of Google search queries are limited to individuals who have internet access and use this search engine. Although 98% of Sweden’s households had access to the Internet and 94% of the population used it in 201840 (93% of Internet users using Google as a search engine39), only including data collected by Google excludes information from non-users. Although the elderly are increasingly using the internet, younger people still use the internet more frequently in Sweden40. We only included data from searches with the Swedish language preferred and are therefore missing searches done in other languages. No statements about individual users can be made since Google Ads do not provide information about user demographics. However, we observed correlations between the percentages of male inhabitants and the number of foreign-born residents, which indicate the influence of demographics on search behaviour. Also, educational level and communication habits among affected individuals can impact online search behaviour and should be considered in future research. It is essential to recognize, however, that our study adopts an ecological design, and caution must be exercised to avoid the ecological fallacy when interpreting these findings41. In addition, it must be considered that the searches do not necessarily only originate from affected individuals but also from their relatives, physicians, medical and science students, or national visitors to medical congresses interested in sarcoidosis. We did not have information on searches for Löfgren syndrome, which would be interesting to include in future studies to examine whether trends and relationships are different by sarcoidosis phenotype. Google provides automatic completion of search suggestions, which could also bias a person’s search behaviour. Moreover, the data provided by Google Ads are based only on monthly estimations instead of exact numbers. Thus, the estimations may be overestimated for the individual counties and underestimated for Sweden as a whole42.
In summary, our study demonstrated a correlation between Google searches for sarcoidosis and sarcoidosis incidence across Sweden. We characterized seasonal trends and identified geographical variations that may serve as a point of comparison or a standard for future studies or analyses. Moreover, we found that a higher number of searches was correlated with lower temperature and a lower proportion of foreign-born residents.
Conclusion
Analysis of Internet search data is a novel approach to detecting unmet medical needs, gauging public interest in sarcoidosis, and identifying high-risk groups and new risk factors. However, further analyses are needed with internet search and incidence data referring to the same period, and future studies should focus on more environmental and health-specific aspects to evaluate the specificity and sensitivity of the web search data.
Data availability
The datasets used during the current study are available from the corresponding author on reasonable request.
References
Bargagli, E. & Prasse, A. Sarcoidosis: A review for the internist. Intern. Emerg. Med. 13, 325–331. https://doi.org/10.1007/s11739-017-1778-6 (2018).
Gerke, A. K., Judson, M. A., Cozier, Y. C., Culver, D. A. & Koth, L. L. Disease burden and variability in sarcoidosis. Ann. Am. Thorac. Soc. 14, S421–S428. https://doi.org/10.1513/AnnalsATS.201707-564OT (2017).
Grunewald, J. et al. Sarcoidosis. Nat. Rev. Dis. Primers 5, 45. https://doi.org/10.1038/s41572-019-0096-x (2019).
Arkema, E. V., Eklund, A., Grunewald, J. & Bruze, G. Work ability before and after sarcoidosis diagnosis in Sweden. Respir. Med. 144S, S7–S12. https://doi.org/10.1016/j.rmed.2018.09.016 (2018).
Yoon, H. Y., Kim, H. M., Kim, Y. J. & Song, J. W. Prevalence and incidence of sarcoidosis in Korea: A nationwide population-based study. Respir. Res. 19, 158. https://doi.org/10.1186/s12931-018-0871-3 (2018).
Arkema, E. V., Grunewald, J., Kullberg, S., Eklund, A. & Askling, J. Sarcoidosis incidence and prevalence: A nationwide register-based assessment in Sweden. Eur. Respir. J. 48, 1690–1699. https://doi.org/10.1183/13993003.00477-2016 (2016).
Sharma, O. P. Sarcoidosis around the world. Clin. Chest Med. 29, 357–363. https://doi.org/10.1016/j.ccm.2008.03.013 (2008).
Judson, M. A. Environmental risk factors for sarcoidosis. Front. Immunol. 11, 1340. https://doi.org/10.3389/fimmu.2020.01340 (2020).
Deubelbeiss, U., Gemperli, A., Schindler, C., Baty, F. & Brutsche, M. H. Prevalence of sarcoidosis in Switzerland is associated with environmental factors. Eur. Respir. J. 35, 1088–1097. https://doi.org/10.1183/09031936.00197808 (2010).
Ramos-Casals, M. et al. How the frequency and phenotype of sarcoidosis is driven by environmental determinants. Lung 197, 427–436. https://doi.org/10.1007/s00408-019-00243-2 (2019).
Gribbin, J. et al. Incidence and mortality of idiopathic pulmonary fibrosis and sarcoidosis in the UK. Thorax 61, 980–985. https://doi.org/10.1136/thx.2006.062836 (2006).
Thomeer, M., Demedts, M., Vandeurzen, K. & Diseases, V. WGo. I. L. Registration of interstitial lung diseases by 20 centres of respiratory medicine in Flanders. Acta Clin. Belg. 56, 163–172. https://doi.org/10.1179/acb.2001.026 (2001).
Hillerdal, G., Nou, E., Osterman, K. & Schmekel, B. Sarcoidosis: Epidemiology and prognosis. A 15-year European study. Am. Rev. Respir. Dis. 130, 29–32. https://doi.org/10.1164/arrd.1984.130.1.29 (1984).
Erdal, B. S., Clymer, B. D., Yildiz, V. O., Julian, M. W. & Crouser, E. D. Unexpectedly high prevalence of sarcoidosis in a representative US Metropolitan population. Respir. Med. 106, 893–899. https://doi.org/10.1016/j.rmed.2012.02.007 (2012).
Morgan, T. et al. Using the internet to seek information about genetic and rare diseases: A case study comparing data from 2006 and 2011. JMIR Res. Protoc. 3, e10. https://doi.org/10.2196/resprot.2916 (2014).
Eysenbach, G. Infodemiology and infoveillance: Framework for an emerging set of public health informatics methods to analyze search, communication and publication behavior on the Internet. J. Med. Internet Res. 11, e11. https://doi.org/10.2196/jmir.1157 (2009).
Tizek, L. et al. Influence of climate on google internet searches for pruritus across 16 German cities: Retrospective analysis. J. Med. Internet Res. 21, e13739. https://doi.org/10.2196/13739 (2019).
Ziehfreund, S., Tizek, L. & Zink, A. Web search data as health data? : Geographic differences, temporal trends, and topics of interest from internet search engine analyses in Germany. Hautarzt 73, 53–60. https://doi.org/10.1007/s00105-021-04918-x (2022).
Senecal, C., Widmer, R. J., Lerman, L. O. & Lerman, A. Association of search engine queries for chest pain with coronary heart disease epidemiology. JAMA Cardiol. 3, 1218–1221. https://doi.org/10.1001/jamacardio.2018.3459 (2018).
Xu, C. et al. Detecting lung cancer trends by leveraging real-world and internet-based data: Infodemiology study. J. Med. Internet Res. 22, e16184. https://doi.org/10.2196/16184 (2020).
Wehner, M. R., Nead, K. T. & Linos, E. Correlation among cancer incidence and mortality rates and internet searches in the United States. JAMA Dermatol. 153, 911–914. https://doi.org/10.1001/jamadermatol.2017.1870 (2017).
Hilker, C. et al. Leveraging internet search data to assess prevalence, interest, and unmet needs of sarcoidosis in Germany. Sci. Rep. 11, 20841. https://doi.org/10.1038/s41598-021-00131-x (2021).
Stanton, A. & Katz, S. J. Internet search results correlate with seasonal variation of sarcoidosis. BMC Pulm. Med. 21, 227. https://doi.org/10.1186/s12890-021-01602-7 (2021).
Kammrath Betancor, P., Tizek, L., Zink, A., Reinhard, T. & Bohringer, D. Estimating the incidence of conjunctivitis by comparing the frequency of google search terms with clinical data: Retrospective study. JMIR Public Health Surveill. 7, e22645. https://doi.org/10.2196/22645 (2021).
Pauer, F. et al. Rare diseases on the internet: An assessment of the quality of online information. J. Med. Internet Res. 19, e23. https://doi.org/10.2196/jmir.7056 (2017).
Cervellin, G., Comelli, I. & Lippi, G. Is Google Trends a reliable tool for digital epidemiology? Insights from different clinical settings. J. Epidemiol. Glob. Health 7, 185–189. https://doi.org/10.1016/j.jegh.2017.06.001 (2017).
Dumas, O., Abramovitz, L., Wiley, A. S., Cozier, Y. C. & Camargo, C. A. Jr. Epidemiology of sarcoidosis in a prospective cohort study of US women. Ann. Am. Thorac. Soc. 13, 67–71. https://doi.org/10.1513/AnnalsATS.201508-568BC (2016).
Kajdasz, D. K., Judson, M. A., Mohr, L. C. Jr. & Lackland, D. T. Geographic variation in sarcoidosis in South Carolina: Its relation to socioeconomic status and health care indicators. Am. J. Epidemiol. 150, 271–278. https://doi.org/10.1093/oxfordjournals.aje.a009998 (1999).
Beghe, D. et al. Sarcoidosis in an Italian province. Prevalence and environmental risk factors. PLoS One 12, e0176859. https://doi.org/10.1371/journal.pone.0176859 (2017).
Salah, S., Abad, S., Monnet, D. & Brezin, A. P. Sarcoidosis. J. Fr. Ophtalmol. 41, e451–e467. https://doi.org/10.1016/j.jfo.2018.10.002 (2018).
Rossides, M., Kullberg, S., Eklund, A., Grunewald, J. & Arkema, E. V. Sarcoidosis diagnosis and treatment in Sweden: A register-based assessment of variations by region and calendar period. Respir. Med. 161, 105846. https://doi.org/10.1016/j.rmed.2019.105846 (2020).
Johansson, N., Jakobsson, N. & Svensson, M. Regional variation in health care utilization in Sweden—the importance of demand-side factors. BMC Health Serv. Res. 18, 403. https://doi.org/10.1186/s12913-018-3210-y (2018).
Lambe, M., Blomqvist, P. & Bellocco, R. Seasonal variation in the diagnosis of cancer: A study based on national cancer registration in Sweden. Br. J. Cancer 88, 1358–1360. https://doi.org/10.1038/sj.bjc.6600901 (2003).
Oiwake, H. Y., Nonaka, D. & Toyosato, T. Factors associated with delayed diagnosis among patients with COVID-19 in Okinawa, Japan. Int. J. Environ. Res. Public Health 19, 8624. https://doi.org/10.3390/ijerph19148634 (2022).
Newman, L. S. et al. A case control etiologic study of sarcoidosis: Environmental and occupational risk factors. Am. J. Respir. Crit. Care Med. 170, 1324–1330. https://doi.org/10.1164/rccm.200402-249OC (2004).
Hena, K. M. et al. Clinical evaluation of sarcoidosis in community members with world trade center dust exposure. Int. J. Environ. Res. Public Health 16, 7. https://doi.org/10.3390/ijerph16071291 (2019).
Ronsmans, S. et al. Associations between occupational and environmental exposures and organ involvement in sarcoidosis: A retrospective case-case analysis. Respir. Res. 22, 224. https://doi.org/10.1186/s12931-021-01818-5 (2021).
Wecker, H., Ziehfreund, S., Hindelang, M., Welcker, M. & Zink, A. Change of perspective: Impact of COVID-19 pandemic on axial spondyloarthritis-related web searches in Germany. Sci. Rep. 14, 4044. https://doi.org/10.1038/s41598-024-54047-3 (2024).
statcounter. Search Engine Market Share Worldwide. https://gs.statcounter.com/search-engine-market-share/all/sweden/#monthly-201701-202012 (2023).
interne, S. o. Swedens and the Internet. https://svenskarnaochinternet.se/english/earlier-years-reports/ (2023).
Pearce, N. The ecological fallacy strikes back. J. Epidemiol. Commun. Health 54, 326–327. https://doi.org/10.1136/jech.54.5.326 (2000).
Tizek, L. et al. Interest in skin cancer in urban populations: A retrospective analysis of google search terms in nine large German cities. Acta Derm. Venereol. 99, 797–804. https://doi.org/10.2340/00015555-3214 (2019).
Funding
Open Access funding enabled and organized by Projekt DEAL. This work was supported by the Technical University of Munich, TUM School of Medicine and Health, Munich, Germany and the Department of Medicine Solna, Karolinska Institute, Stockholm, Sweden.
Author information
Authors and Affiliations
Contributions
SZ, LT, EVA, and AZ contributed to the initial research question and created the study design. SZ, LT, EVA, and AZ planned and carried out the data generation and statistical analysis. SZ, EVA, and AZ contributed to the interpretation of the results. SZ drafted the manuscript. LT, EVA, and AZ provided critical feedback and helped shape the research, analysis, and manuscript. All authors approved the submitted version.
Corresponding author
Ethics declarations
Competing interests
EVA reported no conflict of interest SZ has received grants from Novartis LT currently works for ViiV Healthcare and has received speaker’s honoraria and/or grants from Novartis and Jassen Cilag. AZ has been an advisor and/or received speaker's honoraria and/or received grants and/or participated in clinical trials of the following companies: AbbVie, Almirall, Amgen, Beiersdorf Dermo Medical, Bencard Allergie, BMS, Celgene, Eli Lilly, GSK, Janssen Cilag, Leo Pharma, Miltenyi Biotec, Novartis, Pfizer, Sanofi-Aventis, Takeda Pharma, UCB Pharma.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ziehfreund, S., Tizek, L., Arkema, E.V. et al. Identifying sarcoidosis trends using web search and real-world data in Sweden: a retrospective longitudinal study. Sci Rep 14, 19260 (2024). https://doi.org/10.1038/s41598-024-69223-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-69223-8
- Springer Nature Limited