1 Introduction

That inflation expectations by microeconomic agents play a significant role in monetary policy and analysis, is a well-established fact. Taking this into cognizance, surveys like the Michigan Consumer Survey in the US, the Reserve Bank of New Zealand’s Survey of Expectations, the Bank of England Inflation Attitudes Survey, the Inflation Expectations Survey of Households in India, etc. are conducted to obtain information on inflation expectations of the general public. Although these surveys are comprehensive in terms of recording various aspects of the quantitative and qualitative responses on inflation expectations of the general public, very little is known about how exactly the survey respondents form their expectations about future prices.

In this aspect, the success of the otherwise-dominant rational expectations hypothesis, in explaining how agents form their expectations about inflation, has been limited and it has been criticized due to its inadequacy in modelling real-life expectations (Pfajfar and Santoro 2013). This led to alternate models for explaining expectations formation among agents based on behavioural, sociological and epidemiological factors. In this paper, we particularly focus on the empirical evidence of epidemiological sources of inflation expectations formation in India.

The literature on epidemiological sources of inflation expectations took off with the seminal work of Christopher Carroll in 2003. The basic idea, that draws inspiration from epidemiology, concerns how information about expected inflation emanates from a certain common source, like the forecasts made by professional forecasters (SPF), and the general public uses this information from experts to update their beliefs about future inflation. Carroll’s work provides a micro foundation for an aggregate expectations equation in a sticky information set-up consistent with Mankiw and Reis (2002). Since then, the epidemiological model has been explored by Lamla and Maag (2012), Pfajfar and Santoro (2013), Ehrmann et al. (2014) and Lei et al. (2015). The first few studies induct media news as the common source of information based on which the general public form their own inflation expectations. Lei et al. (2015) use newspaper reports on inflation as the common source and further categorize the newspapers (general, economic, political, etc.) to assess the impact of each type of newspaper report on inflation expectations formation in China.

Given this background, our paper focuses on the survey-based responses on inflation expectations by the general public in India and addresses two related issues. First, we explore if the Internet can be modelled as a common source of information based on which agents form their inflation expectations. Second, we ask if there is evidence of Carroll-type epidemiological sources of inflation expectations formation in India.

The contribution of this work to the existing literature is two-fold. This is the first study to our knowledge that uses the Inflation Expectations Survey of Households data to validate the presence of Carroll-type epidemiological sources of inflation expectations formation in India. Second, we depart from all the epidemiology-based studies mentioned above as far as the common source of information is concerned. Instead of considering the forecasts made by the professional forecasters or those provided by the news media, as the common source of information, we hypothesize that the Internet is the common source through which the general public update their future inflation expectations. This is not an implausible assumption in the Indian context since the number of Internet users in India is secondFootnote 1 in the world at approximately 481 million usersFootnote 2 (about 24% of the total population) during December 2017. Additionally, the Inflation Expectations Survey of Households is currently conducted in the urban cities of India where the usage of Internet is pervasive. As per the report “Internet in India 2017” by the Internet and Mobile Association of India (IAMAI) and Kantar IMRB, urban India with an estimated population of 455 million already has 295 million using the Internet which is almost 65% of the urban population. To corroborate our hypothesis that the Internet is indeed a common source of information that people use to form their expectations about future inflation, we use the statistics of inflation-related searches conducted through Google in India between 2006 and 2018, the data on which is extracted through Google Trends.

In the process, this paper brings together two strands in the existing literature- the first one being the epidemiological sources of inflation expectations pioneered by Carroll (2003) as mentioned above, and the other is the use of meaningful data based on Internet search initiated by Choi and Varian (2012). Choi and Varian (2012) have demonstrated how the Google Trends data can be used to forecast automobile sales, forecast travel destination planning, unemployment, etc. More recently, researchers have used Google Trends data to predict spread of diseases (Cho et al. 2013), stock market movements (Dergiades et al. 2015), oil price volatility (Afkhami et al. 2017), unemployment rates (Naccarato et al. 2018) etc. That the Internet-based data is gaining traction in terms of macroeconomic research in India is evident from the recent news that the Reserve Bank of India is all set to start a “Big Data Analytics” division by the end of the year 2018.Footnote 3 In this backdrop, this particular use of Google Trends data for epidemiological sources of inflation expectations formation, is a new addition to the existing literature, both from the point of view of a novel common source for information and also as a new application of the Internet search data.

Our findings indicate that the epidemiological sources of inflation expectations of the general public in India hold for both Carroll (2003) type specification based on the inflation expectations of professional forecasters, as well as the Internet search data. Thus, we infer that the Internet is indeed a significant source from where the agents derive information and update their future inflation expectations. Additionally, we find some limited evidence that the Internet search “sentiment”, that is, the keywords used to conduct inflation-related searches, affect inflation expectations in the expected direction. All our findings are in line with the various works on epidemiology cited above, especially that of Lei et al. (2015).

The rest of the paper is organized as follows. Section 2 outlines the benchmark epidemiology framework pioneered by Carroll (2003), including the modifications made to accommodate our model. Section 3 discusses the data sources. Section 4 analyses the results and Sect. 5 concludes.

2 The model

Carroll’s (2003) seminal work on the epidemiology of inflation expectations draws inspiration from Kermack and McKendrick (1927) model in the epidemiology literature that explains the process of transmission of disease in society from a “common source”. At any given point in time, a constant fraction λ of the population gets affected by the disease that spreads from a common source, like air pollution, while the remaining (1 − λ) fraction of the population do not get affected but nevertheless remain susceptible. In the next time period, fraction λ of those who escaped the disease in the previous period get infected, while the rest do not. This is how a disease is transmitted from a common source over a period of time. Analogous to disease transmission from a common source, Carroll’s (2003) model hypothesizes that news media is the “common source” of information (transmission) based on which a section λ of the population update their expectations about future inflation. The remaining (1 − λ) proportion of the population, who have not been affected by the “common source” or the news media, continue with the inflation expectations from the last period. Thus, at any given point in time, there is a combination of agents who have updated their inflation expectations and those who continue with their inflation belief from the previous period. This is how an element of “information stickiness” is introduced in the model.

$$ E_{t} \left( {\pi_{t + 1} } \right) = \lambda N_{t} \left( {\pi_{t + 1} } \right) + \left( {1 - \lambda } \right)\left\{ {\lambda N_{t - 1} \left( {\pi_{t + 1} } \right) + \left( {1 - \lambda } \right)\left( {\lambda N_{t - 2} \left( {\pi_{t + 1} } \right) + \cdots } \right)} \right\} $$
(1)

where, \( E_{t} (\pi_{t + 1} ) \) is the expectation of inflation rate for period t + 1 conditional on the information available till time period t, \( \pi_{t + 1 } {\text{is the realized}} \) inflation for period t + 1 and \( N_{t} (\pi_{t + 1} ) {\text{is the}} \) updated news at period t to form inflation expectations for period t + 1.

Quoting Carroll (2003), “The derivation of this equation is as follows. In period t a fraction λ of the population will have been ‘infected’ with the current-period newspaper forecast of the inflation rate next quarter, Ntt+1]. Fraction (1 − λ) of the population retains the views that they held in period t  − 1 of period t + 1’s inflation rate. Those period-t − 1 views in turn can be decomposed into a fraction λ of people who encountered an article in period t − 1 and obtained the newspaper forecast of period t + 1’s forecast, Nt−1t+1], and a fraction (1 − λ) who retained their period-t − 2 views about the inflation forecast in period t + 1. Recursion leads to the remainder of the equation”.

The expression \( \left\{ {\lambda N_{t - 1} \left( {\pi_{t + 1} } \right) + \left( {1 - \lambda } \right)\left( {\lambda N_{t - 2} \left( {\pi_{t + 1} } \right) + \cdots } \right)} \right\} \) can be expressed as the sum of an infinite series and Eq. (1) can be written succinctly as:

$$ E_{t} \left( {\pi_{t + 1} } \right) = \lambda N_{t} \left( {\pi_{t + 1} } \right) + \left( {1 - \lambda } \right)E_{t - 1} \left( {\pi_{t} } \right) $$
(1a)

For the purpose of estimation, we use the following Eq. (2) that follows from Eq. (1a),

$$ E_{t} \left( {\pi_{t + 1} } \right) = \alpha_{1} N_{t} \left( {\pi_{t + 1} } \right) + \alpha_{2} E_{t - 1} \left( {\pi_{t} } \right) + \varepsilon_{t} $$
(2)

Most works on the epidemiological sources of inflation expectations, including that by Carroll (2003), use the forecasts made by the professional forecasters as the source from which the general public form their expectations (i.e. \( {\text{N}}_{\text{t}} (\pi_{{{\text{t}} + 1}} ) \)). Lei et al. (2015) assume that \( {\text{N}}_{\text{t}} (\pi_{{{\text{t}} + 1}} ) \) is directly obtained from news reports. Stepping aside from these assumptions, we hypothesize that agents draw their information from the Internet search represented by the Google Trends (GT) data.Footnote 4 Thus, we get Eq. (3) as follows:

$$ N_{t} \left( {\pi_{t + 1} } \right) = \gamma GT_{t} + \eta_{t} $$
(3)

Substituting Eq. (3) in Eq. (2), we getFootnote 5:

$$ E_{t} \left( {\pi_{t + 1} } \right) = \beta GT_{t} + \alpha_{2} E_{t - 1} \left( {\pi_{t} } \right) + \mu_{t} $$
(4)

Following Lei et al. (2015), we categorize the inflation-related searches into three categories- “Favorable” (like low inflation, low prices), “Unfavorable” (like high inflation, rising prices) and “Neutral” (stable prices). The exact keywords searched for are outlined in the Data description section. Accordingly, to assess the search sentiment and its impact on inflation expectations, as in Lei et al. (2015), we estimate the following equation:

$$ E_{t} \left( {\pi_{t + 1} } \right) = c + \alpha E_{t - 1} \left( {\pi_{t} } \right) + \beta^{ + } GT\_Favorable_{t} + \beta^{ - } GT\_Unfavorable_{t} + \beta^{0} GT\_Neutral_{t} + \mu_{t} $$
(5)

However, due to very low number of searches in the “Neutral” category, no data points are available. Thus, for search sentiment we estimate Eq. (6) which is a version of Eq. (5) after excluding the variable \( GT\_Neutral_{t} \):

$$ E_{t} \left( {\pi_{t + 1} } \right) = c + \alpha E_{t - 1} \left( {\pi_{t} } \right) + \beta^{ + } GT\_Favorable_{t} + \beta^{ - } GT\_Unfavorable_{t} + \mu_{t} $$
(6)

Having presented an outline of the epidemiological model suitable for our analysis, we proceed to the following section that gives an overview of the data used for this study, along with a mention of the data sources.

3 Data description

3.1 Inflation expectations

Data on inflation expectations of households in India is available from the Inflation Expectations Survey of Households (IESH) conducted quarterly by the Reserve Bank of India (the Central Bank of India) since the second quarter of 2006.

The survey covers about 5000 urban individuals in each round across 18 major cities in India at present. The sample surveyed represents both genders, nine age categories and seven different broad occupational categories in each city. The survey respondents are asked to quote quantitative numbers based on their inflation perception, one-quarter ahead and one-year ahead inflation expectations. Figures 1 and 2 plot the mean one-quarter ahead and mean one-year ahead inflation expectations of the general public in India against realized CPI and WPI respectively.

Fig. 1
figure 1

Mean 1-quarter-ahead households’ inflation expectations and actual inflation in India

Fig. 2
figure 2

Mean 4-quarters-ahead households’ inflation expectations and actual inflation in India

At a cursory glance, the above figures indicate that through the initial years until about the fourth quarter of 2011, the households’ inflation expectations have moved in tandem with WPI inflation while staying above the actual inflation. Post 2011, the household expectations series shows a co-movement with CPI inflation while staying above CPI inflation all along. It would not be out of place to mention here that the official inflation rate of India was calculated using WPI till 2014 and since then, it is being calculated using CPI (combined). Due to this switch in the official inflation series, it is difficult to infer as to which inflation numbers the general public have been following during the period covered by this study.

Apart from the IESH, there is the Survey of Professional Forecasters (SPF), which at present is conducted bi-monthly by the Reserve Bank of India. This survey was initiated in 2007 and was conducted at a quarterly frequency until April 2014. At present, 21 professional forecasters participate in this survey and give their quantitative forecasts on CPI-combined inflation, CPI-core inflation and WPI-combined inflation rates for the current and future quarters, apart from other macroeconomic forecasts. Figure 3 shows the mean one-quarter ahead and mean-one-year ahead forecasts of the professional forecasters across different categories of WPI inflation.

Fig. 3
figure 3

Professional forecasters' WPI inflation and the actual WPI inflation in India

As evident from Fig. 3, the inflation forecasts by the Survey of Professional Forecasters (SPF) in most periods have stayed below the WPI inflation. If we combine this observation with the conclusions from Figs. 1 and 2 that the household inflation expectations have always remained above WPI inflation, it seems to indicate that households are deriving information or knowledge about future inflation from sources other than the forecasts made by the professional forecasters. This makes a case for our study where we hypothesize that there might be possibly some common source of information like the Internet, based on which the general public form their expectations about future inflation.

Both IESH and SPF data are publicly available at the Reserve Bank of India website (https://www.rbi.org.in/).

3.2 Internet search data

The Internet search data has been collated from Google Trends (https://trends.google.co.in/trends/). Google Trends is a public web facility made available by Google Inc. that gives Google search related statistics.

Search related data in Google Trends can be filtered according to geographic zones, time period, frequency and categories. For example, we looked into the search history for the keyword “Inflation” in India starting 2006: Q2 till 2018: Q2 at a quarterly frequency across “All categories”. The data generated is an index known as the Google Trend Index for the searched term. It is to be noted that Google Trends does not generate data on search volume. Instead, it expresses the searched term as a fraction of the total number of searches conducted in the zone of our interest during a specified time period and assigns the number 100 against the highest fraction. The rest of the data series is rescaled with respect to the data point that corresponds to the number 100.

We draw attention to the fact that since the Google Trends Index is not a search volume data but relative to the total number of searches conducted and is rescaled based on the highest relative search, the index might display slight variation in the past data due to changes in the sample period.

We collected data on the search history of each term related to inflation, namely- “Inflation”, “CPI Inflation”, “WPI Inflation”, “Core Inflation” and “Headline Inflation”; across three search categories, namely- “All Category Web Search”, “Business and Industrial Web Search” and “News Web Search”, that originated from India between 2006: Q2 and 2018: Q2.

For search sentiment, we collected data on the search history of terms to represent “Favorable” search, “Unfavorable” search and “Neutral” search in India during the time period 2006: Q2 and 2018: Q2.

Terms like “price fall”, “price decrease”, and “low inflation”, were clubbed under “Favorable” search, reflecting positive search sentiment. “price rise”, “price increase”, and “high inflation”, were clubbed under “Unfavorable” search, thereby reflecting negative search sentiment. Not enough searches were recorded for terms related to “Neutral” search, thereby generating no data points by Google Trends. Data was generated by using Google Trends comparison that allowed search comparison across multiple keywords.

4 Results

In this section, we present our empirical analysis in two parts. In the first part, following Lei et al. (2015), we explore if the Internet is indeed a common source from where the general public draws inflation-related information. In doing so, we employ the internet search data for the keyword “Inflation” and related terms based on the Google Trends data. In the second part, we go one step ahead and investigate if there is Carroll-type “information stickiness” where only a fraction of the population updates their beliefs about future expectations based on information drawn from the Internet. This requires us to calculate inflation expectations based on Google Trends data, following the methodology proposed by Guzman (2011). Next, with the calculated Google Trends based inflation expectations series, we estimate the epidemiological model of Carroll (2003) and assess the extent of prevalence of information stickiness in the economy.

Prior to starting our analysis, we check for the stationarity of all series considered in this work. Based on Augmented Dickey Fuller (ADF), KPSS and Phillips-Perron tests; the CPI inflation, WPI inflation, CPI of professional forecasters, WPI of professional forecasters, the inflation expectations of households and the inflation expectations derived from Internet search- are all found to be I(1).Footnote 6

Additionally, for each equation estimated in this section and presented in Table 1 through Table 12, we check for the stationarity of the estimated residuals. The estimated residuals are all stationary at levels, thereby indicating that that variables considered are cointegrated and the estimated coefficients of OLS are cointegrating vector. Existence of cointegration implies estimation can be done at level even if individual variables are I(1).

Table 1 Inflation expectations and the internet as a source for “all categories web search”

4.1 Internet as a common source of information

We begin our analysis by estimating Eq. (4) that checks if the Internet is a common source of information for forming inflation expectations. The use of Internet is proxied by the Google Trends data for search keywords like “Inflation”, “CPI Inflation”, “WPI Inflation”, “Core Inflation” and “Headline Inflation”. We further categorize the search keywords across three different web search categories, namely, “All Category Web Search”, “Business and Industrial Web Search” and “News Web Search”. The results of OLS estimations, as well as 2SLS estimation of Eq. 4 are presented in Table 1 through Table 3.

Results indicate that the coefficients of Google search of five inflation-related keywords across three web search categories (third column of Tables 1, 2, 3) are all positive and significant. This implies that the Internet, proxied by the Google search statistics of agents, is indeed a common source from where the public draws information to form future expectations about inflation. This conclusion is robust to the choice of inflation-related keyword, the Google search category and the method of estimation.

Table 2 Inflation expectations and the internet as a source for source for “business and industrial web search”
Table 3 Inflation expectations and the internet as a source for “news category web search”

4.2 Internet search sentiment

Having established the role played by Internet search in inflation expectations formation, we next check if the Internet search sentiments affect inflation expectations of both the households and the professional forecasters, as represented by Eq. 6. Following Lei et al. (2015) who had estimated how news sentiment (how news on inflation are reported) affect expectations formed about future inflations, we try to find if search sentiments affect inflation expectations. It is expected that favorable searches should lead to lower inflation expectations while unfavorable searches are associated with higher inflation expectations. Search sentiments are “Favorable” when searches involve terms like “price fall”, “price decrease”, and “low inflation”. “Unfavorable” search sentiment reflects searches like “price rise”, “price increase”, and “high inflation”. Not enough searches were recorded for terms related to “neutral” search, thereby generating no data points by Google Trends.

Table 4 presents the estimation results of search sentiments based on Eq. 6. The upper panel of the table considers survey-based inflation expectations of households as \( E_{t} (\pi_{t + 1} ) \) (the dependent variable in Eq. 6), while the lower panel of the same table considers the CPI inflation forecasts by the professional forecasters as \( E_{t} (\pi_{t + 1} ) \). Row 1 of Table 4 indicates that for favourable searches, although the coefficient is of the desired negative sign it is statistically not significant and hence no conclusion can be drawn about positive sentiment reducing inflation expectations of households. Unfavorable searches increase inflation expectations of households since the coefficient is both positive and significant. However, the results are not particularly robust for the method of estimation since 2SLS results (row 2) give the correct coefficient signs for search sentiments but they are statistically not significant.

Table 4 Internet search sentiment and inflation expectations

The lower panel of Table 4 gives partial evidence of the impact of search sentiments on inflation expectations of professional forecasters. Row 3 indicates that positive search sentiment is negative, as desired, and is statistically significant, while the negative search sentiment, although of the correct positive sign, is statistically not significant. However, if 2SLS (row 4) is considered as the method of estimation, then none of the search sentiments are statistically significant.

To conclude the discussion on Internet search sentiment and its impact on inflation expectations, we believe that there is some evidence of the same, since the results are sensitive to the choice of the dependent variable (\( E_{t} (\pi_{t + 1} ) \)) and also to the method of estimation. For those particular specifications that give evidence of the impact of search sentiment on inflation expectations, the inferences drawn here match with Lei et al. (2015) that finds the similar impact of media reports on inflation expectations.

4.3 Internet search-based inflation expectations and information stickiness

Having established that the Internet is indeed a source of information for agents’ inflation expectations formation, we next investigate the extent of expectations updation that happens in the economy in a Carroll-like framework. We first estimate the benchmark Carroll (2003) model represented by Eq. 2, followed by our version of the same equation that uses inflation expectations based on Google Trends data.

Table 5 reports the OLS results of the benchmark Carroll (2003) equation, represented by various specifications of Eq. 2, where the WPI inflation expectations of the professional forecasters (SPF) are the epidemiological source of inflation expectations for the general public. In all versions of Eq. 2, the coefficient of SPF is positive and significant, thereby implying that epidemiological sources of inflation expectations formation exist in the Indian economy. Past inflation expectations by the general public are also positive and significant in all versions of Eq. 2, thereby highlighting the contribution of agent’s past expectations while the current expectations about future inflation. Past actual inflation (WPI inflation) matters as well for inflation expectations formation, but it has a negative relationship with the latter (row 5). This negative relation with past WPI inflation seems counter-intuitive, but if we look at the households’ inflation expectations series vis-à-vis the WPI inflation (Fig. 1), we observe that post 2012, the two expectations have moved in opposite directions. When we run the same regressions for a smaller sample between 2008: Q1 and 2013: Q1, lagged value of WPI inflation turns out to be negative and statistically not significant.Footnote 7

Table 5 Benchmark epidemiology model with WPI inflation forecasts of professional forecasters

We repeat the exercise as done in Table 5, to check for epidemiological sources of expectation formation when the CPI forecasts of the professional forecasters are considered.

Similar to the results obtained in Table 5, in Table 6 we find that the coefficient on the CPI forecasts made by the professional forecasters is positive and significant, thereby implying Carroll-type the epidemiological sources of inflation expectations formation by the general public. Past inflation expectations by agents are also positive and significant, thereby indicating that not all agents update their expectations based on current information.

Table 6 Benchmark epidemiology model with CPI inflation forecasts of professional forecasters

Thus overall, using the two types of inflation numbers (WPI and CPI) provided by the professional forecasters as the source of news for the general public, we do find evidence of Carroll-like epidemiological sources of inflation expectations formation in India.

Next, we depart from the SPF-based Carroll type specification of the epidemiology equation and substitute it by inflation expectations derived from Google Trends. Our contention is that some fraction of the population updates their beliefs based on the inflation-related information that they obtain from the Internet, while the rest of the population continue with their previous beliefs about expected inflation.

We follow the methodology outlined by Guzman (2011) in deriving inflation expectations based on Internet search. The expected inflation series based on Internet search, \( (GT\_InfExp)_{t} , \) is calculated as the sum of the directional change in Internet search, that is [\( ln(GT_{t} ) - ln(GT_{t - 1} ) \)], where GT is Google Trends data and a proxy for Internet search; and realized inflation from the last period.Footnote 8 This is represented by Eq. 7 below.

$$ \left( {GT\_InfExp} \right)_{t} = \left[ {ln\left( {GT_{t} } \right) - ln\left( {GT_{t - 1} } \right)} \right] + \pi_{t - 1} $$
(7)

For a visual reference, we plot the calculated series of inflation expectations based on Google Trends data in Fig. 4, against the survey-based expectations of households as well as realized CPI inflation. The correlation between inflation expectations based on Google Trends and survey-based inflation expectations is 0.42 (standard error is 0.13) and it is significant at 1% level of significance, while the correlation between Google Trends based inflation expectations and realized CPI inflation is 0.89 (standard error is 0.07) and it is significant at 1% level of significance. Hence, the Google Trends based inflation expectations series follows realized CPI inflation closely.

Fig. 4
figure 4

Internet search-based inflation expectations, households’ inflation expectations and realized inflation

The calculated series of inflation expectations based on Google Trends has a mean and standard deviation of 7.68 and 3.01 respectively, between 2006: Q2 and 2018: Q2. Compared to this, the mean and standard deviation of households inflation expectations are 9.66 and 2.51 respectively (during 2006: Q2 to 2018: Q2), of CPI by professional forecasters are 9.29 and 1.78 respectively (during 2008: Q1 and 2018: Q2), and of WPI by professional forecasters are 6.86 and 3.38 respectively (during 2008: Q1 and 2018: Q2). It may be noted that the forecasts by professional forecasters commenced from 2008: Q1.

We test for the nature of expectations (adaptive or rational expectations), and find that inflation expectations based on Google Trends data follow rational expectations.Footnote 9

Once we obtain the \( \left( {GT\_InfExp} \right)_{t} \) series, we next estimate the benchmark Carroll (2003) equation (Eq. 2) while replacing the inflation expectations of the professional forecasters by this new series. It is worth noting here, that we do not consider the version of the Carroll equation that includes the \( \pi_{t - 1} \) term as an explanatory variable, since this term (\( \pi_{t - 1} ) \) is already embedded in the calculation of inflation expectations based on Internet search, \( \left( {GT\_InfExp} \right)_{t} \).

As in the first part of our analysis, we consider five key search words- “Inflation”, “CPI Inflation”, “WPI Inflation”, “Core Inflation” and “Headline Inflation”, across three different search categories- “All Category Web Search”, “Business and Industrial Web Search” and “News Web Search”, between 2006: Q2 and 2018: Q2. While “All Category Web Search” is general in nature (suppose a student is searching for “what is inflation”), “News Web Search” caters to searches specific within the news section. For our context, news search is more relevant, although we present the results for all categories for robustness check.

Table 7 reports the OLS and 2SLS estimation results for Carroll-type equation where the Internet is deemed as a common source of information for the general public. Hence, inflation expectations based on Google search for the keyword “Inflation” across three search categories is considered in place of inflation expectations of experts (SPF) in the benchmark Carroll model.

Table 7 Epidemiology model with internet search-based inflation expectations (search keyword: “inflation”)

Results indicate that Carroll-type epidemiological sources of inflation expectations, where the Internet is deemed as a common source for drawing information on inflation, is valid in the Indian context. A positive and significant coefficient of \( \left( {GT\_InfExp} \right)_{t} \) gives this evidence (model 1 in each panel of Table 7). Following Carroll (2003), we run a restricted version model 1 (model 2 in each panel of Table 7) and results indicate that since the coefficient of Google Trends-based inflation expectations is positive and significant at 0.10 across all search categories for both OLS and 2SLS methods, about 10% of the population update their beliefs about future inflation. On the other hand, the positive and significant coefficients of \( E_{t - 1} \left( {\pi_{t} } \right) \) indicate that a significant proportion of the economy do not update their future inflation expectations, thereby indicating the presence of “information stickiness” in the system.

For robustness check, we repeat the estimation exercise done in Table 7, for the various inflation-related searches under specific search categories.

Tables 8, 9, 10 and 11, present the results for searches for four inflation-related keywords- “WPI Inflation”, “CPI Inflation”, “Core Inflation” and “Headline Inflation”, respectively. As in case of Table 7, all the search words are divided across three search categories and the corresponding \( \left( {GT\_InfExp} \right)_{t} \) is calculated in each case.

Table 8 Epidemiology model with internet search-based inflation expectations (search keyword: “WPI inflation”)
Table 9 Epidemiology model with internet search-based inflation expectations (search keyword: “CPI inflation”)
Table 10 Epidemiology model with internet search-based inflation expectations (search keyword: “core inflation”)
Table 11 Epidemiology model with internet search-based inflation expectations (search keyword: “headline inflation”)

Similar to the results in Table 7, we find that irrespective of the search keyword and search category, the epidemiological source of inflation expectations based on Internet search, is positive in all cases and significant in all but eight cases (model 1 for all panels for all tables). A restricted version of model 1 (model 2 for all panels across all tables) indicate that about 9–11% of the population update their inflation expectations when we consider the searches in the “All Category Web Search”. Inflation expectations formed in the previous period also continue to be positive and significant, thereby implying the agents’ stickiness in updating expectations. However, for other categories of search, the restricted version across various keywords, do not give a statistically significant coefficient for internet search-based inflation expectations.

When compared with the benchmark Carroll (2003) epidemiology-based equation where the forecast of professional forecasters is considered (Table 6), while 14% of the population update their inflation expectations based on the former, for Internet as a source of information, this number is around 10%. Thus, the Internet as a source of information for formation of inflation expectations, comes second to possibly news or newspaper reports (where forecasts of professional forecasters are reported), nevertheless it’s impact on future expectations formation is significant and cannot be ignored.

4.4 Internet search and inflation expectations gap

In the concluding part of our analysis, we check if there is evidence of increase in Internet search leading to “near-rational” inflation forecasts by the households. In other words, does the increase in the use of Internet search, that might be considered as procuring more information on inflation, reduce the gap between inflation expectations of the general public and that of the professional forecasters?

To test this premise, Carroll (2003) considered a news index based on the first-page coverage of two newspapers in the US of inflation-related reports. He conjectured that with higher inflation-related news reports, the general public would be better informed and thus the gap between their inflation expectations and that of the professional forecasters would narrow down.

Methodology-wise, the news index of Carroll (2003) matches with the nature of the Google Trends data since both are about inflation-related news coverage/Internet search relative to total newspaper reports/total searches conducted. Thus, the explanatory variable in our case is the Google Trends data for the search word “Inflation”.

To test our premise of more Internet searches (more information) leading to “near rational” inflation expectations of households, we estimate Eq. 8 below:

$$ \left| {GAP_{t} } \right| = \alpha + \beta \left( {GT_{t} } \right) + u_{t} $$
(8)

where the dependent variable is the modulus of inflation expectations of households minus the inflation forecasts of professional forecasters and the explanatory variable on the right-hand side is the Google Trends based index.

Akin to Carroll (2003), results in Table 12 show that the coefficient on Google Trends is negative and significant, for both WPI forecasts and CPI forecasts of professional forecasters, thereby implying that as more Internet searches are conducted, that is equivalent to obtaining more inflation-related information, the inflation expectations gap between the general public and that of the professional forecasters, reduce.

Table 12 Internet search and inflation expectations gap

To sum up the main findings, we show that there is evidence of Carroll-type epidemiological sources of inflation expectations formation in India where the Internet is a common source of information for the general public. However, there is presence of “information stickiness” since not all agents update their expectations each period. Additionally, there is some limited evidence of the impact of Internet search sentiment on inflation expectations of the general public in India.

5 Conclusion

This paper explores the epidemiological sources of inflation expectations formation of the general public in India. While Carroll’s (2003) benchmark study considers the inflation expectations of professional forecasters as the “common source” based on which people form their expectations, we propose that the Internet is also a potential “common source” of information. Based on data extracted from Google Trends that represents Internet search, our results indicate that during the period 2006–2018, the Internet has indeed been a common source of information based on which agents have formed their expectations about future inflation, and the Internet search sentiment has had some impact on inflation expectations. Additionally, based on the inflation expectations series derived from the Google Trends data, we find that there is presence of “information stickiness” in the system since only a small fraction of agents update their inflation expectations each period. Overall, our findings conform to the findings of the epidemiology literature and conclude that the epidemiological sources of inflation expectations formation do exist for the Indian economy.