A substantial increase in the availability of store brands across various categories and the expectation that recessionary trends often involve a shift toward store brands have renewed scholarly interest in understanding consumer behavior related to store brands. Despite the breadth of the literature on store brands (e.g., Batra and Sinha 2000; Dhar and Hoch 1997; Sethuraman 1992), little research has been done on umbrella branding in the context of store brands. Most previous research has concentrated on the umbrella branding of national brands rather than store brands. For example, Erdem (1998) demonstrated how consumers may learn about national umbrella brands through their user experiences across categories. Erdem et al. (2004) further estimated a learning model to explain differential market shares for store brands in the United States versus those in Europe; however, they did not explore any cross-category learning effects.

The purpose of this work is to extend previous work by Erdem (1998) and Erdem et al. (2004) to study the learning spillover effects of umbrella brands across multiple categories for both national and store brands. For this purpose, we apply the Multivariate Multinomial Probit Model of cross-category learning, which allows for simultaneous and interdependent choice of different brands across categories. To the best of our knowledge, our model is the first Multivariate Multinomial Probit Model (MVMNP) with learning (previous learning models assumed the multinomial logit (MNL)). In addition, we repeat the analysis in three countries: the United States, the United Kingdom, and Spain. Our results indicate that cross-category learning effects exist between several pairs of categories for both store and national umbrella brands, although some of the categories in which correlated learning happens differ between store and national umbrella brands. The degree of cross-category learning also varies across product categories. This paper provides valuable insights into cross-learning effects of umbrella brands and differences in learning between store and national umbrella brands. It has managerial implications such that umbrella brands—whether national or store brands—need to provide consistent experiences within and across categories and firms need to understand the factors underlying the transfer of quality associations thoroughly.

We briefly outline the relevant literature in the next section. We subsequently introduce our model and discuss our data and empirical findings. We conclude with a discussion of implications and future research.

Literature review

Literature on umbrella brands

Theoretical work has established that umbrella branding might reduce consumer uncertainty and perceived risk associated with extension (Wernerfelt 1988). Early empirical work also demonstrated that consumers—faced with uncertainty about product quality—might develop prior expectations about the quality of an extension based on the perceived quality of its parent brand (Montgomery and Wernerfelt 1992). According to Erdem (1998), in the context of a dynamic structural model with myopic consumers, consumers’ quality perceptions for a brand in a particular category (toothpaste) are affected by their experience with the same brand in another category (toothbrush). She estimates a multinomial logit model, thereby not allowing for coincidence in purchases across the two categories studied. Also, she focuses on consumers’ learning of national umbrella brands and did not take into account that of store brands, which are also umbrella brands in general.

Although prior literature has provided evidence for spillover effects in umbrella branding (e.g., Balachander and Ghose 2003; Chen and Liu 2004; Erdem 1998),Footnote 1 Moorthy (2010) has shown that such branding serves as a signal of quality only if the correlation between the parent and new brands is large enough. Indeed, the success of brand extensions has been shown to largely depend on the fit between the original product and the extension (Rotemberg 2010; Volckner and Sattler 2006; Volckner et al. 2008).

Finally, theory papers have shown conditions when umbrella branding may or may not be the optimal strategy for firms (e.g., Amrouche and Zaccour 2009; Cabral 2008; Miklos-Thal 2008). For example, Amrouche and Zaccour’s (2009) analysis takes into account the strategic interactions between manufacturers and retailers as well as the positive spillover among private label sales in different categories. Surprisingly, their results indicate that umbrella branding may lead to lower profits for the retailer. These results ultimately depend on the market potential of the different brands in the various categories and the degree of competition between the national brand and the private label. Ultimately, the literature demonstrates that umbrella branding does not always imply high quality (Hakenes and Peitz 2008). Despite the breadth of the literature on national umbrella brands, store brands have not been investigated in depth as “umbrella brands.”

Recent literature on store brands

A number of studies in the recent literature on store brands have focused on the impact of store brands either on retailers’ margin and profit (Ailawadi and Harlam 2004) or on store loyalty (Ailawadi et al. 2008; Corstjens and Lal 2000; Sudhir and Talukdar 2004). Previous literature has also examined the determinants of retailers’ store and national brand pricing behaviors (Cotterill and Putsis 2000; Cotterill et al. 2000). On the other hand, some researchers have investigated the competitive interaction between national and store brands. For example, Chintagunta et al. (2002) investigate changes in preferences for national brands and price elasticities due to the introduction of a store brand. The introduction of a store brand has been shown to influence not only brand choice but also retailers’ bargaining power (Geyskens et al. 2010; Meza and Sudhir 2010; Narasimhan and Wilcox 1998; Pauwels and Srinivasan 2004). Furthermore, Gielens (2011) find that new products offering substantially new, intrinsic benefits are relatively more effective than other new product introductions and even more so when fighting private labels rather than national brands. In contrast, private labels are hardly hurt by new products that offer new, extrinsic benefits.

In addition, existing literature has examined differential market shares of store brands across product categories. It has been shown that private label shares are higher in categories with low perceived consumer risk (Batra and Sinha 2000), higher relative quality, and lower quality variability (Sethuraman 1992). Furthermore, Sethuraman (2003) measures the overall brand equity of national brands based on customers’ reservation price and perceived quality differentials between national and store brands. He concentrates on the decomposition of brand equity into quality and non-quality equity, yet neglects to consider the role of consumer learning in the formation of quality perceptions.

Lastly, Erdem et al. (2004) have shown that differences between national and store brands’ perceived quality and experience variability, as well as differences in consumer sensitivities to price, risk, and quality across countries, may explain the differential success of store brands across countries. Szymanowski and Gijsbrechts (2012) take into account cross-brand learning among private labels based on the assumption that spillovers may also occur between different private labels perceived as similar. However, none of these papers—including Erdem et al. (2004)—allow for any cross-category effects, including cross-category learning effects.

Model

Consider a set of consumers (households) I={i|i = 1, 2, …, I} that make purchases chosen from at least one of M different product categories, where M={m|m = 1, 2, …, M} and I is the number of consumers. Let J n ={j n | j n  = 1, 2, …, J n } be the set of national brands that is available in at least one of m categories, and J s ={j s | j s =1, 2, …, J s } is the set of store brands that is available in at least one of m categories. The consumers’ purchases are observed over the period T={t|t = 1, 2, …, T}, where T is the time span.

Consumers may be imperfectly informed and thus uncertain about product quality. As mentioned in Erdem (1998), this uncertainty can persist even after the consumers experience the product, because use experience might provide only noisy information. Thus, consumer perceptions of quality levels may deviate from true levels. Therefore, we define the overall quality of store brands and national brands as follows:

$$ {X_{{i{j_n}tm}}} = {A_{{{j_n}m}}} + {x_{{i{j_n}tm}}} $$
(1)
$$ {X_{{i{j_s}tm}}} = {A_{{{j_s}m}}} + {x_{{i{j_s}tm}}} $$
(2)

where\( {X_{{i{j_n}tm}}} \) is the overall quality level of national-brand j n in category m that consumer i would have perceived at time t if consumer i purchased the brand in category m at time t. \( {A_{{{j_n}m}}} \) is the (true) mean quality level for national-brand j n in category m, and \( {x_{{i{j_n}tm}}} \) is an i.i.d. random error term. We denote the overall quality level, mean quality level, and random error term for the store brands as \( {X_{{i{j_s}tm}}},{A_{{i{j_s}m}}}, \) and \( {x_{{i{j_s}tm}}} \), respectively. Equations 1 and 2 show that experience with a brand provides unbiased but imperfect information about the brand’s true product quality.

As in Erdem (1998), we assume that consumers learn about the mean brand quality levels \( \left( {{A_{{{j_n}}}},{A_{{{j_s}}}}} \right) \) through Bayesian updating. Previous literature suggests that the Bayesian updating mechanism often provides a reasonable fit to observed choice behavior (Erdem 1998; Erdem and Keane 1996; Erdem et al. 2004; Roberts and Urban 1988). We also assume that consumers’ priors on the quality levels \( {A_{{{j_n}}}} \) and \( {A_{{{j_s}}}} \) are normally distributed at time t = 0 for m = 1, 2,…, M, and consumer prior qualities are correlated across categories for national and store umbrella brands.Footnote 2

$$ \left[ \begin{gathered} {A_{{{j_n}1}}} \hfill \\ {A_{{{j_n}2}}} \hfill \\ \vdots \hfill \\ {A_{{{j_n}M}}} \hfill \\ \end{gathered} \right]\sim N\left( {\left[ \begin{gathered} {{\bar{A}}_{{{j_n}1}}} \hfill \\ {{\bar{A}}_{{{j_n}2}}} \hfill \\ \vdots \hfill \\ {{\bar{A}}_{{{j_n}M}}} \hfill \\ \end{gathered} \right],\left[ {\begin{array}{*{20}{c}} \begin{gathered} \sigma_{{{A_n}1}}^2 \hfill \\ {\pi_{{{A_n}1,2}}} \hfill \\ \vdots \hfill \\ {\pi_{{{A_n}1,M}}} \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_n}1,2}}} \hfill \\ \sigma_{{{A_n}2}}^2 \hfill \\ \vdots \hfill \\ {\pi_{{{A_n}2,M}}} \hfill \\ \end{gathered} & \begin{gathered} \cdots \hfill \\ \cdots \hfill \\ \ddots \hfill \\ \cdots \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_n}1,M}}} \hfill \\ {\pi_{{{A_n}2,M}}} \hfill \\ \vdots \hfill \\ \sigma_{{{A_n}M}}^2 \hfill \\ \end{gathered} \\ \end{array} } \right]} \right) $$
(3)
$$ \left[ \begin{gathered} {A_{{{j_s}1}}} \hfill \\ {A_{{{j_s}2}}} \hfill \\ \vdots \hfill \\ {A_{{{j_s}M}}} \hfill \\ \end{gathered} \right]\sim N\left( {\left[ \begin{gathered} {{\bar{A}}_{{{j_s}1}}} \hfill \\ {{\bar{A}}_{{{j_s}2}}} \hfill \\ \vdots \hfill \\ {{\bar{A}}_{{{j_s}M}}} \hfill \\ \end{gathered} \right],\left[ {\begin{array}{*{20}{c}} \begin{gathered} \sigma_{{{A_s}1}}^2 \hfill \\ {\pi_{{{A_s}1,2}}} \hfill \\ \vdots \hfill \\ {\pi_{{{A_s}1,M}}} \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_s}1,2}}} \hfill \\ \sigma_{{{A_s}2}}^2 \hfill \\ \vdots \hfill \\ {\pi_{{{A_s}2,M}}} \hfill \\ \end{gathered} & \begin{gathered} \cdots \hfill \\ \cdots \hfill \\ \ddots \hfill \\ \cdots \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_s}1,M}}} \hfill \\ {\pi_{{{A_s}2,M}}} \hfill \\ \vdots \hfill \\ \sigma_{{{A_s}M}}^2 \hfill \\ \end{gathered} \\ \end{array} } \right]} \right) $$
(4)

In Eqs. 3 and 4, \( {\left( {{{\bar{A}}_{{{j_n}1}}},...,{{\bar{A}}_{{{j_n}M}}}} \right)^T} \)is the vector of prior mean perceived quality levels for national brand j n , such that \( {E_{{0i}}}\left[ {{A_{{{j_n}m}}}} \right] = {\bar{A}_{{{j_n}m}}} \) for each consumer. \( \sigma_{{{A_n}m}}^2 \) and \( {\pi_{{{A_n}k,m}}} \) are the prior variance of consumer quality beliefs in regard to the national brand’s quality level in category m and the initial covariance between the quality levels of the umbrella brands in categories k and m, where k≠m, as perceived by consumer i at t = 0. We denote the prior mean perceived quality levels, the prior variance, and the initial covariance between the quality levels (the covariance between the prior beliefs for the umbrella brand) in categories k and m of store brand as \( {\left( {{{\bar{A}}_{{{j_s}1}}},...,{{\bar{A}}_{{{j_s}M}}}} \right)^T}, \sigma_{{{A_s}m}}^2 \), and \( {\pi_{{{A_s}k,m}}} \), respectively. Thus, the diagonal elements of the covariance matrix in Eqs. 3 and 4 capture consumers’ initial uncertainty with the national and store brands, respectively, while the off-diagonal elements capture the covariance of initial priors. Hence, these covariance terms can be conceptualized reflections of the degree of perceived fit between categories, since if consumers do not see the two product categories’ fit at all, they may also feel that the information embedded in the use experience of an umbrella brand in one category is irrelevant for that brand in the other category.

The random error terms associated with consumer latent attribute (quality) perceptions are distributed as

$$ {x_{{i{j_n}tm}}}\sim N\left( {0,\sigma_{{{x_n}m}}^2} \right) $$
(5)
$$ {x_{{i{j_s}tm}}}\sim N\left( {0,\sigma_{{{x_s}m}}^2} \right) $$
(6)

where \( \sigma_{{{x_n}m}}^2 \) and \( \sigma_{{{x_s}m}}^2 \) refer to the experience variability for national brands and store brands, respectively, in category m. The experience variabilities are allowed to be different across categories. We assume that \( {x_{{i{j_n}tm}}} \) is i.i.d. across consumers, national brands, and time periods and that \( {x_{{i{j_s}tm}}} \) is i.i.d. across consumers, store brands, and time periods in each category. The experience variabilities capture the noisiness of information contained in use experience. Since national brands may achieve higher quality standardization, it might be expected that \( \sigma_{{{x_n}m}}^2 < \sigma_{{{x_s}m}}^2 \). Because consumers perform Bayesian updating, their expectations of the quality can be described as follows where the operator E ti (⋅) denotes the expectation operator, given the information consumer i has at time t:

$$ {E_{{ti}}}\left[ \begin{gathered} {A_{{{j_n}1}}} \hfill \\ {A_{{{j_n}2}}} \hfill \\ \vdots \hfill \\ {A_{{{j_n}M}}} \hfill \\ \end{gathered} \right] = \left[ \begin{gathered} {{\bar{A}}_{{{j_n}1}}} \hfill \\ {{\bar{A}}_{{{j_n}2}}} \hfill \\ \vdots \hfill \\ {{\bar{A}}_{{{j_n}M}}} \hfill \\ \end{gathered} \right] + \left[ \begin{gathered} {z_{{i{j_n}t1}}} \hfill \\ {z_{{i{j_n}t2}}} \hfill \\ \vdots \hfill \\ {z_{{i{j_n}tM}}} \hfill \\ \end{gathered} \right],\quad \left[ \begin{gathered} {z_{{i{j_n}t1}}} \hfill \\ {z_{{i{j_n}t2}}} \hfill \\ \vdots \hfill \\ {z_{{i{j_n}tM}}} \hfill \\ \end{gathered} \right]\sim N\left( {\left[ \begin{gathered} 0 \hfill \\ 0 \hfill \\ \vdots \hfill \\ 0 \hfill \\ \end{gathered} \right],\left[ {\begin{array}{*{20}{c}} \begin{gathered} \sigma_{{{A_n}i{j_n}t1}}^2 \hfill \\ {\pi_{{{A_n}i{j_n}t1,2}}} \hfill \\ \vdots \hfill \\ {\pi_{{{A_n}i{j_n}t1,M}}} \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_n}i{j_n}t1,2}}} \hfill \\ \sigma_{{{A_n}i{j_n}t2}}^2 \hfill \\ \vdots \hfill \\ {\pi_{{{A_n}i{j_n}t2,M}}} \hfill \\ \end{gathered} & \begin{gathered} \cdots \hfill \\ \cdots \hfill \\ \ddots \hfill \\ \cdots \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_n}i{j_n}t1,M}}} \hfill \\ {\pi_{{{A_n}i{j_n}t2,M}}} \hfill \\ \vdots \hfill \\ \sigma_{{{A_n}i{j_n}tM}}^2 \hfill \\ \end{gathered} \\ \end{array} } \right]} \right) $$
(7)
$$ {E_{{ti}}}\left[ \begin{gathered} {A_{{{j_s}1}}} \hfill \\ {A_{{{j_s}2}}} \hfill \\ \vdots \hfill \\ {A_{{{j_s}M}}} \hfill \\ \end{gathered} \right] = \left[ \begin{gathered} {{\bar{A}}_{{{j_s}1}}} \hfill \\ {{\bar{A}}_{{{j_s}2}}} \hfill \\ \vdots \hfill \\ {{\bar{A}}_{{{j_s}M}}} \hfill \\ \end{gathered} \right] + \left[ \begin{gathered} {z_{{i{j_s}t1}}} \hfill \\ {z_{{i{j_s}t2}}} \hfill \\ \vdots \hfill \\ {z_{{i{j_s}tM}}} \hfill \\ \end{gathered} \right],\quad \quad \left[ \begin{gathered} {z_{{i{j_s}t1}}} \hfill \\ {z_{{i{j_s}t2}}} \hfill \\ \vdots \hfill \\ {z_{{i{j_s}tM}}} \hfill \\ \end{gathered} \right]\sim N\left( {\left[ \begin{gathered} 0 \hfill \\ 0 \hfill \\ \vdots \hfill \\ 0 \hfill \\ \end{gathered} \right],\left[ {\begin{array}{*{20}{c}} \begin{gathered} \sigma_{{{A_s}i{j_s}t1}}^2 \hfill \\ {\pi_{{{A_s}i{j_s}t1,2}}} \hfill \\ \vdots \hfill \\ {\pi_{{{A_s}i{j_s}t1,M}}} \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_s}i{j_s}t1,2}}} \hfill \\ \sigma_{{{A_s}i{j_s}t2}}^2 \hfill \\ \vdots \hfill \\ {\pi_{{{A_s}i{j_s}t2,M}}} \hfill \\ \end{gathered} & \begin{gathered} \cdots \hfill \\ \cdots \hfill \\ \ddots \hfill \\ \cdots \hfill \\ \end{gathered} & \begin{gathered} {\pi_{{{A_s}i{j_s}t1,M}}} \hfill \\ {\pi_{{{A_s}i{j_s}t2,M}}} \hfill \\ \vdots \hfill \\ \sigma_{{{A_s}i{j_s}tM}}^2 \hfill \\ \end{gathered} \\ \end{array} } \right]} \right) $$
(8)

Here, \( {z_{{i{j_n}tm}}} \) and \( {z_{{i{j_s}tm}}} \) denote consumer i’s expectation errors at time t in category m for national brand j n and store brand j s , respectively.

The variances and the covariance of consumer quality beliefs (or, expectation errors) for national brand j n at any time t are as follows:

$$ \sigma_{{Ai{j_n}tm}}^2 = E\left[ {{{\left( {{A_{{{j_n}m}}} - {E_{{ti}}}\left[ {{A_{{{j_n}m}}}} \right]} \right)}^2}} \right],\,{\text{m}} = 1,2, \ldots, \,{\text{M}} $$
(9)
$$ {\pi_{{Ai{j_n}tm,k}}} = E\left[ {\left( {{A_{{{j_n}m}}} - {E_{{ti}}}\left[ {{A_{{{j_n}m}}}} \right]} \right)\left( {{A_{{{j_n}k}}} - {E_{{ti}}}\left[ {{A_{{{j_n}k}}}} \right]} \right)} \right],\quad \quad {\text{k}} \ne {\text{m}},{\text{ k }} = { 1},{2}, \ldots, {\text{ M}} $$
(10)

Similarly, those for store brand j s at any time t are

$$ \sigma_{{Ai{j_s}tm}}^2 = E\left[ {{{\left( {{A_{{{j_s}m}}} - {E_{{ti}}}\left[ {{A_{{{j_s}m}}}} \right]} \right)}^2}} \right],\quad \quad {\text{m }} = { 1},{2}, \ldots, {\text{ M}} $$
(11)
$$ {\pi_{{Ai{j_s}tm,k}}} = E\left[ {\left( {{A_{{{j_s}m}}} - {E_{{ti}}}\left[ {{A_{{{j_s}m}}}} \right]} \right)\left( {{A_{{{j_s}k}}} - {E_{{ti}}}\left[ {{A_{{{j_s}k}}}} \right]} \right)} \right],\quad \quad {\text{k}} \ne {\text{m}},{\text{ k }} = { 1},{2}, \ldots, {\text{ M}} $$
(12)

\( \sigma_{{Ai{j_n}tm}}^2 \) and \( \sigma_{{Ai{j_s}tm}}^2 \) are the variance of consumer i’s quality beliefs associated with national and store brands, whereas \( {\pi_{{Ai{j_n}tm,k}}} \) and \( {\pi_{{Ai{j_s}tm,k}}} \) depict the covariance between the quality levels in categories m and k as perceived by consumer i given this consumer’s information at time t.

According to the Bayesian updating rule as in Erdem (1998) and Erdem et al. (2004),

$$ {E_{{ti}}}\left[ {{A_{{{j_n}m}}}} \right] = {E_{{t - 1,i}}}\left[ {{A_{{{j_n}m}}}} \right] + {\beta_{{mi{j_n}tm}}}\left( {{X_{{i{j_n}tm}}} - {E_{{t - 1,i}}}\left[ {{X_{{i{j_n}tm}}}} \right]} \right) + \sum\limits_{\begin{subarray}{l}_{k = 1} \\ _{k \ne m} \end{subarray} }^M {{\beta_{{k,i{j_n}t,m}}}\left( {{X_{{i{j_n}t,k}}} - {E_{{t - 1,i}}}\left[ {{X_{{i{j_n}t,k}}}} \right]} \right)} $$
(13)
$$ {E_{{ti}}}\left[ {{A_{{{j_s}m}}}} \right] = {E_{{t - 1,i}}}\left[ {{A_{{{j_s}m}}}} \right] + {\beta_{{mi{j_s}tm}}}\left( {{X_{{i{j_s}tm}}} - {E_{{t - 1,i}}}\left[ {{X_{{i{j_s}tm}}}} \right]} \right) + \sum\limits_{\begin{subarray} _{k = 1} \\_{k \ne m} \end{subarray} }^M {{\beta_{{k,i{j_s}t,m}}}\left( {{X_{{i{j_s}t,k}}} - {E_{{t - 1,i}}}\left[ {{X_{{i{j_s}t,k}}}} \right]} \right)} $$
(14)

where the \( {\beta_{{ci{j_n}tm}}} \) and \( {\beta_{{ci{j_s}tm}}} \), c = 1, 2,…, M, i = 1, 2,…,I, j = 1, 2,…, J, t = 1, 2,…, T, m = 1, 2,…,M, are the Kalman gain coefficients obtained from the Kalman filtering algorithm.

It is clear from Eqs. 13 to 14 that \( {\beta_{{ci{j_n}tm}}} \) and \( {\beta_{{ci{j_s}tm}}} \) represent the weight attached to the cth piece of information with respect to brand j n and j s , respectively. When the precision of the cth piece of information for national brands is higher, \( {\beta_{{ci{j_n}tm}}} \) is larger. Equations 13 and 14 allow consumers to update their expectations about quality after gaining use experience with the same umbrella national or store brand in another category.

Consumer i also updates the variances\( \left( {\sigma_{{Ai{j_n}tm}}^2,\sigma_{{Ai{j_s}tm}}^2} \right) \)and the covariances \( \left( {{\pi_{{Ai{j_n}tm,k}}},{\pi_{{Ai{j_s}tm,k}}}} \right) \)of the quality levels for national brand j n and store brand j s in category m at time t. As in the Technical Appendix in Erdem (1998),

$$ \sigma_{{Ai{j_n}tm}}^2 = \left( {1 - {\beta_{{mi{j_n}tm}}}} \right)\sigma_{{Ai{j_n}t - 1,m}}^2 - \sum\limits_{\begin{subarray}{l} k = 1 \\ k \ne m \end{subarray} }^M {{\beta_{{k,i{j_n}t,m}}}{\pi_{{Ai{j_n}t - 1,m,k}}}}, {\text{ m }} = { 1},{2}, \ldots, {\text{ M}}, {\text{k }} = {1},{2}, \ldots, {\text{ M}} $$
(15)
$$ \sigma_{{Ai{j_s}tm}}^2 = \left( {1 - {\beta_{{mi{j_s}tm}}}} \right)\sigma_{{Ai{j_s}t - 1,m}}^2 - \sum\limits_{\begin{subarray} _{k = 1} \\ _{k \ne m} \end{subarray} ^M} {{\beta_{{k,i{j_s}t,m}}}{\pi_{{Ai{j_s}t - 1,m,k}}}} $$
(16)

and

$$ {\pi_{{Ai{j_n}t,m,k}}} = \left( {1 - {\beta_{{mi{j_n}tm}}}} \right){\pi_{{Ai{j_n}t - 1,m,k}}} - \sum\limits_{\begin{subarray}_{k = 1} \\_{k \ne m} \end{subarray} }^M {{\beta_{{k,i{j_n}t,m}}}{\sigma_{{Ai{j_n}t - 1,m,k}}}^2} $$
(17)
$$ {\pi_{{Ai{j_s}t,m,k}}} = \left( {1 - {\beta_{{mi{j_s}tm}}}} \right){\pi_{{Ai{j_s}t - 1,m,k}}} - \sum\limits_{\begin{subarray}_{k = 1} \\_{k \ne m} \end{subarray} }^M {{\beta_{{k,i{j_s}t,m}}}{\sigma_{{Ai{j_s}t - 1,m,k}}}^2} $$
(18)

In our model, we denote the mean (perceived) quality levels as A and perceived risk (variance of quality beliefs) as \( \sigma_A^2 \) (Erdem 1998; Erdem and Keane 1996; Erdem et al. 2004).

Recall from Eqs. 7 to 8 that \( {z_{{i{j_n}tm}}} \) and \( {z_{{i{j_s}tm}}} \) denote the consumer perception errors at time t for brand j n and brand j s in category m. Thus, \( {z_{{i{j_n}tm}}} = {E_{{ti}}}\left[ {{A_{{{j_n}m}}}} \right] - {A_{{{j_n}m}}} \) and \( {z_{{i{j_s}tm}}} = {E_{{ti}}}\left[ {{A_{{{j_s}m}}}} \right] - {A_{{{j_s}m}}} \). In addition, because \( {x_{{i{j_s}tm}}} \)has a mean of zero, \( {E_{{t - 1,i}}}\left[ {{X_{{i{j_n}tm}}}} \right] = {E_{{t - 1,i}}}\left[ {{A_{{{j_n}m}}}} \right] \) and \( {E_{{t - 1,i}}}\left[ {{X_{{i{j_n}tm}}}} \right] = {E_{{t - 1,i}}}\left[ {{A_{{{j_n}m}}}} \right] \). Given these expressions, Eqs. 13 and 14 can be written as follows:

$$ {z_{{i{j_n}tm}}} = {z_{{i{j_n}t - 1,m}}} + {\beta_{{mi{j_n}tm}}}\left( {{x_{{i{j_n}tm}}} - {z_{{i{j_n}t - 1,m}}}} \right) + \sum\limits_{\begin{subarray} _{k = 1} \\ _{k \ne m} \end{subarray} }^M {{\beta_{{k,i{j_n}t,m}}}\left( {{x_{{i{j_n}t,k}}} - {z_{{i{j_n}t - 1,k}}}} \right)} $$
(19)
$$ {z_{{i{j_s}tm}}} = {z_{{i{j_s}t - 1,m}}} + {\beta_{{mi{j_s}tm}}}\left( {{x_{{i{j_s}tm}}} - {z_{{i{j_s}t - 1,m}}}} \right) + \sum\limits_{\begin{subarray}_ {k = 1} \\ _{k \ne m} \end{subarray} }^M {{\beta_{{k,i{j_s}t,m}}}\left( {{x_{{i{j_s}t,k}}} - {z_{{i{j_s}t - 1,k}}}} \right)} $$
(20)

Equations 13 through 20 capture consumer learning about true product quality and the interdependence of quality expectations across products that share the same national or store umbrella brand names. The model implies that consumers who have some experience with brand j in one category have a lower perceived variance associated with brand j in other categories given a nonzero perceived covariance. Our model extends the research of Erdem (1998) and Erdem et al. (2004) by incorporating learning about both umbrella national and store brands across categories.

Expected utilities

We assume that U ijtm , the utility of consumer i from the purchase of brand j∊J at time t, where J is the set of brands that includes both national brands and store brands and is available in at least one of M categories, depends on the perceived quality level X ijtm and price P ijtm of brand j in category m that consumer i experiences at time t. Also, to allow for risk aversion, we allow U ijtm to depend on X ijtm nonlinearly. One such nonlinear utility function that captures risk aversion and risk taking is the quadratic form that follows:

$$ {U_{{ijtm}}} = {\alpha_{{im}}}{P_{{ijtm}}} + {w_{{im}}}{X_{{ijtm}}} + {w_{{0m}}}{\gamma_{{im}}}{X_{{ijtm}}}^2 + {\varepsilon_{{ijtm}}} $$
(21)

Where α im and w im are the price sensitivity and utility weight of the perceived quality levels for consumer i in category m, respectively, and are heterogeneous across consumers. w 0m is the mean utility quality weight, and γ im is the heterogeneous risk-aversion coefficient. If w 0m  > 0, then γ 0m  < 0 suggests risk aversion at the mean, where γ 0m is the mean of γ im . If w 0m  > 0 and γ 0m  = 0, this suggests risk neutrality at the mean, while w 0m  > 0 and γ 0m  > 0 imply risk-taking behavior at the mean. Finally, ε ijtm is a time varying stochastic component of utility, which is assumed to be i.i.d. normally distributed with mean zero and covariance Σ, and it captures random unobservable preference shocks that are known by the consumer but are unobservable by the analyst. We allow consumer heterogeneity in price sensitivities(α im ) to be correlated across categories.

$$ \left[ \begin{gathered} {\alpha_{{i1}}} \hfill \\ \vdots \hfill \\ {\alpha_{{iM}}} \hfill \\ \end{gathered} \right]\sim N\left( {\left[ \begin{gathered} {\alpha_{{01}}} \hfill \\ \vdots \hfill \\ {\alpha_{{0M}}} \hfill \\ \end{gathered} \right],\left[ {\begin{array}{*{20}{c}} {\sigma_{{\alpha 1}}^2} & {{\pi_{{\alpha, 1,2}}}} & \cdots & {{\pi_{{\alpha, 1,M}}}} \\ {{\pi_{{\alpha, 1,2}}}} & {\sigma_{{\alpha 2}}^2} & {} & {{\pi_{{\alpha, 2,M}}}} \\ \vdots & {} & \ddots & {} \\ {{\pi_{{\alpha, 1,M}}}} & {} & \cdots & {\sigma_{{\alpha M}}^2} \\ \end{array} } \right]} \right), $$
(22)

Also,

$$ {w_{{im}}}\sim N\left( {{w_{{0m}}},{\sigma_w}} \right),\,{\text{and}}\,{\gamma_{{im}}}\sim N\left( {{\gamma_{{0m}}},\sigma_{\gamma }^2} \right),\,{\text{m}} = 1,\,2, \ldots, \,{\text{M}} $$
(23)

Thus, utility weights(w im ) and risk sensitivities(γ im ) are allowed to be heterogeneous across consumers.

In addition, we specify i’s utility for no purchase (not purchasing any brand) in category m at time t as

$$ {U_{{ijtm}}} = TREN{D_m}t + {\varepsilon_{{0tm}}} $$
(24)

where TREND m is the time trend coefficient.Footnote 3 The utility function can be expressed as

$$ {U_{{ijtm}}} = {V_{{ijtm}}} + {\varepsilon_{{ijtm}}} $$
(25)

where

$$ {V_{{ijtm}}} = {\alpha_{{im}}}{P_{{ijtm}}} + {w_{{im}}}{X_{{ijtm}}} + {w_{{0m}}}{\gamma_{{im}}}{X_{{ijtm}}}^2 $$
(26)

Consumers form expectations about product quality and thus about the utility they derive from consuming a brand. Thus, the expected utility of consuming brand j in category m at time t for consumer i, given the information consumer i has at time t, is

$$ {E_{{ti}}}\left[ {{V_{{ijtm}}}} \right] = {\alpha_{{im}}}{P_{{ijtm}}} + {w_{{im}}}{E_{{ti}}}\left[ {{X_{{ijtm}}}} \right] + {w_{{0m}}}{\gamma_{{im}}}{E_{{ti}}}\left[ {{X_{{ijtm}}}^2} \right] $$
(27)

Rearranging Eq. 27, we present the following:

$$ {E_{{ti}}}\left[ {{V_{{ijtm}}}} \right] = {\alpha_{{im}}}{P_{{ijtm}}} + {w_{{im}}}{E_{{ti}}}\left[ {{X_{{ijtm}}}} \right] + {w_{{0m}}}{\gamma_{{im}}}{E_{{ti}}}{\left[ {{X_{{ijtm}}}} \right]^2} + {w_{{0m}}}{\gamma_{{im}}}{E_{{ti}}}\left[ {{{\left( {{X_{{ijtm}}} - {E_{{ti}}}\left[ {{X_{{ijtm}}}} \right]} \right)}^2}} \right] $$
(28)

Equation 28 can be written as follows:

$$ {E_{{ti}}}\left[ {{V_{{ijtm}}}} \right] = {\alpha_{{im}}}{P_{{ijtm}}} + {w_{{im}}}\left( {{A_{{jm}}} + {z_{{ijtm}}}} \right) + {w_{{0m}}}{\gamma_{{im}}}{\left( {{A_{{jm}}} + {z_{{ijtm}}}} \right)^2} + {w_{{0m}}}{\gamma_{{im}}}{\left( {{\sigma_{{Aijtm}}}^2 + \sigma_{{xm}}^2} \right)^2} $$
(29)

Note that the experience variabilities, \( \sigma_{{xm}}^2 \), are different for store brands and national brands as in Eqs. 5 and 6. The fourth term in Eq. 29 suggests that, in the case of risk aversion, uncertainty and perceived variance associated with the mean quality level of brand \( j\left( {\sigma_{{Aijtm}}^2} \right) \)will lower consumer expected utility. In this context, \( \sigma_{{Aijtm}}^2 + \sigma_{{xm}}^2 \) can be conceptualized as the perceived risk associated with a brand. As long as π A is positive, perceived variance will be lower and expected utility under risk aversion will be higher since the more experience a consumer has with brand j in any categories in which brand j is available, the more precise the information gained through experience is.

Choice probabilities

Let Y it =[j it1, j it2, …, j itM ] denote the index vector of the alternatives chosen by consumer i for the m categories at time t. The probability of observing a choice profile y it =[j 1, j 2, …, j M ] at time t conditional on β i and Σ is given by

$$ \Pr \left( {{Y_{{it}}} = {y_{{it}}}|{\beta_i},\sum } \right) = \Pr \left( {{U_{{i{j_1}t1}}} \geqslant \mathop{{\max }}\limits_{{p \in {C_1}}} {U_{{ipt1}}},{U_{{i{j_1}t2}}} \geqslant \mathop{{\max }}\limits_{{p \in {C_2}}} {U_{{ipt2}}}, \ldots, {U_{{i{j_1}tM}}} \geqslant \mathop{{\max }}\limits_{{p \in {C_M}}} {U_{{iptM}}}} \right) $$
(30)

where Cm is a choice set for category m with Jm elements. Given the multivariate normal distribution assumption for ε, we can rewrite Eq. 30 as follows:

$$ {\Pr_{{it}}}\left( {{Y_{{it}}} = {y_{{it}}}|{\beta_i},\sum } \right) = \int_{{{\varepsilon_{{it}}} \in {B_{{it}}}}} {\varphi \left( {{\varepsilon_{{it}}}} \right)d{\varepsilon_{{it}}}} $$
(31)

where Bit={εit s.t. \( {U_{{i{j_1}t1}}} \geqslant \mathop{{\max }}\limits_{{p \in {C_1}}} {U_{{ipt1}}},{U_{{i{j_1}t2}}} \geqslant \mathop{{\max }}\limits_{{p \in {C_2}}} {U_{{ipt2}}}, \ldots, {U_{{i{j_1}tM}}} \geqslant \mathop{{\max }}\limits_{{p \in {C_M}}} {U_{{iptM}}} \)} and φ(ε it ) is the joint normal density with mean zero and covariance Σ. Thus, we estimate a multivariate multinomial probit model, which allows random components to be correlated across the m categories.Footnote 4 The dimension of this integral is the total number of brands available across all m categories (\( = \sum\limits_{{k = 1}}^M {{J_k}} \)).

The likelihood for the household i is given by

$$ \prod\limits_{{t = 1}}^T {{{\Pr }_{{it}}}\left( {{y_{{it}}}|{\beta_i},\sum } \right)} $$
(32)

and the unconditional likelihood for an arbitrary consumer is obtained as

$$ \int {\int { \cdots \int {\prod\limits_{{t = 1}}^T {{{\Pr }_{{it}}}\left( {{y_{{it}}}|{\beta_i},\sum } \right)} } } } f\left( {{\beta_i}} \right)d{\beta_i} $$
(33)

Since this equation requires multidimensional integral, we use the GHK simulator, which is the most widely used probit simulator (Train 2003). As our estimation method, we adopt the method of simulated maximum likelihood (SML).

Identification

Similar to Erdem (1998) and Erdem et al. (2004), three identification restrictions are made. First, since only relative quality levels matter, we set

$$ \sum\limits_{{j = 1}}^J {{A_{{jm}}} = 0} $$
(34)

where the set of J includes both the store and national brands.

The second identification problem is the scale invariance in Eq. 26. To remove this indeterminacy, we normalize the distribution of w im by imposing the following requirement:

$$ {\sigma_w} = 1 $$
(35)

The last indeterminacy in the model is rotational invariance (Erdem 1998). To solve this problem, we fix the direction of the utility weights vector and the risk-aversion parameters matrix as follows:

$$ {w_{{im}}}\sim N\left( {{w_{{0m}}},1} \right),{\text{ and}}\;{\gamma_{{im}}}\sim N\left( {{\gamma_{{0m}}},\sigma_{\gamma }^2} \right) $$
(36)

Finally, we should note that empirical identification of learning models has been discussed in other papers in depth (e.g., Erdem et al. 2008). Here it may suffice to say that price parameters are empirically identified by variation in prices. The learning parameters (experience variabilities, prior uncertainty, etc.) are identified by patterns in switching behavior, conditional on past choices. Furthermore, the quadratic functional form of the utility function helps to separately identify the quality levels, weights, and risk coefficients.

Data

We used A.C. Nielsen’s household-level scanner panel data on detergent, ketchup, toilet paper, margarine, and canned tuna in three countries (Spain, the United Kingdom, and the United States). The Spain dataset includes household purchases at 89 stores across the country in 1999–2000. The U.K. dataset includes household purchases at 212 stores across the country in 1998–1999. Finally, the U.S. dataset includes household purchases at 119 stores in Atlanta and Chicago in 1998–1999.

The market share of store brands varies by product category and across countries. In general, store brands in consumer product categories are relatively stronger in European countries than in the United States. For example, the store brand market share for toilet paper in Spain and the United Kingdom exceeds 40%, while in the United States it is only 11% (Table 1). In addition, the store brand share for detergent is almost 10 times larger in Spain and the United Kingdom (23.8% and 27.4%) than in the United States (2.8%). Table 1 reports the market share and unit price for the top national and store brands in each category and country.

Table 1 Descriptive statistics of marketing variables

Four to six national brands in each category across countries, in combination with store brands, constitute 75–95% of the market in each country. The United Kingdom has three national umbrella brands: Family Choice and Nisa are available in four categories, and Heinz is available in two categories. The United States has no national umbrella brands available in at least two of the five categories in our dataset. In Spain, a couple of national umbrella brands exist, but the number of purchases for them is extremely small. Neither Spain nor the United States has “national” umbrella brands that account for a relatively high market share; however, store brands are available in all five categories in all three countries. Since multiple store brands are available, we lumped those that are available across the categories studied into one “store brand” for each market and treated it as an umbrella store brand in our analysis for each country. We included all the brands in Table 1 for the model estimation. Since the United Kingdom has national umbrella brands across different categories, we utilized the three national umbrella brands available in the United Kingdom to compare the difference in umbrella branding strategies between national and store umbrella brands in the United Kingdom only.

For the analysis, all data were aggregated to the week level. Since our model captures cross-category learning across the five product categories, the households with fewer than three purchases in any of the five categories were excluded from the analysis. We also excluded households that purchased brands not included in this study. This resulted in a sample of 922 households in Spain, 3,333 in the United Kingdom, and 324 in the United States. We kept all the 324 households in the United States for the model estimation. For Spain and the United Kingdom, we randomly selected 300 households to make the sample sizes computationally manageable and have similar sample sizes across countries.

Empirical results

Table 2 reports the goodness-of-fit statistics for the three models estimated: the MVMNP with correlated random normal errors but no learning, the model with learning but no cross-category learning, and our full model with cross-category learning. Our full model fits the data best in all three countries.

Table 2 Goodness-of-fit statistics by country

Tables 3, 4, 5, 6 and 7 list the parameter estimates, except for mean quality level estimates (listed in Tables 8, 9, 10, 11 and 12) and the covariance-variance matrices (listed in Tables 13, 14 and 15).

Table 3 Parameter estimates for the detergent category
Table 4 Parameter estimates for the ketchup category
Table 5 Parameter estimates for the toilet paper category
Table 6 Parameter estimates for the margarine category
Table 7 Parameter estimates for the tuna category
Table 8 Estimates of the mean quality levels for detergent
Table 9 Estimates of the mean quality levels for ketchup
Table 10 Estimates of the mean quality levels for toilet paper
Table 11 Estimates of the mean quality levels for margarine
Table 12 Estimates of the mean quality levels for tuna
Table 13 Covariance matrix of price sensitivity
Table 14 Covariance matrix of prior perceptions of quality for store brands
Table 15 Covariance matrix of prior perceptions of quality for national umbrella brands in the United Kingdom

The parameter estimates in Tables 3, 4, 5, 6 and 7 all have the expected signs in all three countries and across all product categories. The price sensitivities are negative, the quality weights are positive, and consumers are risk averse. Initial uncertainty exists, and in most cases initial uncertainty about store brands is higher than that of national brands. Use experience provides (noisy) information about product quality (please see the estimates of experience variability). In most cases, heterogeneity in price sensitivities and degrees of risk aversion also exist. A few noteworthy exceptions should be mentioned—namely, the risk aversion parameter is statistically insignificant in margarine in Spain, experience variability of both national and store brands is statistically insignificant in margarine in the United Kingdom, and the standard deviation of risk aversion is insignificant (indicating a lack of heterogeneity) in the United States in detergent, ketchup, margarine, and tuna. It is also worth noting that, although experience variability is consistently higher in the United States for store brands than for national brands across all five categories, the reverse is true in the United Kingdom for ketchup, toilet paper, and margarine and in Spain for toilet paper and margarine. This result already hints at the relative success of store brands in the United Kingdom and Spain: store brands provide consumers with more consistent experiences than national brands in the United Kingdom and Spain (at least in a number of categories).

Table 13 reports the variance-covariance matrix of price sensitivities (recall that we allowed price sensitivity’s heterogeneity distributions to be correlated in each country). Out of ten pairwise correlations, three in Spain, four in the United Kingdom, and five in the United States are significant.

Tables 14 and 15 report the variance and covariance matrices of prior quality perceptions for store and national umbrella brands, respectively. For store brands (Table 14), prior quality perceptions are correlated in three cases in Spain and the United Kingdom (i.e., tuna and margarine, ketchup and margarine, and ketchup and toilet paper) and four cases in the United States (i.e., tuna and margarine, tuna and ketchup, ketchup and toilet paper, toilet paper and detergent). In the case of national umbrella brands (Table 15), we have data from only the United Kingdom. For the Nisa brand, which is available in four product categories, three out of six pair-wise correlations are statistically significant (i.e., ketchup and detergent, detergent and toilet paper, and ketchup and toilet paper). For Heinz, prior quality perceptions are also correlated between the two categories in which Heinz is available: ketchup and tuna. Finally, Family Choice is available in four categories as well, but only one out of six possible correlations is statistically significant (i.e., between margarine and tuna).

A comparison of correlations among prior quality perceptions (covariances) between store and national brands in the United Kingdom (the only country in which we have national umbrella brands in multiple categories) reveals that the highest (and statistically significant) covariance occurs in the case of store brands between margarine and ketchup (0.639). The corresponding highest covariance for national brands is in the case of the umbrella brand Nisa and between ketchup and detergent (0.526). We do not observe large differences in the magnitudes of these covariances (correlated initial priors) for store versus national umbrella brands. These covariances indicate that, in the pairs of product categories, cross-category learning and association transfer occur from one category to the other. Interestingly, this transfer occurs even between food categories (ketchup) and non-food categories (toilet paper), although most correlations occur within the food categories. This is not surprising, given that perceived fit may be higher among such categories.

Table 16 reports several ratios of different parameter estimates to aid in our ability to make direct comparisons across countries (two parameter estimates are not directly comparable across countries due to the scale factor (Swait and Louviere 1993)). In ratio 1 (prior standard deviation of quality beliefs of store brands versus national brands), store brands are usually associated with higher initial uncertainty, especially in most categories in the United States. Ratio 2 is the ratio of experience variability of store brands to that of national brands; in this ratio, the United States generally has the highest numbers, indicating that its store brands relative to national brands provide consumers with less consistent experiences. Thus, noisiness of information of use experience is higher for store brands in the United States. In the United Kingdom, this ratio is either under 1 or close to 1, suggesting that store brands provide more consistent information or as consistent information (experiences) as national brands. The picture emerging from Spain is more similar to the results observed in the United Kingdom than in the United States. Ratio 3 compares risk coefficients to price sensitivity coefficients, indicating that in general U.S. consumers are more risk averse than price sensitive when compared to Spanish or U.K. consumers in four out of five categories.Footnote 5 Quality versus price sensitivity comparisons (ratio 4) show that U.K. consumers are more price sensitive than quality sensitive, while the opposite is true in Spain and especially the United States. Finally, ratio 5 captures the perceived quality differences between national and store brands, demonstrating that national brands are perceived to be of higher quality in all countries, except for detergent in Spain and margarine in the United Kingdom. These results are overall in line with the findings in Erdem et al. (2004). Thus, we replicate their results with an extended model that incorporates cross-category learning and coincidence as well as extended analysis that involves more categories.

Table 16 Ratios of parameter estimates for cross-country comparisons

Discussion, conclusions, and future research

We estimated a model of quality learning that allows priors to be correlated for both store and national umbrella brands. Our results indicate that consumer learning occurs across categories for both brands. We did not find any significant differences in cross-learning effects between store and national umbrella brands. By using an extended model with cross-category effects (cross-category learning, correlated price sensitivities, and correlated probit errors—that is, coincidence), we also replicated the findings of Erdem et al. (2004) that store brands in the United States provide less consistent consumer experiences and are subject to higher perceived risk compared to national brands, which is often not the case in Europe. Furthermore, risk attitudes and price and quality sensitivities are different in the United States than in Europe. Higher risk aversion in the United States, coupled with higher perceived risk of store brands, leads to more aversion for store brands in the United States than in Europe.

One important managerial implication of this study is that umbrella brands—whether national or store brands—need to provide consistent experiences within and across categories as the existence of cross-learning effects means potential brand dilution effects when consumers are not satisfied with their brand experiences. Our results also suggest that cross-learning effects are not automatic for umbrella brands as the prior quality perceptions are not correlated in certain cases. In most extensions the assumption of the firm is that quality associations will be transferred; therefore, firms need to understand the factors underlying the transfer of quality associations thoroughly. In addition, we find variance in cross-category learning of umbrella brands across different countries. Hence, firms familiar with cross-category learning patterns in one country or category should not presume that similar patterns occur in other markets, nor should they be overconfident about their experience in similar product categories when extending their brands to dissimilar product categories.

Our study opens up some opportunities for future research. The current study does not analyze the factors underlying consumers’ correlated priors about quality across different product categories for umbrella brands. Such correlations exist across some categories, but not between others; the strength of these associations varies by product category and country. Although perceived fit is deemed to be a critical factor in branding literature (Keller 2002), more research in the context of learning models is needed to study such factors underlying perceived fit and association transfer.