1 Introduction

A rich theoretical literature establishes equilibria involving mixed strategies in posted prices for firms in various settings, such as when search costs are present or consumers differ in their willingness to pay and mixed strategy pricing serves to price discriminate. These models of mixed pricing are typically motivated by empirical cases of price mixing either over space, or over time: the search literature going back to Stigler (1961) interprets e.g., geographic price dispersion as a realization of mixed strategies, while the sales literature beginning with Varian (1980) views occasional sales as realizations of mixed strategies. However, the advent of online marketplaces has made the idea of mixed pricing strategies feasible in real time: firms can play pricing strategies that had previously only been theorized. This paper describes and analyzes a case of a firm engaging in mixed pricing behavior for a significant subset of products in a $10B USD online marketplace.

The industry we study is office supplies. Previous academic work (Ashenfelter et al. 2006; Cabral 2003; Baker 1999; Warren-Boulton and Dalkir 2001) has investigated pricing practices in this retail market, primarily as they relate to concentration in local markets. For the past few decades, there were three nationwide office supply specialty chains in operation: Staples, Office Depot, and OfficeMax. On Sept 4, 1996, Staples and Office Depot announced a merger, although it was subsequently challenged and halted by the Federal Trade Commission (FTC) in 1997 out of a concern over substantially reduced competition between office supply superstores in various local markets where the firms directly compete against each other, without facing competitive pressure from unrelated rivals.Footnote 1 In early 2013, Office Depot and OfficeMax announced a merger, which was permitted in light of declining demand and increased external competition from substitute retail industries like discount stores, warehouse clubs, and online general merchandisers. Finally in 2015, Staples proposed a merger with the combined Office Depot-OfficeMax entity, which was officially challenged by the FTC on December 7th, 2015. A final ruling is still pending as of this writing.

Pricing by Staples, the largest player, has also been scrutinized in the popular press such as the Wall Street JournalFootnote 2 (WSJ) for reasons unrelated to concentration. The WSJ, for example, highlights Staples’ recent surreptitious tailoring of prices on its website based on its customers’ locations, in this case zip codes. Accordingly, the website randomizes between showing a potential buyer one of two prices for a given product - a “high” price and a “low” price - with different probabilities in different zip codes. Consumers visiting the website are not directly aware of which price they are seeing, and computer science research indicates it is quite difficult to detect such price discrimination in online markets more broadly (Mikians et al. 2012). This strategy may inadvertently harm or help consumers who happen to live in areas with particular market conditions by altering prices they face. This may improve efficiency when prices are lowered for certain consumers, or may simply reallocate consumer surplus to firm profits if prices rise. The net effect of this is an empirical object that depends on competitors and local market demand conditions.

Such pricing strategies are to be expected going forward in online marketplaces. However, for policy makers, such price discrimination strategies also have implications for merger analysis. As we show below, one particular local market condition that the firm explicitly takes into account is the consumer’s distance to the nearest of its rivals’ brick-and-mortar outlets. If two competitors merge and reduce the number of offline locations they operate, the distributional effect of this form of price discrimination will thus also change. For example, the combined OfficeMax-Office Depot corporation announced in 2014 that of its 1,900 retail locations, at least 400 would close by the end of 2016, which would then alter prices faced by online consumers in affected markets.Footnote 3 If Staples were then to acquire the merged rival, the change in competition could have a significant effect on online pricing.

In this paper, we introduce three datasets on pricing in office supplies: first, a “breadth” sample of the high-price display frequency for a single product across all zip codes. Second, a “depth” sample of the same pricing frequency, together with the high and low prices, across a large set of products, but for a small set of zip codes. Finally, an “offline” sample of in-store prices for a small set of products at the three major chains in 17 different cities, for a total of 85 store locations. We document trends in the first dataset, such as how consumers who are close to a rival store location are more likely to see low prices compared to consumers in zip codes that are far from rival chains. This illustrates determinants of Staples’ choice of price mixing probabilities. To show that price mixing extends beyond the specific product covered by the “breadth” sample and that the choice between the high and the low price is indeed random, as assumed in the first analysis, we turn to the second data set. Here we can investigate price mixing across a large set of products, but - due to the small number of local markets - cannot speak to determinants of the observed mixing frequency. We test directly for randomization behavior by Staples, leveraging the detailed timing of pricing at the second level, and show that there are no systematic time trends nor serial correlation in the observed prices. We also show that the pricing strategy is statistically identical among the set of products that exhibit randomization. We show in the third dataset that variation in in-store pricing is small and uncorrelated with variation in online pricing.

We then propose a model of price discrimination that rationalizes firm randomization. Our model has “loyal” and “disloyal” consumers, as well as a search parameter among “disloyal” types. We calibrate certain demand parameters and estimate the remaining structural parameters relating to the randomization strategy using the “breadth” data, as it allows us to exploit rich geographic variation at the zip code level. Our estimates show that the implied share of loyal customers for Staples.com is increasing in the distance to the nearest rival chain, and that disloyal types exit the market more quickly as the number of establishments in the zip code increases. Our estimates imply small but reasonable distance elasticities.

We then proceed with an analysis of how such online pricing behavior changes after a merger of Staples’ rivals. We use data on post-integration store closures of the merged Office Depot-OfficeMax entity to re-estimate optimal price discrimination for Staples.com, and find that average prices displayed increase by a modest amount compared to pre-merger levels. This can be explained by the fact that rival store closures have the primary effect of removing duplication in the chains’ footprints. However, this is important to highlight as merger review in the industry focused on in-store pricing changes from an increase in market power and elasticities of substitution between the three specialty players and unrelated online competitors, but not on within-firm multi-channel pricing.

This paper revisits an extensive theoretic literature on search and sales, as well as growing empirical search and sales literatures. It makes a contribution to the literature on testing mixed strategies (Walker and Wooders 2001; Chiappori et al. 2002; Wang 2009), and also highlights the existence of online price discrimination and its interaction with offline prices. A drawback to our analysis is that it is limited to a single firm’s pricing, although the difficulty of studying such pricing strategies and our expectation that the sophistication of such strategies will grow going forward highlight the value in exploring this setting. We review the related literature in more detail below.

The paper is organized as follows. Section 2 reviews related literature. Section 3 describes the setting and current merger activity. Section 4 introduces our datasets. Section 5 tests for randomization. Section 6 analyzes the relation between online price mixing and the offline market structure, and rival prices more generally. Section 7 describes and estimates a model of firm pricing behavior. Section 8 concludes.

2 Literature on mixed pricing

There is a rich theory literature that explores firms using mixed strategies in equilibrium. We focus on strategies for posted price, recognizing that the auction literature has studied mixed strategies extensively in the context of bids.

A first stream of literature, going back to Stigler (1961), explores price dispersion and the implication for consumer search. Salop and Stiglitz (1977) and Salop (1977) considered how heterogeneity in search costs can lead to price dispersion as a tool for price discrimination. Subsequent work such as Burdett and Judd (1983) and Stahl (1989) modeled consumer search behavior more closely. Recent contributions have revisited competitive aspects (Janssen and Moraga-González 2004) as well as informational settings (Akin and Platt 2013). There have been a number of important empirical contributions that have used price patterns to identify search costs, such as in Hong and Shum (2006) or Hortaçsu and Syverson (2004), and the relative importance of search in observed prices (Wildenbeest 2011).

A second stream of literature on mixed strategies in prices began with Varian (1980) who shows that in competing over “informed” and “uninformed” consumers, firms employ mixed strategies. Subsequent papers such as Sobel (1984) and Conlisk et al. (1984) further the intuition that occasionally offering low prices allows a firm to “sweep” the low types of customers from a market. Later works such as Raju et al. (1990), Rao (1991) and Sinitsyn (2008) refined models to showing which types of firms should have low prices more often and how many different prices they needed to charge. A recent contribution, Chevalier and Kashyap (2012), examines both theoretically and empirically price variation across a retailer’s product line, highlighting the interplay between price dispersion and substitution patterns between products. Significant empirical contributions include Pesendorfer (2002) and Hendel and Nevo (2013), which show that intertemporal price discrimination is an effective strategy to differentiate between types in storable goods settings, while Hendel and Nevo (2006) highlights the impact of ignoring such behavior in traditional demand analysis.

In the sales literature, several papers have considered the related decision of uniform (“Every Day Low Pricing”, or EDLP) versus occasional promotion (“HiLo”) pricing, as in Lal and Rao (1997) and Ho et al. (1998); Fassnacht and El Husseini (2013) offers a review. Other work has explored sale pricing through the lens of reference prices, such as in Heidhues and Koszegi (2008), where the authors show that a monopolist selling to risk-averse consumers can maximize profits by varying between “high” and “low” prices.

Empirical work in mixed strategies has primarily focused on testing for mixed strategies, particularly in sports (Chiappori et al. 2002; Walker and Wooders 2001) but also in conduct (Wang 2009). These papers test assumptions of randomized play, including the predictive power of covariates and the absence of serial correlation. In addition, they test model predictions of equality of payoffs. We are not aware of any research looking at randomization of posted prices in real time.

We take no stand on which particular thread of literature explains the behavior we document. Instead, we focus on testing for the behavior and documenting its patterns across products and its relation to offline prices. We build a model inspired by the “sweeping” price discrimination mechanism of Sobel (1984) to gain a sense of the magnitude of potential pricing effects of market structure, in the context of the past and proposed mergers.

3 Setting

As of 2013, there were 9,000 office supply specialty stores in the US. Most of these are small, standalone retailers, such as stationary stores, but 42 % are owned and operated by one of the three large supply super store chains, Office Depot, OfficeMax and Staples, whose combined estimated market share is upward of 70 %. The majority of the firms’ revenue derives from the sale of school and office supplies (45 %) and office machines (29 %), but they also sell computers, furniture and other supplies. Among them, Staples is the market leader, operating 1,846 outlets in 2013, followed by Office Depot with 1,089 and OfficeMax with 823 outlets based on the chain affiliation of outlets prior to the companies’ merger in late 2013 (see ibisworld.com; firm counts derived from companies’ 2013 annual reports.)

Similar to other retail industries, the recession of the late 2000s resulted in sharp declines in demand. The retail chains responded to this, and the additional challenge of increasing external competition from substitute industries like discount stores, warehouse clubs, and online retailers by successfully consolidating. In 2013, Office Depot and OfficeMax proposed and obtained approval to merge; in 2015 Staples and the newly formed entity proposed a further merger that is currently being challenged by the FTC. In its December 7, 2015 challenge of the merger, the FTC directly alleged that it had “reason to believe that the proposed merger between Staples and Office Depot is likely to eliminate beneficial competition that large companies rely on to reduce the costs of office supplies”.Footnote 4

This follows the earlier proposed merger in 1996 by Staples and Office Depot, a case that the FTC successfully challenged based on concerns and econometric evidence that the merger would substantially reduce competition in the retail sale of office supplies in various local markets throughout the country where the firms competed directly against each other. The FTC’s case at the time applied a narrow market definition that focused on office supply superstores as a separate market from other office supply specialty stores or general big box retailers. Within local MSA markets, the FTC’s evidence suggested that (a) Staples and Office Depot had competed on price before, and that (b) in-store pricing was heavily influenced by the number of other office superstores competing in the local area, where fewer stores led to higher prices. Ashenfelter et al. (2006) present some of the detail behind the FTC’s evidence, and Manuszak and Moul (2008) develop a two-stage model of pricing for three representative products, using local market structure as a proxy for competition and instrumenting for the latter nonlinearly using the firms’ spatial expansion pattern relative to their headquarters as determinants of equilibrium market structure outcomes. Both show evidence in clear support of the FTC’s arguments; we rely on their pricing models to identify demand shifters for the empirical specifications we present below.

The FTC’s approval of the later Office Depot and OfficeMax merger concluded that in the intervening years, the market for consumer office supplies had changed significantly, as a result of which the merger of the two office supply superstore chains was unlikely to substantially lessen competition in consumable office supplies sales. The FTC’s revised market definition included non-office supply superstores such as Wal-Mart and Target, along with club stores like Costco and Sam’s Club, and online retailers of office supplies, most notably Amazon.com, that had grown quickly and significantly since the time of the earlier case and now compete with office supply superstores due to larger office supply product lines. The FTC separately investigated the potential impact of the proposed merger on the sale of consumable office supplies to large regional or national business customers on a contract basis, a market with fewer potential suppliers capable of meeting their large purchasing requirements, but concluded that the merger was unlikely to substantially lessen competition in the contract channel.Footnote 5 The office supply contract market is one focus of the ongoing investigation of the proposed merger of Staples and the combined Office Depot / OfficeMax entity; the 2013 decision noted that the merged company would still face significant competition from Staples for this business.

4 Data and descriptive evidence

Our analysis relies on three datasets. The first one derives from the WSJ’s investigation into Staples’ pricing practices.Footnote 6 As part of a larger investigation into online retailers’ price discrimination practices based for example on a customer’s physical IP address location, the WSJ showcased Staples.com as a typical example: during the period of the WSJ’s study from September to December 2012, the report found that Staples.com’s displayed price depended on the zip code of the computer’s IP address location. Zip codes got either a “high” price or a “low” price with some probability, with high and low prices differing by roughly 10 % based on a large set of representative products (these included more than 1,000 different products in 10 selected zip codes - typical examples are a Swingline Stapler, a 12-pack of BIC Rollerball Pens, a 15-pack of Staples-Brand Mailing Tubes, a 24-pack of Intertape Masking Tape, and a SnapSafe-Brand Safe). To demonstrate the pricing practice, the WSJ published on its website a dataset it compiled during its investigation that recorded for each of approximately 42,000 US zip codes the fraction of times Staples.com displayed the high price for the Swingline stapler out of 20 simulated visits to the Staples.com website from a potential customer in that zip code. We denote this data set as the “Breadth” sample in our analysis below, which contains, for each zip code the fraction of times out of the 20 shopping occasions a simulated visitor saw the high price appear. We denote this high-price score as f ∈ {0, 0.05, 0.1, ..., 0.95, 1}. Figure 1 displays the distribution of the frequency with which Staples displays the high price for the stapler in 29,545 standard continental US zip codes. For approximately 7.5 % of these zip codes, Staples never displays the high price; similarly, for approximately 5 % of zip codes, it always displays the high price. The remaining distribution is bimodal, with approximately 35 % of zip codes seeing the high price between 5-15 % of the time, while Staples.com displays the high price between 65–75 % of the time in around 23 % of zip codes.

Fig. 1
figure 1

Empirical Probability Distribution of High Price. Figure displays empirical probability distribution of observing the high price for a given product in a given zip code on a given day. Distribution constructed across products and zip codes

We complement the “Breadth” sample from the WSJ with a similar data set that we assembled subsequently to the WSJ’s reporting for a larger set of products. We used the 2012–2013 comScore Web Behavior data base to identify a set of 268 products that comScore customers purchased from Staples.com immediately prior to our data collection effort. We constructed a web scraping routine that began by loading the Staples.com web page, which created a tracking “cookie”. This cookie contained a field called “zip code”, which stored a value that we believe was estimated based on the machine’s IP address. We overwrote the zip code field with the targeted zip code and then re-loaded the page and saved the resulting HTML file. Figure 7 shows the contents of a typical “cookie”, with the zip code field located towards the end. We used this routine to query Staples.com eight times over a period of 4 weeks in April and May of 2013 to record prices for the comScore sample of products from a set of 21 zip codes representing each high-price score from the “Breadth” sample, for a total of over 45,000 queries.Footnote 7 We then parsed the resulting html files using regular expressions in Perl to extract the price that was displayed, as well as whether or not the price was specifically marked as a “sale” price. Among the 268 products, 135 show one price across the set of test zip codes, 130 show two prices, 3 show three prices. We drop any scrape results that explicitly declare that the product is offered at a “sale” price. Focusing on the products for which we observe two distinct non-sale prices during the data collection period, we observe the high price being displayed in 35.5 % of the 8 price collection rounds. The relative frequency with which the high price is shown in a zip code among our 8 price scrapes is correlated with the WSJ’s distribution in Fig. 1, although imperfectly due to the different number of scrapes. Similar to the WSJ, we observe a discount from the high to the low price with two modes - 5 % and 10 % - with an average price difference of 8 % and a median of 8.2 %. There are few systematic patterns in which of the 268 products are sold using mixed pricing, as opposed to a single uniform price. Correlating the incidence of price mixing with product attributes - the product’s category (office supplies, furniture, electronics excluding computers, computers, etc.), whether or not the product is a Staples’ store brand, and the price point of the product - suggests that mixing is concentrated among office supply products, but controlling for product category, there are no statistically significant differences between the types of products that have uniform prices versus mixed prices. We denote the resulting sample as the “Depth sample”.

Lastly, we collect information on offline prices across a number of local markets to be able to investigate the extent to which Staples’ online prices relate to its own in-store prices or respond to its rivals’ off-line prices. We hired mystery shoppers from the website TaskRabbit.com, an online marketplace that allows users to outsource small jobs to others in their local area, in November 2014. At the time, TaskRabbit had a presence in 18 cities across the continental US. For each of the cities, we assembled a sample of six stores – two for each office supply super store chain – that differed in Staples.com’s high-price frequency score based on the store’s zip code, but that were sufficiently close to each other to limit driving time by the mystery shoppers. We provided shoppers with a standardized set of five products (Post-Its, a stapler, Scotch tape, pens, and address labels), including pictures and SKUs, and instructed them to record the price for each of the products in each of the stores. Separately, we asked them to record the price for the products on each of the chain’s online shopping platform from their own home zip code. The final data collection effort resulted in price information from 85 stores. The store sample is smaller than the potential sample of 108 stores (18 markets with six stores each) for a number of reasons. First, we were unable to collect price information in Denver, CO, due to bad weather; second, not all three chains operate in all markets, for example, Office Depot did not have presence in Boston, MA prior to its merger with OfficeMax; and third, as the data collection took place shortly after the Office Depot/OfficeMax merger, the chains had already begun to re-optimize their merged store network and closed stores in some of the markets. This resulted both in us sending the mystery shoppers to fewer stores in those markets, and them reporting back store closures among the stores we sent them to for six out of our original set of stores. We have no reason to believe that the store reconfigurations or the chains’ aggregate market exit patterns are systematically related to Staples.com’s online pricing strategy, as we discuss below. For the average store in the offline pricing sample, denoted as “Offline” below, we observe 4.5 price observations, reflecting that despite our attempts to identify popular, standardized products, not all of the products were available in all stores at the time of the data collection. We also dropped price data reported by one of the mystery shoppers for one of the chains since her reported price information indicated that she had recorded prices for incorrect products. We further dropped price information for products that the mystery shoppers reported to be on a general sale; this occurred for 8.61 % of store-price observations. The resulting sample thus consists of 410 price observations from 85 stores in 17 markets.

We add various supplemental data to these three main pricing datasets. First, from ReferenceUSA.com, we construct a panel of each of three chains’ store locations across the continental US as of the end of 2012, 2013, 2014, and mid-2015. We use the latest data primarily to investigate in counterfactual simulations how Staples.com’s online pricing strategy would respond to an alternative offline store network that reflects post-merger reconfigurations. We also use ReferenceUSA to collect information on the store networks of the more distant competitors identified by the FTC’s investigation of the Office Depot/OfficeMax merger: we compile store locations for Walmart, Target, Costco, and Sam’s Club as of the date of our “Breadth” sample, the end of 2012. We use the detailed longitude and latitude information provided by ReferenceUSA for each store’s location, together with population-weighted centroids for each continental US zip code from the Missouri Census Data Center’s MABLE/Geocorr2K data base to calculate straight-line distance between each zip code and store location. From this master set, we then retain the distance between each zip code and the closest store location for each competitor.

Lastly, we use the Census 2010 to collect demographic information at the zip code level, such as income, wages, and presence of businesses.

4.1 Descriptive evidence

Using the “Breadth” sample, we can explore patterns in the pricing behavior. Table 1 shows descriptive regressions of the probability of getting a “high” price in a zip code on i) the distance from that zip code to the nearest Staples store location, ii) the distance to the nearest OfficeMax or Office Depot location, and iii) the natural log of the number of establishments taken from the census, as a proxy for local demand conditions and competitiveness. Store locations are likely to be endogenous to both rival locations and local demand, and we see in the first three columns that all three variables are related to the probability of being shown a “high” price. When we add all variables at once, in the final column, we see that the probability of seeing a high price is increasing in the distance to rival stores, and decreasing in the log of local establishments. In unreported regressions, we find that the distance to other less-direct competitors including Target, Costco, and Sam’s Club is not related to the probability of “high” prices in a zip code.

Table 1 Descriptive regressions

In Fig. 2, we show the relationship between rival distance and high prices graphically. The figure presents two binned scatterplots of the probability of being shown a “high” price in a given zip code on the distance to the nearest rival store, where rivals are defined as OfficeMax and Office Depot. The first panel is the raw data, while the second is residualized by the log of local establishments. In both we see an increasing relationship between distance to rival and the probability of being shown a “high” price. The first panel seems to indicate that the effect is a step-function, in that only rivals up to a distance of roughly 20 miles are relevant, although the second panel shows a more linear relationship, consistent with the fact that rival locations and local demand are confounded in the first.

Fig. 2
figure 2

Descriptive analysis of online prices. Figures are “binned scatterplots”, constructed by residualizing the data against all listed controls and adding back means, before plotting means of the data within bins. Regression lines are based on all underlying data, not only bin values

As we hope to exploit variation in distances from a zip code to Staples and its rivals, it is important to show that the chains are not perfectly co-located with one another. Table 2 shows that while there is positive correlation in location choices of the three chains, there is still significant variation in their location decisions.

Table 2 Joint distribution of zip code distance to closest Staples location and closest rival location

One concern about this exercise is that Staples.com could simply be undertaking some experimentation with price to learn demand. While we cannot conclusively disprove it, we believe this is not the case. First, if it were simple A/B testing to learn demand responses, the level of randomization should not correlate with market characteristics. If instead it were a more recent machine learning algorithm to learn demand, such as a multi-arm bandit “explore-exploit” algorithm, it should be expected to correlate with prior beliefs on correct prices; however it would not be necessary to experiment on the entire country, and we would expect systematic patterns in terms of which products are randomized. Therefore, even though we cannot rule out the possibility of experimentation, it appears less likely. Even if this was indeed a form of machine learning experimentation, most of the contributions of this paper, including documenting how online prices correlate with offline market conditions and the interplay between online and offline pricing remain.

5 Testing for randomization

Using our “Depth” sample of website scrapes, we can test for whether or not the observed pricing behavior is consistent with randomization. As discussed above, different zip codes have different patterns in the randomization. In Table 3, we regress an indicator variable for seeing the “high” price in a given website price query on different sets of fixed effects. As can be seen in the bottom row of the table, product and date-of-scrape fixed effects have little explanatory power, while zip code fixed effects have a significant amount of power.

Table 3 What explains a “high” price?

For this pricing behavior to be considered random, we must also show that there is no systematic pattern with respect to timing. In Table 4, we add time-related fixed effects to see if they offer any explanatory power. As shown, adding hour-of-day, minute-of-hour, and second-of-minute fixed effects do nothing to explain high prices. Furthermore, tests of joint significance of the time fixed effects fail to reject the null hypothesis, indicating that they collectively offer no explanatory power.

Table 4 Timing of “high” prices

Looking across products, we find the pricing behavior to be strikingly consistent, aside from a small set of 6 outliers.Footnote 8 Among the 122 non-outliers, the mean (median) rate of seeing “high” is 0.334 (0.333) with a 5 %–95 % range of 0.315 to 0.355. In a simple regression of an indicator for high price on product fixed effects, an F-test of equality of all 122 product fixed effects yields a test statistic of 0.10, and an associated P-value of 1.000, indicating that we fail to reject the null hypothesis that all coefficients are equivalent.

Randomization further requires no serial correlation. Table 5 tests for serial dependence along both the zip code and product panel dimensions, finding no predictive power of the last price shown on the current price being shown. At the bottom of the table are test statistics for panel autocorrelation proposed by Wooldridge (2002), pp 282–283. We fail to find evidence of serial correlation along either panel dimension.

Table 5 Testing for serial correlation

As a final illustration of the randomness achieved by changing the zip code field of the site cookie, Fig. 3 shows a full set of forty lags of high prices for the entire Depth sample.

Fig. 3
figure 3

Lagged autocorrelation in high prices

In other research that tests for randomization, the authors often establish that the firm was indifferent among the actions it was randomizing among, a direct implication of the model being tested. In this case, we are unable to conduct such a test, as we do not observe sales figures at the different price levels. Therefore, it is impossible to say that the firm is indifferent between showing the “high” and “low” price. However, in many models discussed in Section 2, a firm would not necessarily be indifferent between the two actions: it is the randomization itself that induces higher overall profits from the overall strategy.

6 Relation to offline prices

Using the “Offline” sample introduced above we next investigate whether, and if so how, Staples’ online pricing strategy responds to offline prices, or its competitors’ prices. We rely on the data compiled by mystery shoppers hired on the TaskRabbit platform who recorded prices for five preselected products at two of each of the three chains’ outlets in 17 markets, and simultaneously recorded the online prices for the five products on each competitors’ website when accessing the websites from their home zip code. The resulting sample of 410 price observations from 85 retail stores and 232 observations for online prices reveals the following.

First, there is no variation in Office Depot or OfficeMax’ online prices for any of the five products across zip codes. The Staples’ competitors thus do not appear to be practicing spatial price discrimination on their websites.

Second, we also observe limited variation in offline prices. The average coefficient of variation in the five products’ prices across the 17 markets is 0.065 for Staples, 0.032 for Office Depot, and 0.062 for OfficeMax; there are no systematic patterns across products in the degree of price variation across markets: in Staples’ case, Post-It Notes have the largest coefficient of variation in displayed prices with 0.11; for OfficeMax it is Scotch Tape with a CV of 0.11, and for Office Depot it is Address Labels with a CV of 0.06. Across stores, there is virtually no variation in the price of Pilot pens, which could reflect contracting arrangements with Pilot.

Given this limited price variation, it is not surprising that Staples’ online price score in the sample store’s zip code is virtually uncorrelated with the three chains’ chosen in-store price; the correlation coefficient between the online price score and in-store prices is -0.03 for Staples’ prices, 0.01 for Office Depot’s prices, and -0.16 for OfficeMax’s prices. There are also no significant differences in price levels between online and offline prices in Staples case; only for OfficeMax are offline prices a statistically significant 5.05 % higher than online prices.Footnote 9 We thus conclude that Staples’ online price discrimination strategy is not in response to offline prices, or its competitors’ online pricing behavior; the evidence suggests instead that online prices are set independently of offline prices, as well as competitors’ pricing strategies, even though online price mixing responds to the proximity of retail outlets to the customer, and thus the convenience with which a consumer could shop at a competitor, rather than through Staples’ online channel.

7 Model of firm pricing decisions

A number of models could generate equilibria where a monopolist or oligopolist chooses to randomize among a set of prices systematically across markets. We take no stand on validating a particular assumption taken in the literature; instead we propose a simple model, similar to that of Sobel (1984) with a single good and a dominant firm, and with empirical demand objects instead of stylized reservation prices. We then exploit the rich geographic variation in the “Breadth” data to estimate parameters of the model.

7.1 Model

A firm is selling a good that it can acquire at a constant marginal cost c with no capacity constraint, and is considering different prices to charge: a low price p L > c, or a high price p H > p L. There are two types of consumers in a market. Loyal (high) types make up fraction α of consumers in any time period, and bargain-hunter (low) types make up (1 − α). Both types of consumers have discrete-choice demand of the Logit form used widely in empirical work (McFadden 1981). One key difference between the two types is that the bargain-hunters do not consider buying a good at a high price, only at a low price. Furthermore, a loyal customer who chooses to buy zero units in a time period exits the market for the good, while with some probability ρ a bargain-hunter who purchased no units in the same period remains in the market until the next period. We interpret this as the bargain-hunter taking a draw from the distribution of prices charged by other firms in the market when shown a high price, and returning to this firm if the price drawn is greater than p H, which occurs with probability ρ.

Consider a firm that charges a high price at all times. Demand each period comes solely from high type customers, so expected profits are

$$ \pi^{H}=M\cdot\left( p^{H}-c\right)\cdot\alpha\cdot s^{H}\left( p^{H}\right) $$
(1)

where M is the market size per time period and \(s^{H}\left (p^{H}\right )\) is the Logit demand function for high types.

A firm could instead charge a low price at all times, earning profits from both segments of consumers:

$$ \pi^{L}=M\cdot\left( p^{L}-c\right)\cdot\left[\alpha\cdot s^{H}\left( p^{L}\right)+\left( 1-\alpha\right)s^{L}\left( p^{L}\right)\right] $$
(2)

where \(s^{L}\left (\cdot \right )\) is the demand function for low type consumers.

Now suppose this firm is able to randomize prices, showing a high price with probability f. In a time period where the high price is shown, the firm earns π H. However, due to the behavior of low types, when a low price is shown, the firm earns additional profit from past low types who purchased nothing after seeing a high price but remained in the market. Expected profits are then

$$\begin{array}{@{}rcl@{}} \pi^{f,L}&=&M\cdot\left( p^{L}-c\right)\cdot\left[\alpha\cdot s^{H}\left( p^{L}\right)+\left( 1-\alpha\right)\left( \frac{1}{1-\rho\cdot f}\right)s^{L}\left( p^{L}\right)\right]\\ &=&M\cdot\left( p^{L}-c\right)\bar{s}(p^{L}), \end{array} $$
(3)

where \(\bar {s}(p^{L})\) denotes the weighted average share of high and low types served at p L. Therefore, a firm that follows a randomization strategy earns total expected profits of

$$ \pi=f\cdot\pi^{H}+\left( 1-f\right)\pi^{f,L} $$
(4)

Intuitively, the randomization acts as a mechanism to separate types. The more “loyal” types there are in a market (higher α), the more the firm loses from showing a low price, and so it will put a higher probability on the high price. Similarly, as the probability of market exit of the bargain-hunters decreases (higher ρ), the fewer customers are lost from showing high prices, and so the firm will put a higher probability on high price.

7.1.1 Solving the model

To operationalize the model, we specify the demand of person i of type k ∈ {L, H} as having utility and purchase probability given price p of

$$\begin{array}{@{}rcl@{}} u_{ik} & = & \delta_{k}-\beta_{k}p+\epsilon_{ik}\\ s^{k}\left( p\right) & = &\frac{\exp\left( \delta_{k}-\beta_{k}p\right)}{1+\exp\left( \delta_{k}-\beta_{k}p\right)} \end{array} $$

where 𝜖 i k is distribute i.i.d. Type I Extreme Value and the utility of the outside good is normalized to zero. The firm knows all demand parameters when choosing a pricing strategy. Optimal uniform prices come from optimizing either Eq. 1 (if serving high types only) or Eq. 2 (if serving both types).

For a firm choosing a randomizing strategy, the firm optimizes by choosing p H, p L, and f ∈ (0,1). Optimal choices are given from first-order conditions for each variable in Eq. 4. The optimal p H solves

$$ \frac{\partial\pi}{\partial p^{H}}=s^{H}(p^{H})-\beta\cdot(p^{H}-c)\cdot s^{H}(p^{H})\cdot(1-s^{H}(p^{H}))=0 $$
(5)

Since π f, L is independent of p H, the optimal high price p H is the standard optimal price in a Logit demand model. As it does not depend on the mixing frequency f or the share of high types α, it is the same price the firm would choose if it were serving only the high types.

Due to our assumption that a share ρ of the previous period’s low types remains in the market and considers purchasing in the current period in (1 − f) percent of cases, the optimal choices for p L and f are inter-dependent, jointly solving the two first-order conditions:

$$\begin{array}{@{}rcl@{}} \frac{\partial\pi}{\partial p^{L}} & = & \bar{s}(p^{L})-\beta(p^{L}-c)\left\{ \alpha s^{H}(p^{L})(1-s^{H}(p^{L}))+\frac{1-\alpha}{1-\rho f}s^{L}(p^{L})(1-s^{L}(p^{L}))\right\} =0 \\ \frac{\partial\pi}{\partial f} & = & \pi^{H}(p^{H*})-\pi^{f,L}(p^{L})+(1-f)\rho\left( \frac{1}{1-\rho f}\right)^{2}(1-\alpha)M(p^{L}-c)s^{L}(p^{L})=0.\\ \end{array} $$
(6)

As the first order conditions highlight, finding the three profit-maximizing components of the firm’s pricing strategy, (p L, p H, f), thus requires estimates of the firm’s cost c, the share of high types α, and the two sets of preference parameters (β k , δ k ). We describe in Section 7.2 how we use our data to derive estimates of these parameters.

7.1.2 Discussion

The mechanism at work is that charging multiple prices separates the two types of consumers, allowing for price discrimination. The firm is able to earn the high profits from high types on occasion, without entirely sacrificing the low types. If there were a single rival charging a uniform price, this firm would be better off charging the low price deterministically after an optimal amount of time. However that cannot be an equilibrium as the rival would then want to “sweep” the low types from the market immediately prior. The fact that the bargain-hunter takes a draw from a distribution of rival firms means the firm can do no better than the randomizing strategy.

We make the simplifying assumption that bargain-hunters are restrained from buying at the higher price, rather than endowing them with Logit demand regardless of price, to ensure that the observed mixed pricing strategy is indeed an equilibrium outcome. In the alternative model where both types have Logit preferences for the product, the firm would randomize in equilibrium only if profits at any two prices it randomizes between are the same. Furthermore, even if the firm would continue to choose the high price to be the one that maximizes profit from the high type segment, low types would now also purchase at that price. Then, however, the concavity properties of the Logit demand imply that the firm would typically earn higher profits from charging the weighted-average price instead of mixing between an optimal price targeting the loyal segment and an optimal low price targeting the bargain hunter segment, for any mixing probability. The model would thus not produce mixed pricing as an optimal equilibrium outcome unless one made a strong assumption about firm behavior, such as that the firm had to mix between the two prices for exogenous reasons. The existing literature on mixed pricing typically makes the behavioral assumption about consumers instead of firms, and so we have done the same.

Our assumption that the low type does not buy at the high price can be interpreted in several ways. One way is that low type demand is given by a discrete-continuous model of demand, as in Hanemann (1984) or Dubin and McFadden (1984), so that instead of “staying in the market” with some probability after observing a high price, these consumers have zero demand at high prices but demand greater than one unit of the good at low prices. Another interpretation is that a low type searches for a similar but different product after observing a high price, such as a substitute good within the same product category, where the firm is employing the same randomization strategy on different goods. These interpretations rely on the low type understanding that the firm uses a mixed pricing strategy, which is possibly not the case for all consumers in our particular empirical application.

Finally, the exposition above describes the firm’s pricing strategy for a single market; applying the model to multiple markets would result in market-specific prices and mixing frequencies, as preferences, the type distribution, and the search intensity of low types vary across markets. In Staples’ case, however, the mixing frequency, but neither the high nor the low price, vary across zip codes, their chosen definition of local markets. In our empirical implementation, we therefore allow the type distribution and search intensity to vary with zip code attributes to rationalize variation in the optimal mixing frequency, but follow Staples’ practice in constraining the optimal prices to be constant across zip codes.

7.2 Estimation

Our data are limited in their ability to pin down the four sets of parameters - type distribution α z , search intensity ρ z , type-specific base product preference δ k z and price responsiveness β k z . In particular, since we do not observe quantity data, we are unable to estimate all of the model parameters freely. We instead assume that the two types’ Logit preferences are identical across zip code, but allow the share of loyal customers, α z , and the search parameter, ρ z , to vary by zip code.

We calibrate the consumer’s price responsiveness β k , k ∈ {L, H}, to rationalize the high and low prices we observe for the stapler, $14.29 and $15.79. We do so by relying on the firm’s first order conditions for the two prices, together with an estimate of the firm’s cost and an assumption that the base demand for loyal consumers, δ H , is three times the base demand for bargain hunters, which we normalize to δ L = 1. This implies that the base utility of the good is higher for high types, and will also result in them being less responsive to price, as expected. As an estimate of the stapler’s marginal cost, we use its lowest available price on Amazon Marketplace, which implies that c = $10.46.

Under these assumptions, we can immediately find the high type’s price responsiveness β H that solves the firm’s first optimal pricing condition, Eq. 5, given the observed high price. To calibrate the low type’s price responsiveness, we rely on the firm’s first-order condition with respect to p L in Eq. 6. It requires information on the share of low types, α z , together with information on the equilibrium mixing probability, \(f_{z}^{*}\). Since both vary by zip code, but the optimal low price does not, we use the “naive” p L that arises under the assumption α = 0. In that case, the firm’s optimal pricing strategy collapses to charging the optimal low price that solves Eq. 2; when there are low types only, there is no need to randomize.Footnote 10 , Footnote 11 The calibrated values are depicted in Appendix Table 9; in line with intuition, bargain hunter low types respond more to price than loyal high type consumers.

Given the types’ Logit preferences, we then want to estimate zip code-level α z , the share of loyal customers, and ρ z , the search parameter, as a function of local characteristics. Given our reduced-form analysis of covariates of the empirical mixing distribution f, we choose to model them as follows:

$$\begin{array}{@{}rcl@{}} \alpha_{z} & = & \alpha_{0}+\gamma\cdot d_{z}\\ \rho_{z} & = & \rho_{0}+\eta\cdot y_{z} \end{array} $$

where d z is log of the minimum distance from zip code z to either an OfficeMax or Office Depot location, y z is the log of the number of establishments in zip code z from the census, and 𝜃 = {α 0, γ, ρ 0, η} is a parameter vector to estimate.

Given a parameter vector 𝜃, we can compute the optimal f z for zip code z as the profit-maximizing probability of showing high prices.

$$f_{z}\left( \alpha_{z},\rho_{z},\theta\right)=argmax_{f}\pi\left( f,\theta\right), $$

which amounts to solving the firm’s first-order condition with respect to f in Eq. 6 given the observed low and high price, assumed marginal cost, and calibrated Logit preferences. We then compare the resulting optimal f to the observed high price frequency in the data, \({f_{z}^{0}}\), and estimate the parameter vector via minimum distance.

$$\hat{\theta}=argmin_{\theta}{\sum}_{z=1}^{Z}(f_{z}\left( \theta\right)-{f_{z}^{0}})^{2} $$

Note that the estimation routine does not weight according to zip code populations or other characteristics; all zip codes are weighted equally as each is a realization of the model for an individual market.Footnote 12

7.3 Results

The results of the estimation exercise are displayed in Table 6.

Table 6 Estimation results: determinants of the optimal mixing probability

These estimates imply that the share of loyal types Staples has in a zip code is increasing in the distance to rivals, and the probability of searching again among bargain-hunter types is falling in the number of establishments in the zip code. The raw correlation between the predicted mixing probabilities and the empirical mixing probabilities is 0.776, although this is biased away from one due to the coarseness of the empirical mixing probabilities. Regressing the empirical mixing probabilities on the predicted ones yields a slope coefficient of 0.9703.Footnote 13 The figures below show the distribution of values of α z = α 0 + γd z and ρ z = ρ 0 + ηy z across zip codes, using our estimates of α 0, γ, ρ 0, η and the empirical distributions of d z and y z . Figure 4 shows that there is a mass of zip codes, approximately 25 %, where the share of high types is zero, and that the remaining zip codes span shares of high types from zero to one. Figure 5 shows that the distribution of ρ is bimodal, with its range located between 0.84 and 0.94.

Fig. 4
figure 4

Estimated distributions of loyal types across zip codes

Fig. 5
figure 5

Estimated distributions of search parameter across zip codes

Figure 6 shows the joint distribution of α and ρ across zip codes, along with the optimal probability of showing a high price for a given (α, ρ) pair. Intuitively, if there are more loyal types, or if the probability of searching again is high, a higher frequency for the high price is optimal. This figure highlights the interplay between the two aspects of the model.

Fig. 6
figure 6

Joint distribution of loyal types and search parameter

The estimates imply that engaging in randomization is strictly more profitable than either only serving the high types by always charging a high price in all markets, or the strategy of serving both types by always charging a low price in all markets. Table 7 shows the percent change in profit from either alternative strategy; both reduce profit, although much more so when the firm only serves the high types in the market. The values were computed using Eqs. 1 and 2 at our parameter estimates and calibrated parameters. This is consistent with Fig. 6 and the fact that our estimates imply that the profit-maximizing mixing ratio f has more mass in the lower range of f, and many markets have an optimal f of zero when the single high price is never shown.

Table 7 Profits under different strategies

7.4 Implications and discussion

Table 8 shows some basic simulations that illustrate the implications of the model estimates. We first show the effect of increasing rival distances by 20 % and increasing number of establishments by 20 % by scaling the observed values of d z and y z but holding our estimates for α 0, γ, ρ 0 and η fixed and solving for new profit-maximizing mixing probabilities f; both have modest effects on the effective average price shown in the entire sample, although the distance to rival carries more weight. A 20 % increase in distance to the nearest rival implies a nearly 32 % increase in the share of “loyal” types, while increasing the number of establishments in the zip code by 20 % decreases the probability of searching again by a little over 1 %. The implication of these two exercises is that the distance to rivals is the dominating force in pricing for Staples. The final column uses data on the change in OfficeMax and Office Depot store locations in 2014 as a result of their merger to determine the effect on loyal types in a market for Staples.com. The merger affects the distance to the nearest store, d z , for only 6.3 % of zip codes in our sample. This seems reasonable as the combined entity would be most likely to eliminate locations in close proximity to one another, while retaining locations of one chain not in the immediate vicinity of the other chain.Footnote 14 One limitation of this analysis is that we do not re-compute an optimal low price in each scenario, as doing so would be sensitive to the calibrated parameters. As our data are more informative about mixing than the price levels, prices p L and p H remain fixed for these simulations.

Table 8 Model estimates and simulations

While the predicted price increase from the merger is small, it can still imply a significant dollar value given the size of the site and the fact it can be applied to a large selection of products. To put these estimates in context, in 2013, Staples reported $10.4B in online revenue. A back-of-the-envelope calculation says that if their rivals’ merger allows them to raise average prices by even 1 % on roughly half of their product selection, with little demand response, they could see a profit increase of $50M.

8 Conclusion

This paper makes several contributions to the literature on pricing. First, we test for and document a case of a firm using a mixed pricing strategy for posted prices in an online marketplace. Second, we document the relationship between online and offline pricing for a major product market. Third, we estimate a model for online price discrimination consistent with our data and use it to quantify the magnitudes of the competitive forces on pricing. While our analysis is limited in that it examines an idiosyncratic instance of such a strategy, our results show that online price discrimination can be an effective tool for firms, and that it is sensitive to offline alternatives. This work revisits large literatures in marketing and economics on price dispersion and price discrimination, and has implications for policy as merger analysis should take into account online price effects from industry consolidation. As pointed out by Mikians et al. (2012), online price discrimination is widespread, and work such as Mikians et al. (2013) shows the challenge in analyzing such behavior.