Abstract
This chapter provides a nontechnical summary of how to deal with endogeneity in regression models for marketing research applications. When researchers want to make causal inference of a marketing variable (e.g., price) on an outcome variable (e.g., sales), using observational data and a regression approach, they need the marketing variable to be exogenous. If the marketing variable is driven by factors unobserved by the researcher, such as the weather or other factors, then the assumption that the marketing variable is exogenous is not tenable, and the estimated effect of the marketing variable on the outcome variable may be biased. This is the essence of the endogeneity problem in regression models. The classical approach to address endogeneity is based on instrumental variables (IVs). IVs are variables that isolate the exogenous variation in the marketing variable. However, finding IVs of good quality is challenging. We discuss good practice in finding IVs, and we examine common IV estimation approaches, such as the two-stage least squares approach and the control function approach. Furthermore, we consider other implementation challenges, such as dealing with endogeneity when there is an interaction term in the regression model. Importantly, we also discuss when endogeneity matters and when it does not matter, as the “cure” to the problem can be worse than the “disease.”
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Many demand models in marketing are nonlinear. At the end of this chapter, we briefly discuss nonlinear models. A popular nonlinear demand model to estimate price elasticities is the log-log demand model, where both the dependent and independent variables are the natural logs of the original variables, which can be estimated using standard approaches for linear regression models. Log-log models are also prone to an endogeneity problem.
- 2.
We use the cross-sectional setup in Eq. 1 as the leading example in this chapter. A similar logic applies to a time series setup (e.g., when we would view Eq. 1 as a time series model). However, this would require an additional discussion of dealing with potential autocorrelation in the model error terms, which is beyond the scope of this chapter. Therefore, we assume that the error terms εi are independent and identically distributed in this chapter.
- 3.
For the sake of simplicity in this example, we assume that the researcher estimates a constant elasticity model (e.g., using a log-log regression). The optimal price can then be computed as p∗ = c(β/(β + 1)), where c is marginal cost and β is the estimated price elasticity (Amoroso-Robinson theorem, e.g., Homburg et al. 2009, p. 181).
- 4.
We simplified their model here for sake of exposition. The full model of Dinner et al. (2014) splits online advertising into search advertising and banner ads, allows for advertising carryover effects, and for the effects of other covariates.
- 5.
This discussion is similar in spirit to the rationale behind mediation analysis. We refer the reader to chapter “Mediation Analysis in Experimental Research” in this handbook for more details.
- 6.
The option sigmamore specifies that the covariance matrices are based on the estimated error variance from the efficient OLS estimator. Stata’s online help provides more information (“help Hausman”).
- 7.
In case the variables are mean centered before they enter the product, i.e., \( \left({P}_i-\overline{P}\right)\left({X}_i-\overline{X}\right) \), we need to use the mean-centered interaction term on the left-hand side of (16) and on the right-hand side of (14), instead of PiXi. The IVs do not require mean centering as the first-stage predictions will not be affected by mean centering.
- 8.
Ascarza et al. (2017) leverage a field experiment to address endogeneity concerns in the context of a customer relationship management (CRM) campaign, customer targeting, and social influence.
References
Albers, S., Mantrala, M. K., & Sridhar, S. (2010). Personal selling elasticities: A meta-analysis. Journal of Marketing Research, 47(5), 840–853.
Andrews, R. L., & Ebbes, P. (2014). Properties of instrumental variables estimation in logit-based demand models: Finite sample results. Journal of Modelling in Management, 9(3), 261–289.
Angrist, J. D., & Pischke, J. S. (2008). Mostly harmless econometrics. An empiricist’s companion. Princeton: Princeton University Press.
Ascarza, E., Ebbes, P., Netzer, O., & Danielson, M. (2017). Beyond the target customer: Social effects in CRM campaigns. forthcoming in the Journal of Marketing Research.
Baltagi, B. H. (2013). Econometric analysis of panel data (5th ed.). Chichester: Wiley.
Bascle, G. (2008). Controlling for endogeneity with instrumental variables in strategic management research. Strategic Organization, 6(3), 285.
Baum, C. F., Schaffer, M. E., & Stillman, S. (2007). Enhanced routines for instrumental variables/GMM estimation and testing. Stata Journal, 7(4), 465–506.
Baum, C. F., Schaffer, M. E., & Stillman, S. (2015). IVREG2: Stata module for extended instrumental variables/2SLS and GMM estimation. Boston College Department of Economics. Retrieved from https://ideas.repec.org/c/boc/bocode/s425401.html
Berry, S. T. (1994). Estimating discrete-choice models of product differentiation. The Rand Journal of Economics, 25(2), 242–262.
Bhattacharjee, S., Gopal, R. D., Lertwachara, K., Marsden, J. R., & Telang, R. (2007). The effect of digital sharing technologies on music markets: A survival analysis of albums on ranking charts. Management Science, 53(9), 1359–1374.
Bijmolt, T. H. A., van Heerde, H. J., & Pieters, R. G. M. (2005). New empirical generalizations on the determinants of price elasticity. Journal of Marketing Research, 42(2), 141–156.
Bound, J., Jaeger, D. A., & Baker, R. M. (1995). Problems with instrumental variables estimation when the correlation between the instruments and the endogenous explanatory variable is weak. Journal of the American Statistical Association, 90(430), 443–450.
Chintagunta, P. C., & Desiraju, R. (2005). Strategic pricing and detailing behavior in international markets. Marketing Science, 24(1), 67–80.
Dinner, I. M., van Heerde, H. J., & Neslin, S. A. (2014). Driving online and offline sales: The cross-channel effects of traditional, online display, and paid search advertising. Journal of Marketing Research (JMR), 51(5), 527–545.
Ebbes, P., Böckenholt, U., & Wedel, M. (2004). Regressor and random-effects dependencies in multilevel models. Statistica Neerlandica, 58(2), 161–178.
Ebbes, P., Wedel, M., Böckenholt, U., & Steerneman, T. (2005). Solving and testing for regressor-error (in)dependence when no instrumental variables are available: With new evidence for the effect of education on income. Quantitative Marketing & Economics, 3(4), 365–392.
Ebbes, P., Papies, D., & van Heerde, H. J. (2011). The sense and non-sense of holdout sample validation in the presence of endogeneity. Marketing Science, 30(6), 1115–1122.
Franses, P. H. (2005). On the use of econometric models for policy simulation in marketing. Journal of Marketing Research, 42, 4–14.
Gentzkow, M. (2007). Valuing new goods in a model with complementarity: Online newspapers. American Economic Review, 97(3), 713–744.
Germann, F., Ebbes, P., & Grewal, R. (2015). The chief marketing officer matters! Journal of Marketing, 79(3), 1–22.
Greene, W. H. (2011). Econometric analysis. Upper Saddle River: Prentice Hall.
Homburg, C., Kuester, S., & Krohmer, H. (2009). Marketing management: A contemporary perspective. London: Mcgraw-Hill Higher Education.
Karaca-Mandic, P., & Train, K. (2003). Standard error correction in two-stage estimation with nested samples. The Econometrics Journal, 6(2), 401–407. https://doi.org/10.1111/1368-423X.t01-1-00115.
Ledolter, J., & Swersey, A. J. (2007). Testing 1-2-3 experimental design with applications in marketing and service operations. Stanford, California: Stanford Business Books.
Leeflang, P., Wittink, D. R., Wedel, M., & Naert, P. A. (2000). Building models for marketing decisions. Dordrecht: Kluwer.
Leenheer, J., Van Heerde, H. J., Bijmolt, T. H. A., & Smidts, A. (2007). Do loyalty programs really enhance behavioral loyalty? An empirical analysis accounting for self-selecting members. International Journal of Research in Marketing, 24, 31–47.
Liebowitz, S. J. (2016). How much of the decline in sound recording sales is due to file-sharing? Journal of Cultural Economics, 40(1), 13–28.
Lindgren, B. W. (1993). Statistical theory. London: Chapman and Hall.
Manchanda, P., Rossi, P. E., & Chintagunta, P. K. (2004). Response modeling with nonrandom marketing-mix variables. Journal of Marketing Research (JMR), 41(4), 467–478.
Nevo, A. (2001). Measuring market power in the ready-to-eat cereal industry. Econometrica, 69(2), 307–342.
Papies, D., Ebbes, P., Van Heerde, H. J. (2017). Addressing endogeneity in marketing models. In P. S. H. Leeflang, J. E. Wieringa, T. H. A. Bijmolt, & K. H. Pauwels (Eds.), Advanced methods for modelling marketing. Switzerland: Springer. http://www.springer.com/de/book/9783319534671
Park, S., & Gupta, S. (2012). Handling endogenous regressors by joint estimation using copulas. Marketing Science, 31(4), 567–586.
Petrin, A., & Train, K. (2010). A control function approach to endogeneity in consumer choice models. Journal of Marketing Research (JMR), 47(1), 3–13.
Podsakoff, P. M., MacKenzie, S. B., Lee, J.-Y., & Podsakoff, N. P. (2003). Common method biases in behavioural research: A critical review of the literature and recommended remedies. Journal of Applied Psychology, 88(5), 879–903.
Rooderkerk, R. P., van Heerde, H. J., & Bijmolt, T. H. A. (2013). Optimizing retail assortments. Marketing Science, 32(5), 699–715.
Rossi, P. (2014). Even the rich can make themselves poor: A critical examination of IV methods in marketing applications. Marketing Science, 33(5), 655–672.
Sanderson, E., & Windmeijer, F. (2016). A weak instrument -test in linear IV models with multiple endogenous variables. Journal of Econometrics, 190(2), 212–221.
Sethuraman, R., Tellis, G. J., & Briesch, R. A. (2011). How well does advertising work? Generalizations from meta-analysis of brand advertising elasticities. Journal of Marketing Research, 48(3), 457–471.
Shaver, J. M. (1998). Accounting for endogeneity when assessing strategy performance: Does entry mode choice affect. Management Science, 44(4), 571–585.
Stock, J. H., Wright, J. H., & Yogo, M. (2002). A survey of weak instruments and weak identification in generalized method of moments. Journal of Business Statistics and Economic Statistics, 20, 518–529.
Verbeek, M. (2012). A guide to modern econometrics (4th ed.). Hoboken: Wiley.
Verma, I. M. (2014). Editorial expression of concern: Experimental evidence of massive scale emotional contagion through social networks. Proceedings of the National Academy of Sciences, 111(29), 10779–10779.
Villas-Boas, J. M., & Winer, R. S. (1999). Endogeneity in brand choice models. Management Science, 45(10), 1324–1338.
Wooldridge, J. M. (2010). Econometric analysis of cross section and panel data (2nd ed.). Cambridge, MA: MIT Press.
Wooldridge, J. M. (2015). Control function methods in applied econometrics. Journal of Human Resources, 50(2), 420–445.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this entry
Cite this entry
Ebbes, P., Papies, D., van Heerde, H.J. (2022). Dealing with Endogeneity: A Nontechnical Guide for Marketing Researchers. In: Homburg, C., Klarmann, M., Vomberg, A. (eds) Handbook of Market Research. Springer, Cham. https://doi.org/10.1007/978-3-319-57413-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-57413-4_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57411-0
Online ISBN: 978-3-319-57413-4
eBook Packages: Business and ManagementReference Module Humanities and Social SciencesReference Module Business, Economics and Social Sciences