Abstract
Many scientific problems reduce to modeling the relationship between two sets of variables, and regression methodology is designed to quantify such relationships. Because of their mathematical simplicity, linear regression for continuous data, logistic regression for binary data, proportional hazards regression for censored survival data, and mixed-effects regression for longitudinal data are among the most commonly used statistical methods. These parametric (or semiparametric) regression methods, however, may not describe the data faithfully when their underlying assumptions are not satisfied. An extensive literature addresses the diagnosis of parametric and semiparametric regression models, but in practice model diagnosis is applied unevenly at best. A common approach is to inspect residual plots, which is straightforward for a simple regression model but can become highly sophisticated as model complexity grows. Furthermore, model interpretation can be problematic in the presence of higher-order interactions among potent predictors. Nonparametric regression has evolved to relax or remove these restrictive assumptions.
Keywords
- Multivariate Adaptive Regression Spline
- Recursive Partitioning
- Mammalian Sperm
- Semiparametric Regression Model
- Ordinary Linear Regression
These keywords were added by machine and not by the authors.
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Zhang, H., Singer, B.H. (2010). Introduction. In: Recursive Partitioning and Applications. Springer Series in Statistics, vol 0. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-6824-1_1
DOI: https://doi.org/10.1007/978-1-4419-6824-1_1
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-6823-4
Online ISBN: 978-1-4419-6824-1
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)