Abstract
The performance of computationally inexpensive model selection criteria in the context of tree-structured subgroup analysis is investigated. It is shown through simulation that no single model selection criterion exhibits a uniformly superior performance over a wide range of scenarios. Therefore, a two-stage approach for model selection is proposed and shown to perform satisfactorily. Applied example of subgroup analysis is presented. Problems associated with tree-structured subgroup analysis are discussed and practical solutions are suggested.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Akaike H. 1974. A new look at the statistical model identification. IEEE Transactions on Automatic Control AC-19: 716–723.
Breiman L., Friedman J.H., Olshen R.A. and Stone C.J. 1984. Classification and Regression Trees. Wadsworth, Belmont, CA.
Bulpitt C.J. 1988. Subgroup analysis. Lancet 2: 31–34.
Ciampi A., Lou Z., Lin Q. and Negassa A. 1991. Recursive partition and amalgamation with the exponential family: Theory and application. Applied Stochastic Models and Data Analysis 7: 121–137.
Ciampi A. 1991. Generalized regression trees. Computational Statistic and Data Analysis 12: 57–78.
Ciampi A., Negassa A. and Lou Z. 1995. Tree-structured prediction for censored survival data and the cox model. Journal of Clinical Epidemiology 48: 675–689.
Cox D.R. 1972. Regression models and life tables (with discussion). Journal of the Royal Statistical Society B 34: 187–220.
Davis R.B. and Anderson J.R. 1989. Exponential survival trees. Statistics in Medicine 8: 947–961.
Efron B. 1983. Estimating the error rate of a prediction rule: Improvement on cross-validation. Journal of the American Statistical Association 78: 316–331.
Gail M. and Simon R. 1985. Testing for qualitative interactions between treatment effects and patient subsets. Biometrics 41: 361–372.
Kalbfleisch J.D. and Prentice R.L. 1980. The Statistical Analysis of Failure Time Data. J. Wiley and Sons, New York.
Krzanowski W.J. and Lai Y.T. 1988. A criterion for determining the number of groups in a data set using sum-of-squares clustering. Biometrics 44: 23–34.
LeBlanc M. and Crowley J. 1992. Relative risk tree for survival data. Biometrics 48: 411–425.
LeBlanc M. and Crowley J. 1993. Survival tree by goodness of split. Journal of the American Statistical Association 88: 457–467.
Negassa A., Ciampi A., Abrahamowicz M., Shapiro S. and Boivin J.-F. 2000. Tree-structured prognostic classification for censored survival data: Validation of computationally inexpensive model selection criteria. Journal of Statistical Computation and Simulation 67: 289–318.
Schmoor C., Ulm K. and Schumacher. 1993. Comparison of the cox model and the regression tree procedure in analysing a randomized clinical trial. Statistics in Medicine 12: 2351– 2366.
Segal R.M. 1988. Regression tree for censored data. Biometrics 44: 35–47.
Yusuf S., Wittes J., Probstfield J. and Tyroler H.A. 1991. Analysis and interpretation of treatment effects in subgroups of patients in randomized clinical trials. JAMA 266: 93–98.
Zhang H. and Singer B. 1999. Recursive Partitioning in the Health Sciences. Springer-Verlag, New York.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Negassa, A., Ciampi, A., Abrahamowicz, M. et al. Tree-structured subgroup analysis for censored survival data: Validation of computationally inexpensive model selection criteria. Stat Comput 15, 231–239 (2005). https://doi.org/10.1007/s11222-005-1311-z
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s11222-005-1311-z