Abstract
Detection of multiple outliers in multivariate data using Mahalanobis distances requires robust estimates of the means and covariance matrix of the data. We obtain these by sequential construction of an outlier-free subset of the data, starting from a small random subset. The stalactite plot provides a cogent summary of suspected outliers as the subset size increases. The dependence on subset size can be virtually removed by a simulation-based normalization. Combined with probability plots and resampling procedures, the stalactite plot, particularly in its normalized form, leads to identification of multivariate outliers, even in the presence of appreciable masking.
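The sequential construction described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the growth rule (refit on the current subset, then keep the m + 1 observations with the smallest distances), and the fixed `cutoff` on the squared distance are all assumptions made for illustration, and the simulation-based normalization is not attempted. For p = 2 variables, a cutoff of about 9.21 corresponds to the 99% point of a chi-squared distribution on 2 degrees of freedom.

```python
import numpy as np

def mahalanobis_sq(X, mean, cov):
    """Squared Mahalanobis distance of each row of X from `mean` under `cov`."""
    diff = X - mean
    # Solve cov @ z = diff.T instead of forming the inverse explicitly.
    return np.einsum("ij,ji->i", diff, np.linalg.solve(cov, diff.T))

def forward_search_flags(X, cutoff, rng):
    """Grow a subset from p + 1 random rows up to all n rows.

    Returns a dict mapping subset size m to the indices whose squared
    distance, computed from the size-m subset, exceeds `cutoff`.
    Assumes the starting subset gives a nonsingular covariance matrix.
    """
    n, p = X.shape
    subset = rng.choice(n, size=p + 1, replace=False)
    flags = {}
    for m in range(p + 1, n + 1):
        mean = X[subset].mean(axis=0)
        cov = np.cov(X[subset], rowvar=False)
        d2 = mahalanobis_sq(X, mean, cov)
        flags[m] = np.flatnonzero(d2 > cutoff)
        if m < n:
            # Hypothetical growth rule: keep the m + 1 closest observations.
            subset = np.argsort(d2)[: m + 1]
    return flags

def stalactite_rows(flags, n):
    """Text rendering: one row per subset size, '*' marks a flagged observation."""
    rows = []
    for m in sorted(flags):
        flagged = set(flags[m].tolist())
        marks = "".join("*" if i in flagged else " " for i in range(n))
        rows.append(f"{m:4d} " + marks)
    return rows
```

Calling `print("\n".join(stalactite_rows(forward_search_flags(X, 9.21, np.random.default_rng(0)), len(X))))` on an n-by-2 array `X` prints one row per subset size. Columns of stars that persist as the subset grows point to suspected outliers, while stars that vanish near the bottom rows are consistent with masking once the outliers enter the fitted subset.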
Cite this article
Atkinson, A.C., Mulira, HM. The stalactite plot for the detection of multivariate outliers. Stat Comput 3, 27–35 (1993). https://doi.org/10.1007/BF00146951
DOI: https://doi.org/10.1007/BF00146951