Abstract
Sampling is a well-established statistical technique that selects a part from a whole to make inferences about the whole. It can be employed to overcome problems caused by high dimensionality of attributes as well as large volumes of data in data mining. This chapter summarizes the basic ideas, assumptions, considerations and advantages as well as limitations of sampling, categorizes representative sampling methods by their features, provides a preliminary guideline on how to choose suitable sampling methods. We hope this can help users build a big picture of sampling methods and apply them in data mining.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Buckland, S., Anderson, D., Burnham, K., and Laake, J. (1993). Distance Sampling: Estimating Abundance of Biological Populations. Chapman — Hall, Inc.
Cochran, W. (1977). Sampling Techniques. Wiley, third edition.
Efron, B. and Tibshirani, R. (1993). An Introduction to the Bootstrap. Chapman & Hall, Inc.
Fan, S. (1967). Basic Sampling Methods. University of Singapore.
Gamerman, D. (1997). Markov Chain Monte Carlo: Stochastic Simula-tion For Bayesian Inference. Chapman — Hall.
Gu, B., Hu, F., and Liu, H. (2000). Sampling and its application in data mining. Technical Report http://techrep.comp.nus.edu.sg/techreports/2000/TRA6-00.asp, Department of Computer Science, National University of Singapore.
Hedayat, A. (1991). Design and Inference in Finite Population Sampling. NY: John Wiley — Sons, Inc.
Higgins, J. (1996). Sampling Theory in Fourier and Signal Analysis. Oxford Science Publications.
Krishnaiah, P. and Rao, C. (1988). Handbook of Statistics 6: Sampling. North-Holland.
Mackay, D. (1998). Introduction to Monte Carlo Methods, in Learning in Graphical Models. Kluwer Academic Publishers.
Scheaffer, R., Mendenhall, W., and L. O. (1996). Elementary Survey Sampling. Duxbury Press, fifth edition.
Singh, R. and Mangat, N. (1996). Elements of Survey Sampling. Kluwer Academic Publishers.
Som, R. (1995). Practical Sampling Techniques. M. Dekker, second edition.
Stuart, A. (1976). Basic Ideas of Scientific Sampling. London: Griffin.
Thompson, S. (1992). Sampling. Wiley.
Thompson, S. and Seber, G. A. F. (1996). Adaptive Sampling. Wiley.
Trochim, W. (1999). Research Methods in Knowledge Base. Cornell Custom Publishing, second edition.
Tryfos, P. (1996). Sampling Methods for Applied Research: Text and Cases. John Wiley & Sons, Inc.
Yamane, T. (1967). Elementary Sampling Theory. Prentice Hall.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Gu, B., Hu, F., Liu, H. (2001). Sampling: Knowing Whole from Its Part. In: Liu, H., Motoda, H. (eds) Instance Selection and Construction for Data Mining. The Springer International Series in Engineering and Computer Science, vol 608. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3359-4_2
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3359-4_2
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-4861-8
Online ISBN: 978-1-4757-3359-4
eBook Packages: Springer Book Archive