Skip to main content

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 608))

Abstract

Sampling is a well-established statistical technique that selects a part from a whole to make inferences about the whole. It can be employed to overcome problems caused by high dimensionality of attributes as well as large volumes of data in data mining. This chapter summarizes the basic ideas, assumptions, considerations and advantages as well as limitations of sampling, categorizes representative sampling methods by their features, provides a preliminary guideline on how to choose suitable sampling methods. We hope this can help users build a big picture of sampling methods and apply them in data mining.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Buckland, S., Anderson, D., Burnham, K., and Laake, J. (1993). Distance Sampling: Estimating Abundance of Biological Populations. Chapman — Hall, Inc.

    Google Scholar 

  • Cochran, W. (1977). Sampling Techniques. Wiley, third edition.

    MATH  Google Scholar 

  • Efron, B. and Tibshirani, R. (1993). An Introduction to the Bootstrap. Chapman & Hall, Inc.

    MATH  Google Scholar 

  • Fan, S. (1967). Basic Sampling Methods. University of Singapore.

    Google Scholar 

  • Gamerman, D. (1997). Markov Chain Monte Carlo: Stochastic Simula-tion For Bayesian Inference. Chapman — Hall.

    Google Scholar 

  • Gu, B., Hu, F., and Liu, H. (2000). Sampling and its application in data mining. Technical Report http://techrep.comp.nus.edu.sg/techreports/2000/TRA6-00.asp, Department of Computer Science, National University of Singapore.

    Google Scholar 

  • Hedayat, A. (1991). Design and Inference in Finite Population Sampling. NY: John Wiley — Sons, Inc.

    MATH  Google Scholar 

  • Higgins, J. (1996). Sampling Theory in Fourier and Signal Analysis. Oxford Science Publications.

    MATH  Google Scholar 

  • Krishnaiah, P. and Rao, C. (1988). Handbook of Statistics 6: Sampling. North-Holland.

    Google Scholar 

  • Mackay, D. (1998). Introduction to Monte Carlo Methods, in Learning in Graphical Models. Kluwer Academic Publishers.

    Google Scholar 

  • Scheaffer, R., Mendenhall, W., and L. O. (1996). Elementary Survey Sampling. Duxbury Press, fifth edition.

    Google Scholar 

  • Singh, R. and Mangat, N. (1996). Elements of Survey Sampling. Kluwer Academic Publishers.

    MATH  Google Scholar 

  • Som, R. (1995). Practical Sampling Techniques. M. Dekker, second edition.

    Google Scholar 

  • Stuart, A. (1976). Basic Ideas of Scientific Sampling. London: Griffin.

    Google Scholar 

  • Thompson, S. (1992). Sampling. Wiley.

    MATH  Google Scholar 

  • Thompson, S. and Seber, G. A. F. (1996). Adaptive Sampling. Wiley.

    MATH  Google Scholar 

  • Trochim, W. (1999). Research Methods in Knowledge Base. Cornell Custom Publishing, second edition.

    Google Scholar 

  • Tryfos, P. (1996). Sampling Methods for Applied Research: Text and Cases. John Wiley & Sons, Inc.

    Google Scholar 

  • Yamane, T. (1967). Elementary Sampling Theory. Prentice Hall.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Gu, B., Hu, F., Liu, H. (2001). Sampling: Knowing Whole from Its Part. In: Liu, H., Motoda, H. (eds) Instance Selection and Construction for Data Mining. The Springer International Series in Engineering and Computer Science, vol 608. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3359-4_2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4757-3359-4_2

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-4861-8

  • Online ISBN: 978-1-4757-3359-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics