Skip to main content

Evaluating the Distance between Two Uncertain Categorical Objects

  • Conference paper
Advanced Data Mining and Applications (ADMA 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6441))

Included in the following conference series:

  • 3018 Accesses

Abstract

Evaluating distances between uncertain objects is needed for some uncertain data mining techniques based on distance. An uncertain object can be described by uncertain numerical or categorical attributes. However, many uncertain data mining algorithms mainly discuss methods of evaluating distances between uncertain numerical objects. In this paper, an efficient method of evaluating distances between uncertain categorical objects is presented. The method is used in nearest-neighbor classifying. Experiments with datasets based on UCI datasets and the plant dataset of “Three Parallel Rivers of Yunnan Protected Areas” verify the method is efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Singh, S., Mayfield, C., Prabhakar, S., Shah, R., Hambrusch, S.: Indexing Uncertain Categorical Data. In: 23th IEEE International Conference on Data Engineering, pp. 616–625. IEEE Computer Society, New York (2007)

    Google Scholar 

  2. Wang, L.Z.: A Method of the Abstract Generalization on the Basis of the Semantic Proximity. Chinese Journal of Computers 10, 1114–1121 (2000)

    Google Scholar 

  3. Chau, M., Cheng, R., Kao, B.: Uncertain Data Mining: A New Research Direction. In: The Workshop on the Sciences of the Artificial, Taiwan (2005)

    Google Scholar 

  4. Ngai, W.K., Kao, B., Chui, C.K., Cheng, R., Chau, M., Yip, K.Y.: Efficient Clustering of Uncertain Data. In: 6th IEEE International Conference on Data Mining, pp. 436–445. IEEE Computer Society, New York (2006)

    Google Scholar 

  5. Lee, S.D., Kao, B., Cheng, R.: Reducing UK-means to K-means. In: 7th IEEE International Conference on Data Mining Workshops, pp. 483–488. IEEE Computer Society, New York (2007)

    Google Scholar 

  6. Kao, B., Lee, S.D., Cheung, D.W., Ho, W.S., Chan, K.F.: Clustering Uncertain Data Using Voronoi Diagrams. In: 8th IEEE International Conference on Data Mining, pp. 333–342. IEEE Computer Society, New York (2008)

    Google Scholar 

  7. Aggarwal, C.C., Yu, P.S.: Outlier Detection with Uncertain Data. In: 8th SIAM International Conference on Data Mining, pp. 483–493. SIAM, Philadelphia (2008)

    Google Scholar 

  8. Beskales, G., Soliman, M.A., Llyas, I.F.: Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases. In: Very Large Data Base, pp. 326–339. ACM, New York (2008)

    Google Scholar 

  9. Aggarwal, C.C., Yu, P.S.: A Survey of Uncertain Data Algorithms and Applications. IEEE Transactions On Knowledge And Data Engineering 21, 609–623 (2009)

    Article  Google Scholar 

  10. Leung, C.K.-S., Carmichael, C.L., Hao, B.: Efficient Mining of Frequent Patterns from Uncertain Data. In: 7th IEEE International Conference on Data Mining, pp. 489–494. IEEE Computer Society, New York (2007)

    Google Scholar 

  11. Qin, B., Xia, Y., Prabhakar, S., Tu, Y.: A Rule-Based Classification Algorithm for Uncertain Data. In: 25th IEEE International Conference on Data Engineering, pp. 1633–1640. IEEE Computer Society, New York (2009)

    Google Scholar 

  12. Wang, L.Z., Zhou, L.H., Lu, J., Yip, J.: An Order-clique-based Approach for Mining Maximal Co-locations. Information Sciences 179, 3370–3382 (2009)

    Article  MATH  Google Scholar 

  13. UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, H., Wang, L., Liu, W., Xiao, Q. (2010). Evaluating the Distance between Two Uncertain Categorical Objects. In: Cao, L., Zhong, J., Feng, Y. (eds) Advanced Data Mining and Applications. ADMA 2010. Lecture Notes in Computer Science(), vol 6441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17313-4_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17313-4_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17312-7

  • Online ISBN: 978-3-642-17313-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics