Skip to main content

A Discretization Algorithm of Numerical Attributes for Digital Library Evaluation Based on Data Mining Technology

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7008))

Abstract

We present here a discretization algorithm for numerical attributes of digital collections. In our research data mining technology is imported into digital library evaluation to provide a better decision-making support. But data prediction algorithms work not well based on the traditional discretization method during the data mining process. The reason is that numerical attributes of digital collections are complicated and not in the same scale of distribution distance. We study the characteristic of numerical attributes and put forward a discretization method based on the Z-score idea of mathematical statistics. This algorithm can reflect the dynamic semantic distance for different numerical attributes and significantly enhance the precision rate and recall rate of data prediction algorithms. Furthermore a ‘nonlinear conditional relationship’ among attributes of digital collections is discovered during the study of discretization algorithm and impacts the actual application result of traditional data mining algorithms.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hsieh-Yee, I.: Digital Library Evaluation: Progress & Next Steps. In: 2005 Annual Meeting of the American Society for Information Science and Technology Charlotte, N.C. (2005)

    Google Scholar 

  2. Snead, J.T., Bertot, J.C., Jaeger, P.T., McClure, C.R.: Developing multi-method, iterative, and user-centered evaluation strategies for digital libraries: Functionality, usability, and accessibility. Proceedings of the American Society for Information Science and Technology 42 (2005)

    Google Scholar 

  3. Kyrillidou, M., Giersch, S.: Developing the DigiQUAL protocol for digital library evaluation. In: Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries, Denver, CO, USA (2005)

    Google Scholar 

  4. Zhao, Y., Niu, Z., Dai, L.: Evaluation algorithm about digital library collections based on data mining technology. In: Proceedings of Role of Digital Libraries in a Time of Global Change - 12th International Conference on Asia-Pacific Digital Libraries (2010)

    Google Scholar 

  5. Han, J., Kamber, M.: Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann Pub., San Francisco (2011)

    MATH  Google Scholar 

  6. Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann Pub., San Francisco (2005)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhao, Y., Niu, Z., Peng, X., Dai, L. (2011). A Discretization Algorithm of Numerical Attributes for Digital Library Evaluation Based on Data Mining Technology. In: Xing, C., Crestani, F., Rauber, A. (eds) Digital Libraries: For Cultural Heritage, Knowledge Dissemination, and Future Creation. ICADL 2011. Lecture Notes in Computer Science, vol 7008. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24826-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24826-9_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24825-2

  • Online ISBN: 978-3-642-24826-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics