Skip to main content

A Method for Integrating Interfaces Based on Cluster Ensemble in Digital Library Federation

  • Conference paper
  • First Online:
Frontier and Future Development of Information Technology in Medicine and Education

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 269))

  • 117 Accesses

Abstract

Recently, there are more demands in a digital library federation to integrate multiple query interfaces into one for users. Since different interfaces have various descriptions for the same concept and the amount of interfaces are numerous, it is hard to provide complete and exact domain knowledge. Hence, the methods of clustering are usually adopted to generate an integrated interface. However, over one same properties set, the results for clustering may be diverse according to the differences of clustering algorithms or parameters setting for the same algorithm. Nevertheless, we could obtain one more complete and exact integrated interface with the aid of cluster ensemble by merging multiple clustering results. In this paper, based on the principle of cluster ensemble, we propose a single clustering algorithm with uncertainty regarding that one property may belong to more than one possible cluster division during integration. We also propose a fusing cluster algorithm to obtain cluster ensemble that satisfying interface integration and it shows favorable performances than the existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 429.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 549.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Fox EA (1993) Source book on digital libraries. Technical Report TR-93-35 Virginia Polytechnic Institute and State University

    Google Scholar 

  2. Harter SP (1996) What is a digital library? definitions, content, and issues. In: KOLISS DL 1996

    Google Scholar 

  3. Birmingham B et al.(2001) EU-NSF digital library working group on interoperability between digital libraries. http://www.iei.pi.cnr.it/DELOS/NSF/interop.htm Accessed 15 Dec 2001

  4. He H, Meng W, Yu C, Wu Z (2003) WISE-integrator: an automatic integrator of web search interfaces for e-commerce. In: Proceedings of the 29th international conference on very large data bases (VLDB), Berlin, 2003 pp 357–368

    Google Scholar 

  5. He B, Chang KC-C (2003) Statistical schema matching across web query interfaces. In: Proceedings of the 2003 ACM SIGMOD international conference on management of data, San Diego, California, 2003 pp 217–228

    Google Scholar 

  6. Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323

    Article  Google Scholar 

  7. Wu W, Yu C, Doan A, Meng W (2004) An interactive clustering-based approach to integrating source query interfaces on the deep web. In: Proceedings of the 23th ACM SIGMOD international conference on management of data, Paris, 2004 pp 95–106

    Google Scholar 

  8. He B, Chang KC-C, Han J (2004) Discovering complex matchings across web query interfaces: a correlation mining approach. In: Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining, Seattle, 2004 pp 148–157

    Google Scholar 

  9. He B, Chang KC-C, Han J (2004) Mining complex matchings across web query interfaces. In: Proceedings of the 9th ACM SIGMOD workshop on research issues in data mining and knowledge discovery, Paris, 2004 pp 3–10

    Google Scholar 

  10. WordNet http://wordnet.princeton.edu/

  11. Wang J, Wen JR, Lochovsky F, Ma WY (2004) Instance-based schema matching for web databases by domain-specific query probing. In: Proceedings of the thirtieth international conference on very large data bases, Toronto, Canada, 2004, pp 408–419

    Google Scholar 

  12. Wu W, Doan AH, Yu C (2006) WebIQ: learning from the web to match deep-web query interfaces. In: Proceedings of the 22nd international conference on data engineering, Washington, DC, USA, 2006, pp 44

    Google Scholar 

  13. Topchy AP, Jain AK, Punch WF (2004) A mixture model for clustering ensembles. In: Proceedings of the fourth SIAM international conference on data mining, Lake Buena Vista, Florida, USA pp 379–390

    Google Scholar 

  14. Chang KC-C, He B ,Li C, Zhang Z (2003) The UIUC web integration repository. Computer Science Department, University of Illinois at Urbana-Champaign. http://metaquerier.cs.uiuc.edu/repository, 2003

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qingzhong Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media Dordrecht

About this paper

Cite this paper

Pan, P., Li, Q., Fang, X. (2014). A Method for Integrating Interfaces Based on Cluster Ensemble in Digital Library Federation. In: Li, S., Jin, Q., Jiang, X., Park, J. (eds) Frontier and Future Development of Information Technology in Medicine and Education. Lecture Notes in Electrical Engineering, vol 269. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-7618-0_86

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-7618-0_86

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-7617-3

  • Online ISBN: 978-94-007-7618-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics