Skip to main content

Link-the-Wiki: Performance Evaluation Based on Frequent Phrases

  • Conference paper
Advances in Focused Retrieval (INEX 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5631))

  • 398 Accesses

Abstract

In this paper, we discuss our participation to the INEX 2008 Link-the-Wiki track. We utilized a sliding window based algorithm to extract the frequent terms and phrases. Using the extracted phrases and term as descriptive vectors, the anchors and relevant links (both incoming and outgoing) are recognized efficiently.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Trotman, A., Geva, S.: Passage Retrieval and other XML-Retrieval Tasks. In: Proceedings of SIGIR 2006 Workshop on XML Element Retrieval Methodology, Seattle, Washington, USA, pp. 48–50 (2006)

    Google Scholar 

  2. Porter, M.F.: An algorithm for suffix stripping. Automated Library and Information Systems 14, 130–137 (1980)

    Article  Google Scholar 

  3. Kostoff, R.N., Tshiteya, R., Pfeil, K.M., Humenik, J.A.: Electrochemical power text mining using bibliometrics and database tomography. Journal of Power Sources 110, 163–176 (2002)

    Article  Google Scholar 

  4. Myat, N.N., Hla, K.H.S.: A Combined Approach of Formal Concept Analysis And Text Mining For Concept Based Document Clustering. In: IEEE/WIC/ACM International Conference on Web Intelligence 2005, p. 4 (2005)

    Google Scholar 

  5. Girju, R., Badulescu, A., Moldovan, D.: Learning semantic constraints for the automatic discovery of part-whole relations. In: 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1. Association for Computational Linguistics, Edmonton (2003)

    Google Scholar 

  6. Hideo, J., Mark, S.: Retrieving descriptive phrases from large amounts of free text. In: 9th international conference on Information and knowledge management. ACM, McLean (2000)

    Google Scholar 

  7. Parisut, J., Worapoj, K.: Dimensionality reduction of features for text categorization. In: 3rd conference on IASTED International Conference: Advances in Computer Science and Technology. ACTA Press, Phuket (2007)

    Google Scholar 

  8. Beil, F., Ester, M., Xu, X.: Frequent term-based text clustering. In: 8th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, Edmonton (2002)

    Google Scholar 

  9. Yanjun, L., Soon, M.C.: Text document clustering based on frequent word sequences. In: 14th ACM international conference on Information and knowledge management, pp. 293–294. ACM, Bremen (2005)

    Google Scholar 

  10. Shen, D., Chen, Z., Yang, Q.Z.: H., Zhang, B., Lu, Y., Ma, W.: Web-page classification through summarization. In: SIGIR 2004: Proceeding of the 27th ACM Int. Conference on Research and development in information retrieval, Sheffield, pp. 242–249 (2004)

    Google Scholar 

  11. Lei, Z., Debbie, Z., Simeon, J.S., John, D.: Weighted kernel model for text categorization. In: 5th Australasian conference on Data mining and analystics, vol. 61. Australian Computer Society, Inc., Sydney (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, ML.(., Nayak, R., Geva, S. (2009). Link-the-Wiki: Performance Evaluation Based on Frequent Phrases. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03761-0_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03760-3

  • Online ISBN: 978-3-642-03761-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics