Link-the-Wiki: Performance Evaluation Based on Frequent Phrases

Chen, Mao-Lung (Edward); Nayak, Richi; Geva, Shlomo

doi:10.1007/978-3-642-03761-0_33

Mao-Lung (Edward) Chen¹⁹,
Richi Nayak¹⁹ &
Shlomo Geva¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5631))

Included in the following conference series:

International Workshop of the Initiative for the Evaluation of XML Retrieval

398 Accesses

Abstract

In this paper, we discuss our participation to the INEX 2008 Link-the-Wiki track. We utilized a sliding window based algorithm to extract the frequent terms and phrases. Using the extracted phrases and term as descriptive vectors, the anchors and relevant links (both incoming and outgoing) are recognized efficiently.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Trotman, A., Geva, S.: Passage Retrieval and other XML-Retrieval Tasks. In: Proceedings of SIGIR 2006 Workshop on XML Element Retrieval Methodology, Seattle, Washington, USA, pp. 48–50 (2006)
Google Scholar
Porter, M.F.: An algorithm for suffix stripping. Automated Library and Information Systems 14, 130–137 (1980)
Article Google Scholar
Kostoff, R.N., Tshiteya, R., Pfeil, K.M., Humenik, J.A.: Electrochemical power text mining using bibliometrics and database tomography. Journal of Power Sources 110, 163–176 (2002)
Article Google Scholar
Myat, N.N., Hla, K.H.S.: A Combined Approach of Formal Concept Analysis And Text Mining For Concept Based Document Clustering. In: IEEE/WIC/ACM International Conference on Web Intelligence 2005, p. 4 (2005)
Google Scholar
Girju, R., Badulescu, A., Moldovan, D.: Learning semantic constraints for the automatic discovery of part-whole relations. In: 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1. Association for Computational Linguistics, Edmonton (2003)
Google Scholar
Hideo, J., Mark, S.: Retrieving descriptive phrases from large amounts of free text. In: 9th international conference on Information and knowledge management. ACM, McLean (2000)
Google Scholar
Parisut, J., Worapoj, K.: Dimensionality reduction of features for text categorization. In: 3rd conference on IASTED International Conference: Advances in Computer Science and Technology. ACTA Press, Phuket (2007)
Google Scholar
Beil, F., Ester, M., Xu, X.: Frequent term-based text clustering. In: 8th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, Edmonton (2002)
Google Scholar
Yanjun, L., Soon, M.C.: Text document clustering based on frequent word sequences. In: 14th ACM international conference on Information and knowledge management, pp. 293–294. ACM, Bremen (2005)
Google Scholar
Shen, D., Chen, Z., Yang, Q.Z.: H., Zhang, B., Lu, Y., Ma, W.: Web-page classification through summarization. In: SIGIR 2004: Proceeding of the 27th ACM Int. Conference on Research and development in information retrieval, Sheffield, pp. 242–249 (2004)
Google Scholar
Lei, Z., Debbie, Z., Simeon, J.S., John, D.: Weighted kernel model for text categorization. In: 5th Australasian conference on Data mining and analystics, vol. 61. Australian Computer Society, Inc., Sydney (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Science and Technology, Queensland University of Technology, Australia
Mao-Lung (Edward) Chen, Richi Nayak & Shlomo Geva

Authors

Mao-Lung (Edward) Chen
View author publications
You can also search for this author in PubMed Google Scholar
Richi Nayak
View author publications
You can also search for this author in PubMed Google Scholar
Shlomo Geva
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Science and Technology, Queensland University of Technology, GPO Box 2434, 4001, Brisband, Qld, Australia
Shlomo Geva
Archives and Information Studies/Humanities, University of Amsterdam, Turfdraagsterpad 9, 1012 XT, Amsterdam, The Netherlands
Jaap Kamps
Department of Computer Science, University of Otago, P.O. Box 56, 9054, Dunedin, New Zealand
Andrew Trotman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, ML.(., Nayak, R., Geva, S. (2009). Link-the-Wiki: Performance Evaluation Based on Frequent Phrases. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_33

Download citation

DOI: https://doi.org/10.1007/978-3-642-03761-0_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03760-3
Online ISBN: 978-3-642-03761-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics