Abstract
In this paper, we discuss our participation to the INEX 2008 Link-the-Wiki track. We utilized a sliding window based algorithm to extract the frequent terms and phrases. Using the extracted phrases and term as descriptive vectors, the anchors and relevant links (both incoming and outgoing) are recognized efficiently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Trotman, A., Geva, S.: Passage Retrieval and other XML-Retrieval Tasks. In: Proceedings of SIGIR 2006 Workshop on XML Element Retrieval Methodology, Seattle, Washington, USA, pp. 48–50 (2006)
Porter, M.F.: An algorithm for suffix stripping. Automated Library and Information Systems 14, 130–137 (1980)
Kostoff, R.N., Tshiteya, R., Pfeil, K.M., Humenik, J.A.: Electrochemical power text mining using bibliometrics and database tomography. Journal of Power Sources 110, 163–176 (2002)
Myat, N.N., Hla, K.H.S.: A Combined Approach of Formal Concept Analysis And Text Mining For Concept Based Document Clustering. In: IEEE/WIC/ACM International Conference on Web Intelligence 2005, p. 4 (2005)
Girju, R., Badulescu, A., Moldovan, D.: Learning semantic constraints for the automatic discovery of part-whole relations. In: 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1. Association for Computational Linguistics, Edmonton (2003)
Hideo, J., Mark, S.: Retrieving descriptive phrases from large amounts of free text. In: 9th international conference on Information and knowledge management. ACM, McLean (2000)
Parisut, J., Worapoj, K.: Dimensionality reduction of features for text categorization. In: 3rd conference on IASTED International Conference: Advances in Computer Science and Technology. ACTA Press, Phuket (2007)
Beil, F., Ester, M., Xu, X.: Frequent term-based text clustering. In: 8th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, Edmonton (2002)
Yanjun, L., Soon, M.C.: Text document clustering based on frequent word sequences. In: 14th ACM international conference on Information and knowledge management, pp. 293–294. ACM, Bremen (2005)
Shen, D., Chen, Z., Yang, Q.Z.: H., Zhang, B., Lu, Y., Ma, W.: Web-page classification through summarization. In: SIGIR 2004: Proceeding of the 27th ACM Int. Conference on Research and development in information retrieval, Sheffield, pp. 242–249 (2004)
Lei, Z., Debbie, Z., Simeon, J.S., John, D.: Weighted kernel model for text categorization. In: 5th Australasian conference on Data mining and analystics, vol. 61. Australian Computer Society, Inc., Sydney (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, ML.(., Nayak, R., Geva, S. (2009). Link-the-Wiki: Performance Evaluation Based on Frequent Phrases. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-03761-0_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03760-3
Online ISBN: 978-3-642-03761-0
eBook Packages: Computer ScienceComputer Science (R0)