Abstract
Web content has exploded dramatically in the last decade and search is becoming increasingly com plex. In the current search paradigm, the user has to enter the query and is immediately presented results that are typically accessed sequentially. However, there are scenarios where the above model is not appropriate, either because results being in consumable form is more important than immediacy of results, or because the it is difficult and time consuming to navigate the results in sequential fashion. In this work, we describe the architecture, implementation and utility of STAIR- The System for Topical and Aggregated Information Retrieval, that uses a variant of focused crawling and retrieves just the relevant information from the web. We present a new interface that selects search results from different search engines, ranks the results and presents the most relevant results as an aggregated PDF document. User studies indicate that the relevance of the results produced by our approach is competitive with those of current search engines
Preview
Unable to display preview. Download preview PDF.
References
Aggarwal, C. C.: Learning Strategies for Topic Specific Web Crawling. IBM T.J. Watson Research Center.
Sizov, S., Biwer M., et al.: The BINGO! System for Information portal Generation and Expert Web Search. CIDR Conference. (2003)
Chakrabarti, S., Martin, H., Berg, V. D., Dom B.: Distributed Hypertext Resource Discovery through Examples. VLDB Conference. (1999)
Hersovici M., Jacovi M., et al.: Shark Search Algorithm. An application: Tailored Web Site Mapping, Elsevier. (1998)
Pandey S., Olston C.: Crawl Ordering by Search Impact. WSDM’ 08. (2008)
Raghavan P. et al.: Introduction to Information Retrieval. Cambridge University Press. (2008)
Pandit S., Olston C.: Navigation aided retrieval, WWW conference, 2007. pp. 391–400. (2007)
Ryen White et al.: Enhancing web search by promoting multiple search engine use, SIGIR 2008, pp. 43–50.(2008)
McBruyan O. A.: Tools for Taming the Web, First International Conference on the World Wide Web. GENVL and WWWW CERN. Geneva (Switzerland). May 25–27 (1994)
Brin S., Page L.: The anatomy of a large-scale hyper textual web search engine. WWW7. pp 107–117. (1998)
Aktas M. S., Nacar M. A., Menczer F.: Using Hyperlink Features to Personalize Web Search. Indiana University.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Indian Institute of Information Technology, India
About this paper
Cite this paper
Krishnakumar, C.V., Ramanathan, K. (2009). STAIR: A System for Topical and Aggregated Information Retrieval. In: Tiwary, U.S., Siddiqui, T.J., Radhakrishna, M., Tiwari, M.D. (eds) Proceedings of the First International Conference on Intelligent Human Computer Interaction. Springer, New Delhi. https://doi.org/10.1007/978-81-8489-203-1_28
Download citation
DOI: https://doi.org/10.1007/978-81-8489-203-1_28
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-8489-404-2
Online ISBN: 978-81-8489-203-1
eBook Packages: Computer ScienceComputer Science (R0)