STAIR: A System for Topical and Aggregated Information Retrieval

Krishnakumar, C. V.; Ramanathan, Krishnan

doi:10.1007/978-81-8489-203-1_28

C. V. Krishnakumar² &
Krishnan Ramanathan³

1118 Accesses

Abstract

Web content has exploded dramatically in the last decade and search is becoming increasingly com plex. In the current search paradigm, the user has to enter the query and is immediately presented results that are typically accessed sequentially. However, there are scenarios where the above model is not appropriate, either because results being in consumable form is more important than immediacy of results, or because the it is difficult and time consuming to navigate the results in sequential fashion. In this work, we describe the architecture, implementation and utility of STAIR- The System for Topical and Aggregated Information Retrieval, that uses a variant of focused crawling and retrieves just the relevant information from the web. We present a new interface that selects search results from different search engines, ranks the results and presents the most relevant results as an aggregated PDF document. User studies indicate that the relevance of the results produced by our approach is competitive with those of current search engines

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aggarwal, C. C.: Learning Strategies for Topic Specific Web Crawling. IBM T.J. Watson Research Center.
Google Scholar
Sizov, S., Biwer M., et al.: The BINGO! System for Information portal Generation and Expert Web Search. CIDR Conference. (2003)
Google Scholar
Chakrabarti, S., Martin, H., Berg, V. D., Dom B.: Distributed Hypertext Resource Discovery through Examples. VLDB Conference. (1999)
Google Scholar
Hersovici M., Jacovi M., et al.: Shark Search Algorithm. An application: Tailored Web Site Mapping, Elsevier. (1998)
Google Scholar
Pandey S., Olston C.: Crawl Ordering by Search Impact. WSDM’ 08. (2008)
Google Scholar
Raghavan P. et al.: Introduction to Information Retrieval. Cambridge University Press. (2008)
Google Scholar
Pandit S., Olston C.: Navigation aided retrieval, WWW conference, 2007. pp. 391–400. (2007)
Google Scholar
Ryen White et al.: Enhancing web search by promoting multiple search engine use, SIGIR 2008, pp. 43–50.(2008)
Google Scholar
McBruyan O. A.: Tools for Taming the Web, First International Conference on the World Wide Web. GENVL and WWWW CERN. Geneva (Switzerland). May 25–27 (1994)
Google Scholar
Brin S., Page L.: The anatomy of a large-scale hyper textual web search engine. WWW7. pp 107–117. (1998)
Google Scholar
Aktas M. S., Nacar M. A., Menczer F.: Using Hyperlink Features to Personalize Web Search. Indiana University.
Google Scholar

Download references

Author information

Authors and Affiliations

Stanford University, California, USA
C. V. Krishnakumar
HP Laboratories, India
Krishnan Ramanathan

Authors

C. V. Krishnakumar
View author publications
You can also search for this author in PubMed Google Scholar
Krishnan Ramanathan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Indian Institute of Information Technology, Allahabad, India
U. S. Tiwary (Professor), Tanveer J. Siddiqui (Assistant Professor), M. Radhakrishna (Professor) & M. D. Tiwari (Director) (Professor), (Assistant Professor), (Professor) & (Director)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Krishnakumar, C.V., Ramanathan, K. (2009). STAIR: A System for Topical and Aggregated Information Retrieval. In: Tiwary, U.S., Siddiqui, T.J., Radhakrishna, M., Tiwari, M.D. (eds) Proceedings of the First International Conference on Intelligent Human Computer Interaction. Springer, New Delhi. https://doi.org/10.1007/978-81-8489-203-1_28

Download citation

DOI: https://doi.org/10.1007/978-81-8489-203-1_28
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-8489-404-2
Online ISBN: 978-81-8489-203-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics