Skip to main content

Abstract

Web content has exploded dramatically in the last decade and search is becoming increasingly com plex. In the current search paradigm, the user has to enter the query and is immediately presented results that are typically accessed sequentially. However, there are scenarios where the above model is not appropriate, either because results being in consumable form is more important than immediacy of results, or because the it is difficult and time consuming to navigate the results in sequential fashion. In this work, we describe the architecture, implementation and utility of STAIR- The System for Topical and Aggregated Information Retrieval, that uses a variant of focused crawling and retrieves just the relevant information from the web. We present a new interface that selects search results from different search engines, ranks the results and presents the most relevant results as an aggregated PDF document. User studies indicate that the relevance of the results produced by our approach is competitive with those of current search engines

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aggarwal, C. C.: Learning Strategies for Topic Specific Web Crawling. IBM T.J. Watson Research Center.

    Google Scholar 

  2. Sizov, S., Biwer M., et al.: The BINGO! System for Information portal Generation and Expert Web Search. CIDR Conference. (2003)

    Google Scholar 

  3. Chakrabarti, S., Martin, H., Berg, V. D., Dom B.: Distributed Hypertext Resource Discovery through Examples. VLDB Conference. (1999)

    Google Scholar 

  4. Hersovici M., Jacovi M., et al.: Shark Search Algorithm. An application: Tailored Web Site Mapping, Elsevier. (1998)

    Google Scholar 

  5. Pandey S., Olston C.: Crawl Ordering by Search Impact. WSDM’ 08. (2008)

    Google Scholar 

  6. Raghavan P. et al.: Introduction to Information Retrieval. Cambridge University Press. (2008)

    Google Scholar 

  7. Pandit S., Olston C.: Navigation aided retrieval, WWW conference, 2007. pp. 391–400. (2007)

    Google Scholar 

  8. Ryen White et al.: Enhancing web search by promoting multiple search engine use, SIGIR 2008, pp. 43–50.(2008)

    Google Scholar 

  9. McBruyan O. A.: Tools for Taming the Web, First International Conference on the World Wide Web. GENVL and WWWW CERN. Geneva (Switzerland). May 25–27 (1994)

    Google Scholar 

  10. Brin S., Page L.: The anatomy of a large-scale hyper textual web search engine. WWW7. pp 107–117. (1998)

    Google Scholar 

  11. Aktas M. S., Nacar M. A., Menczer F.: Using Hyperlink Features to Personalize Web Search. Indiana University.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Indian Institute of Information Technology, India

About this paper

Cite this paper

Krishnakumar, C.V., Ramanathan, K. (2009). STAIR: A System for Topical and Aggregated Information Retrieval. In: Tiwary, U.S., Siddiqui, T.J., Radhakrishna, M., Tiwari, M.D. (eds) Proceedings of the First International Conference on Intelligent Human Computer Interaction. Springer, New Delhi. https://doi.org/10.1007/978-81-8489-203-1_28

Download citation

  • DOI: https://doi.org/10.1007/978-81-8489-203-1_28

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-8489-404-2

  • Online ISBN: 978-81-8489-203-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics