STAIR: A System for Topical and Aggregated Information Retrieval

  • C. V. Krishnakumar
  • Krishnan Ramanathan
Conference paper


Web content has exploded dramatically in the last decade and search is becoming increasingly com plex. In the current search paradigm, the user has to enter the query and is immediately presented results that are typically accessed sequentially. However, there are scenarios where the above model is not appropriate, either because results being in consumable form is more important than immediacy of results, or because the it is difficult and time consuming to navigate the results in sequential fashion. In this work, we describe the architecture, implementation and utility of STAIR- The System for Topical and Aggregated Information Retrieval, that uses a variant of focused crawling and retrieves just the relevant information from the web. We present a new interface that selects search results from different search engines, ranks the results and presents the most relevant results as an aggregated PDF document. User studies indicate that the relevance of the results produced by our approach is competitive with those of current search engines


Search Engine User Profile Collaborative Filter Relevant Page Focus Crawling 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Aggarwal, C. C.: Learning Strategies for Topic Specific Web Crawling. IBM T.J. Watson Research Center.Google Scholar
  2. 2.
    Sizov, S., Biwer M., et al.: The BINGO! System for Information portal Generation and Expert Web Search. CIDR Conference. (2003)Google Scholar
  3. 3.
    Chakrabarti, S., Martin, H., Berg, V. D., Dom B.: Distributed Hypertext Resource Discovery through Examples. VLDB Conference. (1999)Google Scholar
  4. 4.
    Hersovici M., Jacovi M., et al.: Shark Search Algorithm. An application: Tailored Web Site Mapping, Elsevier. (1998)Google Scholar
  5. 5.
    Pandey S., Olston C.: Crawl Ordering by Search Impact. WSDM’ 08. (2008)Google Scholar
  6. 6.
    Raghavan P. et al.: Introduction to Information Retrieval. Cambridge University Press. (2008)Google Scholar
  7. 7.
    Pandit S., Olston C.: Navigation aided retrieval, WWW conference, 2007. pp. 391–400. (2007)Google Scholar
  8. 8.
    Ryen White et al.: Enhancing web search by promoting multiple search engine use, SIGIR 2008, pp. 43–50.(2008)Google Scholar
  9. 9.
    McBruyan O. A.: Tools for Taming the Web, First International Conference on the World Wide Web. GENVL and WWWW CERN. Geneva (Switzerland). May 25–27 (1994)Google Scholar
  10. 10.
    Brin S., Page L.: The anatomy of a large-scale hyper textual web search engine. WWW7. pp 107–117. (1998)Google Scholar
  11. 11.
    Aktas M. S., Nacar M. A., Menczer F.: Using Hyperlink Features to Personalize Web Search. Indiana University.Google Scholar

Copyright information

© Indian Institute of Information Technology, India 2009

Authors and Affiliations

  • C. V. Krishnakumar
    • 1
  • Krishnan Ramanathan
    • 2
  1. 1.Stanford UniversityCaliforniaUSA
  2. 2.HP LaboratoriesIndia

Personalised recommendations