Skip to main content

Word Sieve: A Method for Real-Time Context Extraction

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2116))

Abstract

In order to be useful, intelligent information retrieval agents must provide their users with context-relevant information. This paper presents WordSieve, an algorithm for automatically extracting information about the context in which documents are consulted during web browsing. Using information extracted from the stream of documents consulted by the user, WordSieve automatically builds context profiles which differentiate sets of documents that users tend to access in groups. These profiles are used in a research-aiding system to index documents consulted in the current context and pro-actively suggest them to users in similar future contexts. In initial experiments on the capability to match documents to the task contexts in which they were consulted, WordSieve indexing outperformed indexing based on Term Frequency/Inverse Document Frequency, a common document indexing approach for intelligent agents in information retrieval.

Travis Bauer’s research is supported in part by the Department of Education under award P200A80301-98. David Leake’s research is supported in part by NASA under awards NCC 2-1035 and NCC 2-1216.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gediminas Adomavicius and Alexander Tuzhilin. Using data mining methods to build customer profiles. Computer, 34(2):74–82, February 2001.

    Google Scholar 

  2. V. Akman and M. Surav. Steps toward formalizing context. AI Magazine, 17(3):55–72, 1996.

    Google Scholar 

  3. Marko Balabanović. An interface for learning multi-topic user profiles from implicit feedback. In AAAI-98 Workshop on Recommender Systems, 1998.

    Google Scholar 

  4. M. Benerecetti, P. Bouquet, and C. Ghidini. Contextual reasoning distilled. Philosophical Foundations of Artificial Intelligence, 12(3), July 2000.

    Google Scholar 

  5. Bruno Bouzy and Tristan Cazenave. Using the object oriented paradigm to model context in computer go. In Proceedings of the First International and Interdisciplinary Conference on Modeling and Using Context, 1997.

    Google Scholar 

  6. J. Budzik and J. K. Hammond. Watson: Anticipating and contextualizing information needs. In Proceedings of the Sixty-second Annual Meeting of the American Society for Information Science, Medford, NJ, 1999. Information Today, Inc.

    Google Scholar 

  7. J. Budzik, K. Hammond, and L. Birnbaum. Information access in context. In Knowledge based systems, 2001.

    Google Scholar 

  8. J. Budzik, K. Hammond, L. Birnbaum, and M. Krema. Beyond similarity. In Working Notes of the AAAI-2000 Workshop on AI for Web Search. AAAI Press, Menlo Park, 2000.

    Google Scholar 

  9. P. Cotter and B. Smyth. PTV: Intelligent personalised tv guides. In Proceedings of the 12th Innovative Applications of Artificial Intelligence (IAAI-2000) Conference. AAAI Press, 2000.

    Google Scholar 

  10. Marti A. Hearst. Context and Structure in Automated Full-Text Information Access. PhD thesis, University of California at Berkeley, 1994.

    Google Scholar 

  11. Graeme Hirst. Context as a spurious concept. In Proceedings, Conference on Intelligent Processing and Computational Linguistics, pages 273–287, Mexico City, February 2000.

    Google Scholar 

  12. Eric Horvitz, Jack Breese, David Heckerman, David Hovel, and Koos Rommelse. Inferring the goals and needs of software users. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pages 256–265, 1998.

    Google Scholar 

  13. David Leake, Travis Bauer, Anna Maguitman, and David Wilson. Capture, storage and reuse of lessons about information resources: Supporting task-based information search. In Proceedings of the AAAI-2000 Workshop on Intelligent Lessons Learned Systems, Menlo Park, CA, 2000. AAAI Press.

    Google Scholar 

  14. Henry Lieberman, Neil Van Dyke, and Adriana Vivacqua. Let’s browse: A collaborative web browsing agent. In Proceedings of the 1999 international conference on Intelligent user interfaces, pages 65–68, 1999.

    Google Scholar 

  15. Carlo Penco. Objective and cognitive context. In Paolo Bouquet, Luigi Serafini, Patrick Brézillon, Massimo Benerecetti, and Francesca Castellani, editors, Modeling and Using Contexts: Proceedings of the Second International and Interdisciplinary Conference, pages 270–283. Springer-Verlag, 1999.

    Google Scholar 

  16. Bradley J. Rhodes. Margin notes: Building a contextually aware associative memory. In Proceedings of the 2000 international conference on Intelligent user interfaces, pages 219–224, Jan 2000.

    Google Scholar 

  17. Gerard Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley Series in Computer Science. Addison-Wesley Publishing Company, Inc., 1989.

    Google Scholar 

  18. Roy Turner. Context-sensitive reasoning for autonomous agents and cooperative distributed problem solving. In Proceedings of the IJCAI Workshop on Using Knowledge in its Context, Chambery, France, 1993.

    Google Scholar 

  19. Roy Turner. Context-mediated behavior. In J. Mira, A.P. del Pobil, and M. Ali, editors, Lecture Notes in Artificial Intelligence 1415: Methodology and Tools in Know ledge-Based Systems, pages 538–547, Benicassim, Spain, June 1998. Springer, New York.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bauer, T., Leake, D.B. (2001). Word Sieve: A Method for Real-Time Context Extraction. In: Akman, V., Bouquet, P., Thomason, R., Young, R. (eds) Modeling and Using Context. CONTEXT 2001. Lecture Notes in Computer Science(), vol 2116. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44607-9_3

Download citation

  • DOI: https://doi.org/10.1007/3-540-44607-9_3

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42379-9

  • Online ISBN: 978-3-540-44607-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics