Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3321))

Included in the following conference series:

Abstract

The fast vector space and probabilistic methods use the term counts and the slower proximity methods use term positions. We present the spectral-based information retrieval method which is able to use both term count and position information to obtain high precision document rankings. We are able to perform this, in a time comparable to the vector space method, by examining the query term spectra rather than query term positions. This method is a generalisation of the vector space method (VSM). Therefore, our spectral method can use the weighting schemes and enhancements used in the VSM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zobel, J., Moffat, A.: Exploring the similarity space. ACM SIGIR Forum 32, 18–34 (1998)

    Article  Google Scholar 

  2. Jones, K.S., Walker, S., Robertson, S.E.: A probabilistic model of information retrieval: development and comparative experiments. Information Processing and Management 36, 779–808 (2000)

    Article  Google Scholar 

  3. Clarke, C.L.A., Cormack, G.V.: Shortest-substring retrieval and ranking. ACM Transactions on Information Systems 18, 44–78 (2000)

    Article  Google Scholar 

  4. Hawking, D., Thistlewaite, P.: Relevance weighting using distance between term occurrences. Technical Report TR-CS-96-08, The Australian National University (1996)

    Google Scholar 

  5. Harman, D.: Relevance feedback revisited. In: Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 1–10. ACM Press, New York (1992)

    Chapter  Google Scholar 

  6. Jing, Y., Croft, W.B.: An association thesaurus for information retrieval. In: Proc. of Intelligent Multimedia Retrieval Systems and Management Conference (RIAO), pp. 146–160 (1994)

    Google Scholar 

  7. Park, L.A.F., Palaniswami, M., Kotagiri, R.: Internet document filtering using fourier domain scoring. In: Siebes, A., De Raedt, L. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168, pp. 362–373. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  8. Park, L.A.F., Ramamohanarao, K., Palaniswami, M.: Fourier domain scoring: A novel document ranking method. IEEE Transactions on Knowledge and Data Engineering 16, 529–539 (2004)

    Article  Google Scholar 

  9. Park, L.A.F., Palaniswami, M., Ramamohanarao, K.: A novel web text mining method using the discrete cosine transform. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, pp. 385–396. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  10. Park, L.A.F., Ramamohanarao, K., Palaniswami, M.: A new implementation technique for fast spectral based document retrieval systems. In: Kumar, V., Tsumoto, S. (eds.) 2002 IEEE International Conference on Data Mining, pp. 346–353. IEEE Computer Society, Los Alamitos (2002)

    Chapter  Google Scholar 

  11. Park, L.A.F., Palaniswami, M., Ramamohanarao, K.: A novel document ranking method using the discrete cosine transform. IEEE Transactions on Pattern Analysis and Machine Intelligence (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ramamohanarao, K., Park, L.A.F. (2004). Spectral-Based Document Retrieval. In: Maher, M.J. (eds) Advances in Computer Science - ASIAN 2004. Higher-Level Decision Making. ASIAN 2004. Lecture Notes in Computer Science, vol 3321. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30502-6_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30502-6_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24087-7

  • Online ISBN: 978-3-540-30502-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics