Skip to main content

Augmenting Suffix Trees, with Applications

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1461))

Abstract

Information retrieval and data compression are the two main application areas where the rich theory of string algorithmics plays a fundamental role. In this paper, we consider one algorithmic problem from each of these areas and present highly efficient (linear or near linear time) algorithms for both problems. Our algorithms rely on augmenting the suffix tree, a fundamental data structure in string algorithmics. The augmentations are nontrivial and they form the technical crux of this paper. In particular, they consist of adding extra edges to suffix trees, resulting in Directed Acyclic Graphs (DAGs). Our algorithms construct these “suffix DAGs” and manipulate them to solve the two problems efficiently.

Partly supported by Alon Fellowship.

Partly supported by ESPRIT LTR Project no. 20244 - ALCOM IT.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. T. Bell, T. Cleary, and I. Witten. Text Compression. Academic Press, 1990.

    Google Scholar 

  2. G. S. Brodal. Finger search trees with constant insertion time. In ACM-SIAM Symposium on Discrete Algorithms, 1998.

    Google Scholar 

  3. M. Burrows and D. J. Wheeler. A block sorting lossless data compression algorithm. Technical Report 124, DEC SRC, 1994.

    Google Scholar 

  4. M. Crochemore and W. Rytter. Text Algorithms. Oxford Press, 1994.

    Google Scholar 

  5. J. G. Cleary and I. H. Witten. Data compression using adaptive coding and partial string matching. IEEE Transactions on Communications, 32(4):396–402, 1984.

    Article  Google Scholar 

  6. D. M. Gusfield. Algorithms on Strings, Trees, and Sequences. Addison Wesley, 1998.

    Google Scholar 

  7. J. Hui. Color set size problem with applications to string matching. In Combinatorial Pattern Matching, 1992.

    Google Scholar 

  8. Y. Hershkovits and J. Ziv. On sliding window universal data compression with limited memory. In Information Theory symposium, pages 17–22, September 1995.

    Google Scholar 

  9. Y. Hershkovits and J. Ziv. On sliding window universal data compression with limited memory. IEEE Trans. on Information Theory, 44:66–78, January 1998.

    Article  MATH  MathSciNet  Google Scholar 

  10. E. M. McCreight. A space economical suffix tree construction algorithm. Journal of the ACM, 23(2):262–272, April 1976.

    Article  MATH  MathSciNet  Google Scholar 

  11. M. Rodeh, V. Pratt, and S. Even. Linear algorithm for data compression via string matching. Journal of the ACM, 28(1):16–24, January 1981.

    Article  MATH  MathSciNet  Google Scholar 

  12. B. Schieber and U. Vishkin. On finding lowest common ancestors:simplification and parallelization. SIAM Journal of Computing, 17:1253–1262, 1988.

    Article  MATH  MathSciNet  Google Scholar 

  13. T.A. Welch. A technique for high-performance data compression. IEEE Computer, pages 8–19, January 1984.

    Google Scholar 

  14. M. J. Weinberger, J. J. Rissanen, and M. Feder. A universal finite memory source. IEEE Transactions on Information Theory, 41(3):643–652, 1995.

    Article  MATH  Google Scholar 

  15. H. Yokoo. An adaptive data compression method based on context sorting. In IEEE Data Compression Conference, 1996.

    Google Scholar 

  16. J. Ziv and A. Lempel. A universal algorithm for sequential data compression. IEEE Transactions on Information Theory, IT-23(3):337–343, May 1977.

    Article  MathSciNet  Google Scholar 

  17. J. Ziv and A. Lempel. Compression of individual sequences via variable-rate coding. IEEE Transactions on Information Theory, IT-24(5):530–536, September 1978.

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Matias, Y., Muthukrishnan, S., Sahinalp, S.C., Ziv, J. (1998). Augmenting Suffix Trees, with Applications. In: Bilardi, G., Italiano, G.F., Pietracaprina, A., Pucci, G. (eds) Algorithms — ESA’ 98. ESA 1998. Lecture Notes in Computer Science, vol 1461. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-68530-8_6

Download citation

  • DOI: https://doi.org/10.1007/3-540-68530-8_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64848-2

  • Online ISBN: 978-3-540-68530-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics