Skip to main content

Coding Trees as Strings for Approximate Tree Matching

  • Conference paper
Sequences II

Abstract

In this paper we consider matching problems on arbitrary ordered labelled trees and ranked trees, which have important applications in many fields such as molecular biology, term rewriting systems and language processing. Given a text tree T and a pattern tree P, we derive an algorithm to find all occurrences of P in T with bounded distance k, in time O(kǀTǀ + ǀPǀ). The distance refers to the number of subtrees to be inserted or deleted from T to obtain P. This problem is an extension of the tree pattern matching problem where deletions of subtrees occur only in T, and of the approximate string matching problem applied to trees. Extensions of the algorithm to solve other relevant problems, such as ranked trees matching, as well as their parallel versions are then devised.

This work has been partially supported by MURST of Italy

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A.V. Aho, M. Ganapathi and S.W.K. Tjiang, Code generation using tree matching and dynamic programming, ACM Trans. on Prog. Lang. Sys. 11 (1989) 491–516.

    Article  Google Scholar 

  2. J. Cai, R. Paige and R. Tarjan, More efficient bottom-up tree pattern matching, Proc. CAAP’90, in: LNCS 431 (1990) 72–86.

    Google Scholar 

  3. M. Dubiner, Z. Galil and E. Magen, Faster tree pattern matching, Proc. 31-st IEEE Symp. on Found. of Comp. Sc., (1990) 145–150.

    Google Scholar 

  4. Z. Galil and R. Giancarlo, Data structures and algorithms for approximate string matching, J. Complexity 4 (1988) 33–72.

    Article  MathSciNet  MATH  Google Scholar 

  5. R. Grossi, A note on the subtree isomorphism for ordered trees and related problems, Inf. Proc. Let. (to appear).

    Google Scholar 

  6. D. Harel and R.E. Tarjan, Fast algorithms for finding nearest common ancestors, SIAM J. Comp. 13 (1984) 338–355.

    Article  MathSciNet  MATH  Google Scholar 

  7. C.M. Hoffmann and M.J. O’Donnel, Pattern matching in trees, J. ACM 29 (1982) 68–95.

    Article  MATH  Google Scholar 

  8. P. Kilpelänen and H. Mannila, The tree inclusion problem, Proc. CAAP’91, in: LNCS 493, 1 (1991) 202–214.

    Google Scholar 

  9. S.R. Kosaraju, Efficient tree pattern matching, Proc. 30-th IEEE Symp. on found. of Comp. Sc, (1989) 178–183.

    Google Scholar 

  10. G.M. Landau and U. Vishkin, Fast parallel and serial approximate string matching, J. of Algorithms 10 (1989) 157–169.

    Article  MathSciNet  MATH  Google Scholar 

  11. F. Luccio and L. Pagli, Simple solutions for approximate tree matching problems, Proc. CAAP’91, in: LNCS 493, 1 (1991) 193–201.

    MathSciNet  Google Scholar 

  12. F. Luccio and L. Pagli, An efficient algorithm for some tree matching problems, Inf. Proc. Let. (to appear).

    Google Scholar 

  13. E. Mäkinen, On the subtree isomorphism problem for ordered trees, Inf. Proc. Let. 32 (1989) 271–273.

    Article  MATH  Google Scholar 

  14. R. Ramesh and I.V. Ramakrishnan, Nonlinear pattern matching, Proc. ICALP’88, in: LNCS 317 (1988) 473–488.

    Google Scholar 

  15. D. Sankoff and J.B. Kruskal, Eds., Time warps, string edits and macro-molecules: the theory and practice of sequence comparison, Addison-Wesley, Reading, MA 1983.

    Google Scholar 

  16. B. Schieber and U. Vishkin, On finding lowest common ancestor: simplification and parallelization, SIAM J. Comp. 17 (1988) 1253–1262.

    Article  MathSciNet  MATH  Google Scholar 

  17. D. Shasha and K. Zhang, Fast algorithms for the unit cost editing distance between trees, J. of Algorithms 11 (1990) 581–621.

    Article  MathSciNet  MATH  Google Scholar 

  18. J.M. Steyaert and P. Flajolet, Patterns and pattern-matching in trees: an analysis, Information and Control 58 (1983) 19–58.

    Article  MathSciNet  MATH  Google Scholar 

  19. L. Stryer, Biochemestry, 3-rd edition, W.H.Freeman and Co., New York, NY 1988.

    Google Scholar 

  20. K.-C. Tai, The tree-to-tree correction problem, J. ACM 26 (1979) 422–433.

    Article  MathSciNet  MATH  Google Scholar 

  21. K. Zhang and D. Shasha, Simple fast algorithms for the editing distance between trees and related problems, SIAM J. Comp. 18 (1989) 1245–1262.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1993 Springer-Verlag New York, Inc.

About this paper

Cite this paper

Grossi, R., Luccio, F., Pagli, L. (1993). Coding Trees as Strings for Approximate Tree Matching. In: Capocelli, R., De Santis, A., Vaccaro, U. (eds) Sequences II. Springer, New York, NY. https://doi.org/10.1007/978-1-4613-9323-8_18

Download citation

  • DOI: https://doi.org/10.1007/978-1-4613-9323-8_18

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4613-9325-2

  • Online ISBN: 978-1-4613-9323-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics