Abstract
The problem of approximate pattern matching on hypertext is defined and solved by Amir et al. in O(m(n logm+e)) time, where m is the length of the pattern, n is the total text size and e is the total number of edges. Their space complexity is O(mn). We present a new algorithm which is O(mk(n+e)) time and needs only O(n) extra space, where k < m is the number of allowed errors in the pattern. If the graph is acyclic, our time complexity drops to O(m(n+e)), improving Amir's results.
This work has been supported in part by Fondecyt grants 1-950622 and 1-960881.
This is a preview of subscription content, log in via an institution.
Preview
Unable to display preview. Download preview PDF.
References
T. Akutsu. A linear time pattern matching algorithm between a string and a tree. In Proc. CPM'93, pages 1–10, 1993.
A. Amir, M. Lewenstein, and N. Lewenstein. Pattern matching in hypertext. In Proc. WADS'97, LNCS 1272, pages 160–173, 1997.
R. Baeza-Yates and G. Navarro. A faster algorithm for approximate string matching. In Proc. CPM'96, LNCS 1075, pages 1–23, 1996. ftp://ftp.dcc.uchile.cl/-pub/users/gnavarro/cpm96.ps.gz.
R. Baeza-Yates and C. Perleberg. Fast and practical approximate pattern matching. In Proc. CPM'92, LNCS 644, pages 185–192, 1992.
W. Chang and J. Lampe. Theoretical and empirical comparisons of approximate string matching algorithms. In Proc. CPM'92, LNCS 644, pages 172–181, 1992.
J. Conklin. Hypertext: An introduction and survey. IEEE Computer, 20(9):17–41, September 1987.
G. Das, R. Fleischer, L. Gasieniec, D. Gunopulos, and J. Karkäinen. Episode matching. In Proc. CPM'97, LNCS 1264, pages 12–27, 1997.
G. Landau and U. Vishkin. Fast string matching with k differences. J. of Computer Systems Science, 37:63–78, 1988.
U. Manber and S. Wu. Approximate string matching with arbitrary costs for text and hypertext. In Proc. IAPR Workshop on Structural and Syntactic Pattern Recognition, pages 22–33, Bern, Switzerland, 1992.
K. Park and D. Kim. String matching in hypertext. In Proc. CPM'95, pages 318–329, 1995.
P. Sellers. The theory and computation of evolutionary distances: pattern recognition. J. of Algorithms, 1:359–373, 1980.
E. Sutinen and J. Tarhio. On using q-gram locations in approximate string matching. In Proc. ESA'95, LNCS 979, 1995.
Esko Ukkonen. Finding approximate patterns in strings. J. of Algorithms, 6:132–137, 1985.
S. Wu and U. Manber. Fast text searching allowing errors. CACM, 35(10):83–91, October 1992.
S. Wu, U. Manber, and E. Myers. A sub-quadratic algorithm for approximate limited expression matching. Algorithmica, 15(1):50–67, 1996.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Navarro, G. (1998). Improved approximate pattern matching on hypertext. In: Lucchesi, C.L., Moura, A.V. (eds) LATIN'98: Theoretical Informatics. LATIN 1998. Lecture Notes in Computer Science, vol 1380. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0054335
Download citation
DOI: https://doi.org/10.1007/BFb0054335
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64275-6
Online ISBN: 978-3-540-69715-2
eBook Packages: Springer Book Archive