Skip to main content

Pattern Matching with Pair Correlation Distance

  • Conference paper
String Processing and Information Retrieval (SPIRE 2008)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5280))

Included in the following conference series:

  • 773 Accesses

Abstract

In pattern matching with pair correlation distance problem, the goal is to find all occurrences of a pattern P of length m, in a text T of length n, where the distance between them is less than a threshold k. For each text location i, the distance is defined as the number of different kinds of mismatched pairs (α,β), between P and T[i ...i + m]. We present an algorithm with running time of \(O\left(min\{\left|\Sigma_P\right|^2 n \log m,n \!\left({m \log m}\right)^\frac{2}{3}\}\right)\!\) for this problem. Another interesting problem is the one-side pair correlation distance where it is desired to find all occurrences of P where the number of mismatched characters in P is less than k. For this problem, we present an algorithm with running time of \(O\left(min\{\left|\Sigma_P\right| n \log m,n\right.\left.\sqrt{m \log m}\}\right)\).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amir, A., Aumann, Y., Cole, R., Lewenstein, M., Porat, E.: Function matching: Algorithms, applications, and a lower bound. In: Baeten, J.C.M., Lenstra, J.K., Parrow, J., Woeginger, G.J. (eds.) ICALP 2003. LNCS, vol. 2719, pp. 929–942. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  2. Amir, A., Farach, M., Muthukrishnan, S.: Alphabet dependence in parameterized matching. Information Processing Letters 49, 111–115 (1994)

    Article  MATH  Google Scholar 

  3. Amir, A., Landau, G.M., Lewenstein, M., Lewenstein, N.: Efficient special cases of pattern matching with swaps. Information Processing Letters 68(3), 125–132 (1998)

    Article  MathSciNet  MATH  Google Scholar 

  4. Amir, A., Lewenstein, M., Porat, E.: Faster algorithms for string matching with k mismatches. In: Proc. 11th ACM-SIAM Symp. on Discrete Algorithms (SODA), pp. 794–803 (2000)

    Google Scholar 

  5. Baker, B.S.: A theory of parameterized pattern matching: algorithms and applications. In: Proc. 25th Annual ACM Symposium on the Theory of Computation, pp. 71–80 (1993)

    Google Scholar 

  6. Baker, B.S.: Parameterized pattern matching: Algorithms and applications. J. Comput. Syst. Sci. 52(1), 28–42 (1996)

    Article  MathSciNet  MATH  Google Scholar 

  7. Baker, B.S.: Parameterized duplication in strings: Algorithms and an application to software maintenance. SIAM J. Comput. 26(5), 1343–1362 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  8. Hazay, C., Lewenstein, M., Sokol, D.: Approximate parameterized matching. In: Albers, S., Radzik, T. (eds.) ESA 2004, vol. 3221, pp. 414–425. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  9. Levenstein, V.I.: Binary codes capable of correcting, deletions, insertions and reversals. Soviet Phys. Dokl. 10, 707–710 (1966)

    MathSciNet  Google Scholar 

  10. Landau, G.M., Vishkin, U.: Introducing efficient parallelism into approximate string matching. In: Proc. 18th ACM Symposium on Theory of Computing, pp. 220–230 (1986)

    Google Scholar 

  11. Lowrance, R., Wagner, R.A.: An extension of the string-to-string correction problem. Journal of the ACM 22(2), 177–183 (1975)

    Article  MathSciNet  MATH  Google Scholar 

  12. Nosovskij, G.V.: Mathematical analysis of stock market movement. In: 3rd International Conference on Cyberworlds (CW), pp. 320–321 (2004)

    Google Scholar 

  13. Schleimer, S., Wilkerson, D.S., Winnowing, A.A.: Local algorithms for document fingerprinting. In: Proceedings of the International Conference on Management of Data (SIGMOD), pp. 76–85 (2003)

    Google Scholar 

  14. Shmulevich, I., Yli-Harja, O., Coyle, E., Povel, D., Lemstrom, K.: Perceptual issues in music pattern recognition - complexity of rhythm and key finding. In: Proc. of AISB Symposium on Musical Creativity, pp. 64–69 (1999)

    Google Scholar 

  15. Wagner, R.A.: On the complexity of the extended string-to-string correction problem. In: Proc. of the 7th ACM Symposium on the Theory of Computing (STOC), pp. 218–223 (1975)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Porat, B., Porat, E., Zur, A. (2008). Pattern Matching with Pair Correlation Distance. In: Amir, A., Turpin, A., Moffat, A. (eds) String Processing and Information Retrieval. SPIRE 2008. Lecture Notes in Computer Science, vol 5280. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89097-3_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-89097-3_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-89096-6

  • Online ISBN: 978-3-540-89097-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics