The Parallel BMH Algorithm of String Matching

  • Conference paper
Information and Business Intelligence (IBI 2011)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 268)

Abstract

String matching is a fundamental problem in computer science and a critical step in analyzing Internet content. Many classical algorithms have been proposed over years of study, but traditional algorithms have hit a bottleneck: the time complexity of single-pattern and multi-pattern string matching is bounded by \(O(\frac{n}{m}\log_{|\Sigma|} m)\) and \(O(\frac{n}{m}\log_{|\Sigma|} rm)\), respectively, and many algorithms approach or reach these limits. This paper studies parallel algorithms as a way past the bottleneck, optimizing traditional algorithms with multi-core processing and the SSE and AVX instruction sets. We analyze the suffix-based approach and propose a parallel BMH approach. Experimental results show that the parallel approach achieves good speedup.
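For readers unfamiliar with BMH (Boyer-Moore-Horspool), the following is a minimal sketch of the sequential algorithm plus one common way to split the text across workers. It is not the authors' implementation, which is not reproduced on this page: the function names (`build_shift_table`, `bmh_search`, `parallel_bmh_search`), the `workers` parameter, and the chunking scheme are all assumptions made for illustration. It also uses Python threads only to show the partitioning; it does not use SSE or AVX, and CPython's global interpreter lock means it should not be expected to show a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def build_shift_table(pattern):
    """Horspool bad-character table: one shift distance per byte value."""
    m = len(pattern)
    shift = [m] * 256
    # The last pattern byte is excluded so that every shift is at least 1.
    for i in range(m - 1):
        shift[pattern[i]] = m - 1 - i
    return shift


def bmh_search(text, pattern, start=0, end=None):
    """Return every match position p with start <= p and p + len(pattern) <= end."""
    n = len(text) if end is None else end
    m = len(pattern)
    if m == 0 or n - start < m:
        return []
    shift = build_shift_table(pattern)
    matches = []
    pos = start
    while pos <= n - m:
        # Compare the window against the pattern from right to left.
        j = m - 1
        while j >= 0 and text[pos + j] == pattern[j]:
            j -= 1
        if j < 0:
            matches.append(pos)
        # Shift by the table entry for the last byte of the current window.
        pos += shift[text[pos + m - 1]]
    return matches


def parallel_bmh_search(text, pattern, workers=4):
    """Hypothetical chunked variant: each worker scans one slice of the text."""
    n, m = len(text), len(pattern)
    if m == 0 or n < m:
        return []
    chunk = max(m, -(-n // workers))  # ceiling division, at least m bytes
    # A worker owns the match start positions in [lo, lo + chunk). It reads
    # m - 1 extra bytes so matches straddling a chunk boundary are not lost,
    # and no position is reported by two workers.
    ranges = [(lo, min(n, lo + chunk + m - 1)) for lo in range(0, n, chunk)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = list(pool.map(lambda r: bmh_search(text, pattern, r[0], r[1]), ranges))
    return [p for part in parts for p in part]
```

Both functions expect `bytes` input. For example, `parallel_bmh_search(b"abracadabra abracadabra", b"abra", workers=3)` returns `[0, 7, 12, 19]`, the same positions as the sequential `bmh_search`. The overlap of `m - 1` bytes between neighbouring slices is the design choice that keeps the chunked search equivalent to the sequential one.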

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kun, H., Xilong, Q., Hong, Y. (2012). The Parallel BMH Algorithm of String Matching. In: Qu, X., Yang, Y. (eds) Information and Business Intelligence. IBI 2011. Communications in Computer and Information Science, vol 268. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29087-9_20

  • DOI: https://doi.org/10.1007/978-3-642-29087-9_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-29086-2

  • Online ISBN: 978-3-642-29087-9

  • eBook Packages: Computer Science, Computer Science (R0)
