String searching algorithms revisited

  • Ricardo A. Baeza-Yates
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 382)


We present bounds for the average case of the Knuth-Morris-Pratt (KMP) algorithm and the Boyer-Moore-Horspool (BMH) algorithm for random text. Experimental results on both random and English text suggest that the bounds are tight. We also present a hybrid algorithm that combines the KMP and BMH algorithms and that, in practice, is faster than the Boyer-Moore algorithm.
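
For readers unfamiliar with BMH, the following is a minimal Python sketch of the standard textbook form of the Boyer-Moore-Horspool search. It is not taken from the paper and is not the hybrid algorithm described above. The function name `horspool_search`, the dictionary-based shift table, and the choice to return an empty list for an empty pattern are all assumptions made for illustration. The window comparison uses a string slice for brevity, so it does not expose the individual character comparisons that an average-case analysis would count.

```python
def horspool_search(text: str, pattern: str) -> list[int]:
    """Return the start indices of all occurrences of pattern in text.

    Illustrative sketch of Boyer-Moore-Horspool; names are hypothetical.
    """
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        # Assumption: an empty pattern is treated as having no matches.
        return []

    # Shift table: for each character in pattern[:-1], the distance from
    # its last occurrence to the end of the pattern. Characters that do
    # not appear there get the full shift m (handled by .get below).
    shift = {}
    for i in range(m - 1):
        shift[pattern[i]] = m - 1 - i

    matches = []
    pos = 0
    while pos <= n - m:
        # Compare the current window against the pattern.
        if text[pos:pos + m] == pattern:
            matches.append(pos)
        # Slide by the shift of the text character aligned with the
        # last pattern position, whether or not the window matched.
        pos += shift.get(text[pos + m - 1], m)
    return matches
```

For example, `horspool_search("abracadabra", "abra")` returns `[0, 7]`. The shift depends only on the text character under the last pattern position, which is what distinguishes the Horspool variant from the full Boyer-Moore algorithm.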


