Abstract
A challenging variant of the motif finding problem (MFP) is called planted (l,d)-motif finding. It is where the pattern has expected number of mismatches on each of its occurrence in the sequence. Difficulty arises when the planted pattern is longer and the expected number of mismatches is high. An algorithm (FMURP) which uses a random projection technique followed by local search algorithms is shown to work better with longer motifs with higher accuracy. A parallel algorithm for FMURP already exist in the literature. However, the parallel algorithm uses an excessive amount of memory. In this paper, we propose some improvements on the existing parallel algorithm. We also prove that the modified parallelization is equivalent to the sequential version of FMURP.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allouche, J.P., Shallit, J.: Automatic Sequences: Theory Applications and Generalization. In: Numeration Systems, ch. 3, pp. 70–73. Cambridge University Press (2003)
Buhler, J., Tompa, M.: Finding Motifs Using Random Projections. In: RECOMB 2001 Proceedings of the Fifth Annual International Conference on Computational Biology (2001)
Buhler, J.: Projection Genomics Toolkit (2004), http://www1.cse.wustl.edu/~jbuhler/pgt/
Clemente, J.B., Cabarle, F.G.C., Adorna, H.N.: PROJECTION algorithm for motif finding on gPUs. In: Nishizaki, S.-y., Numao, M., Caro, J., Suarez, M.T. (eds.) WCTP 2011. PICT, vol. 5, pp. 101–115. Springer, Heidelberg (2012)
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 2nd edn. MIT Press (2001)
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B (1977)
Jones, N., Pevzner, P.: An Introduction to Bioinformatics Algorithms. Massachusetts Institute of Technology Press (2004)
Pevzner, P., Sze, S.H.: Combinatorial Approaches to Finding Subtle Signals in DNA Sequences. In: Proceedings of 8th Int. Conf. Intelligent Systems for Molecular Biology (ISMB), pp. 269–278 (2000)
Lakshmivarahan, S., Dhall, S.: Analysis and Design of Parallel Algorithms: Arithmetic and Matrix Problems, ch. 1, pp. 18–19. Mcgraw-hill College (1990)
Shashidhara, S.H., Joseph, P., Srinivasa, K.G.: Improving Motif Refinement using Hybrid Expectation Maximization and Random Projection. In: Proceedings of the International Symposium on Biocomputing, Calicut, India (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Tokyo
About this paper
Cite this paper
Clemente, J.B., Adorna, H.N. (2013). Some Improvements of Parallel Random Projection for Finding Planted (l,d)-Motifs. In: Nishizaki, Sy., Numao, M., Caro, J., Suarez, M.T. (eds) Theory and Practice of Computation. Proceedings in Information and Communications Technology, vol 7. Springer, Tokyo. https://doi.org/10.1007/978-4-431-54436-4_5
Download citation
DOI: https://doi.org/10.1007/978-4-431-54436-4_5
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-54435-7
Online ISBN: 978-4-431-54436-4
eBook Packages: Computer ScienceComputer Science (R0)