Advertisement

Preliminary Analysis of the Cell BE Processor Limitations for Sequence Alignment Applications

  • Sebastian Isaza
  • Friman Sánchez
  • Georgi Gaydadjiev
  • Alex Ramirez
  • Mateo Valero
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5114)

Abstract

The fast growth of bioinformatics field has attracted the attention of computer scientists in the last few years. At the same time the increasing database sizes require greater efforts to improve the computational performance. From a computer architecture point of view, we intend to investigate how bioinformatics applications can benefit from future multi-core processors. In this paper we present a preliminary study of the Cell BE processor limitations when executing two representative sequence alignment applications (Ssearch and ClustalW). The inherent large parallelism of the targeted algorithms makes them ideal for architectures supporting multiple dimensions of parallelism (TLP and DLP). However, in the case of Cell BE we identified several architectural limitations that need a careful study and quantification.

Keywords

Local Store Forward Pass Pairwise Sequence Alignment Bioinformatics Application Architectural Limitation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Altivec enabled clustalw1.83, http://powerdev.osuosl.org/node/49
  2. 2.
  3. 3.
    Swissprot, universal protein database, http://www.expasy.org/sprot/
  4. 4.
    Bioinformatics market study for washington technology center (June 2003), http://www.altabiomedical.com
  5. 5.
    Bader, D.A., Li, Y., Li, T., Sachdeva, V.: Bioperf: A benchmark suite to evaluate high-performance computer architecture on bioinformatics applications. In: IEEE International Symposium on Workload Characterization (IISWC), pp. 1–8 (October 2005)Google Scholar
  6. 6.
    Henikoff, J., Henikoff, S., Pietrokovski, S.: Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations. Bioinformatics 15 (1999)Google Scholar
  7. 7.
    Higgins, D., Thompson, J., Gibson, T., Thompson, J.: Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22, 4673–4680 (1994)CrossRefGoogle Scholar
  8. 8.
    Kahle, J.A., Day, M.N., Hofstee, H.P., Johns, C.R., Shippy, D.: Introduction to the cell multiprocessor. IBM Systems Journal 49(4/5), 589–604 (2005)Google Scholar
  9. 9.
    Needleman, S., Wunsch, C.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48, 443–453 (1970)CrossRefGoogle Scholar
  10. 10.
    Pearson, W.R.: Searching protein sequence libraries: comparison of the sensitivity and selectivity of the smith-waterman and FASTA algorithms. Genomics 11, 635–650 (1991)CrossRefGoogle Scholar
  11. 11.
    Petrini, F., Fossum, G., Fernandez, J., Varbanescu, A.L., Kistler, M., Perrone, M.: Multicore surprises: Lessons learned from optimizing sweep3d on the cellbe. In: IEEE International Parallel and Distributed Processing Symposium, IPDPS, pp. 1–10 (2007)Google Scholar
  12. 12.
    Rognes, T.: Rapid and sensitive methods for protein sequence comparison and database searching. PhD thesis, Institue of Medical Microbiology, University of Oslo (2000)Google Scholar
  13. 13.
    Sachdeva, V., Kistler, M., Speight, E., Tzeng, T.H.K.: Exploring the viability of the cell broadband engine for bioinformatics applications. In: Proceedings of the 6th Workshop on High Performance Computational Biology, pp. 1–8 (2007)Google Scholar
  14. 14.
    Sanchez, F., Salami, E., Ramirez, A., Valero, M.: Performance analysis of sequence alignment applications. In: Proceedings of the IEEE International Symposium on Workload Characterization (IISWC), pp. 51–60 (2006)Google Scholar
  15. 15.
    Shpaer, E., Robinson, M., Yee, D., Candlin, J., Mines, R., Hunkapiller, T.: Sensitivity and selectivity in protein similarity searches: A comparison of smith-waterman in hardware to blast and fasta. Genomics 38, 179–191 (1996)CrossRefGoogle Scholar
  16. 16.
    Smith, S., Frenzel, J.: Bioinformatics application of a scalable supercomputer-on-chip architecture. Proceedings of the International Conference on Parallel and Distributed Processing Techniques 1, 385–391 (2003)Google Scholar
  17. 17.
    Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. Journal of Molecular Biology 147, 195–197 (1981)CrossRefGoogle Scholar
  18. 18.
    Vandierendonck, H., Rul, S., Questier, M., Bosschere, K.D.: Experiences with parallelizing a bio-informatics program on the cell be. In: Third International Conference, HiPEAC, pp. 161–175 (2008)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Sebastian Isaza
    • 1
  • Friman Sánchez
    • 2
  • Georgi Gaydadjiev
    • 1
  • Alex Ramirez
    • 2
    • 3
  • Mateo Valero
    • 2
    • 3
  1. 1.Computer Engineering LabDelft University of TechnologyThe Netherlands
  2. 2.Computer Architecture DepartmentTechnical University of CataloniaSpain
  3. 3.Barcelona Supercomputing Center-CNSSpain

Personalised recommendations