Performance and Scalability Evaluation of Short Fragment Sequence Alignment Applications

  • Gergely WindischEmail author
  • Miklos Kozlovszky
  • Ákos Balaskó
Part of the Modeling and Optimization in Science and Technologies book series (MOST, volume 2)


Mapping short fragments to open access eukaryotic genomes at a very large scale presents a data processing challenge to the scientific world. The main tool used for such an application is BLAST which is the one we use in our portlets developed at Obuda University.

Lately most of our work has been focused on evaluating the performance and scalability of our applications by profiling, analyzing the results of the tests and improving the performance of both the portlets and the server-side massively parallel algorithm by environment optimization using the data collected during the testing phase.

In this paper we will describe the two portlets (Deep Aligner and Disease Gene Mapper), discuss the issues and challenges during the development and the performance analysis and present our results on the performance and scalability of the applications.


Application porting sequence alignment workflow HP-SEE gUSE scalability mpiBlast 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Gesing, S., et al.: Granular security for a science gateway in structural bioinformatics. In: Proc. IWSG-Life 2011 (2011)Google Scholar
  2. 2.
    Gesing, S., et al.: Special issue: Portals for life sciences|providing intuitive access to bioinformatic tools. Concurrency and Computation 23(3), 223–234 (2011)CrossRefGoogle Scholar
  3. 3.
    Aloisio, G., Cafaro, M., Fiore, S., Mirto, M.: ProGenGrid: a workflow service infrastructure for composing and executing bioinformatics grid services. In: Proceedings. 18th IEEE Symposium on Computer-Based Medical Systems, June 23-24, pp. 555–560 (2005), doi:10.1109/CBMS.2005.90Google Scholar
  4. 4.
  5. 5.
    Kacsuk, P., Karoczkai, K., et al.: WS-PGRADE: Supporting parameter sweep applications in workflows. In: Proc. of 3rd Workshop on Workflows in Support of Large-Scale Science, Austin, TX, USA, November 17, pp. 1–10 (2008) ISBN: 978-1-4244-2827-4Google Scholar
  6. 6.
    Kozlovszky, M., Windisch, G., Balasko, A.: Short fragment sequence alignment on the HP-SEE infrastructure. In: MIPRO, 2012 Proceedings of the 35th International Convention, May 21-25, pp. 442–445 (2012)Google Scholar
  7. 7.
    Kacsuk, P., Farkas, Z., Sipos, G., Hermann, G., Kiss, T.: Supporting Workflow-level PS Applications by the P-GRADE Grid portal. In: Towards Next Generation Grids Proceedings of the CoreGRID Symposium 2007 (2007)Google Scholar
  8. 8.
    Balaskó, Á., Kozlovszky, M., Schnautigel, A., Karóczkai, K., Márton, I., Strodl, T., Kacsuk, P.: Converting P-GRADE grid portal into e-science gateways. In: Proceedings of International Workshop on Science Gateways, IWSG 2010, Catania, pp. 1–6 (2010)Google Scholar
  9. 9.
    Lin, H., Balaji, P., Poole, R., Sosa, C., Ma, X., Feng, W.: Massively Parallel Genomic Sequence Search on the Blue Gene/P Architecture. In: IEEE/ACM SC 2008 (November 2008)Google Scholar
  10. 10.
    Darling, A., Carey, L., Feng, W.: The Design, Implementation, and Evaluation of mpiBLAST. In: 4th International Conference on Linux Clusters (June 2003)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Gergely Windisch
    • 1
    Email author
  • Miklos Kozlovszky
    • 1
  • Ákos Balaskó
    • 2
  1. 1.John von Neumann Faculty of InformaticsÓbuda UniversityBudapestHungary
  2. 2.MTA SZTAKI/Laboratory of Parallel and Distributed ComputingBudapestHungary

Personalised recommendations