Engineering High Performance Service-Oriented Pipeline Applications with MeDICi

  • Ian Gorton
  • Adam Wynne
  • Yan Liu
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6568)


The pipeline software architecture pattern is commonly used in many application domains to structure a software system. A pipeline comprises a sequence of processing steps that progressively transform data to some desired outputs. As pipeline-based systems are required to handle increasingly large volumes of data and provide high throughput services, simple scripting-based technologies that have traditionally been used for constructing pipelines do not scale. In this paper we describe the MeDICI Integration Framework (MIF), which is specifically designed for building flexible, efficient and scalable pipelines that exploit distributed services as elements of the pipeline. We explain the core runtime and development infrastructures that MIF provides, and demonstrate how MIF has been used in two complex applications to improve performance and modifiability.


middleware software pipelines SOA component-based systems 


  1. 1.
    Buschmann, F., Meunier, R., Rohnert, H., Sommerlad, P., Stal, M.: Pattern-Oriented Software Architecture: A System of Patterns, vol. 1. Wiley, Chichester (2009)Google Scholar
  2. 2.
    Yang, X., Bruin, R.P., Dove, M.T.: Developing an End-to-End Scientific Workflow. Computing in Science and Engineering, 52–61 (May/June 2010)Google Scholar
  3. 3.
    Shah, A.R., Singhal, M., Gibson, T.D., Sivaramakrishnan, C., Waters, K.M., Gorton, I.: An extensible, scalable architecture for managing bioinformatics data and analysis. In: IEEE 4th International Conference on e-Science, Indianapolis, Indiana, December 7-12, pp. 190–197. IEEE Computer Society, Los Alamitos (2008)Google Scholar
  4. 4.
    Gorton, I., Huang, Z., Chen, Y., Kalahar, B., Jin, S., Chavarria-Miranda, D., Baxter, D., Feo, J.: A High-Performance Hybrid Computing Approach to Massive Contingency Analysis in the Power Grid. In: Fifth IEEE International Conference on e-Science and Grid Computing, pp. 277–283. IEEE, Los Alamitos (2009)Google Scholar
  5. 5.
    Chase, J.M., Gorton, I., Sivaramakrishnan, C., Almquist, J.P., Wynne, A.S., Chin, G., Critchlow, T.J.: Kepler + MeDICi - Service-Oriented Scientific Workflow Applications. In: 2009 IEEE Congress on Services - Part I (Services-1 2009), pp. 275–282. IEEE, Los Alamitos (2009)CrossRefGoogle Scholar
  6. 6.
    Barker, A., van Hemert, J.: Scientific Workflow: A Survey and Research Directions. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2007. LNCS, vol. 4967, pp. 746–753. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  7. 7.
    Kiebel, G.R., Auberry, K.J., Jaitly, N., Clark, D., Monroe, M.E., Peterson, E.S., Tolic, N., Anderson, G.A., Smith, R.D.: PRISM: A Data Management System for High-Throughput Proteomics. Proteomics 6(6), 1783–1790 (2006)CrossRefGoogle Scholar
  8. 8.
    Best, D.M., Bohn, S., Love, D., Wynne, A., Pike, W.A.: Real-time visualization of network behaviors for situational awareness. In: Proceedings of the Seventh international Symposium on Visualization For Cyber Security, VizSec 2010, Ottawa, Ontario, Canada, September 14, pp. 79–90. ACM, New York (2010)CrossRefGoogle Scholar
  9. 9.
    Wilde, M., Foster, I., Iskra, K., Beckman, P., Zhang, Z., Espinosa, A., Hategan, M., Clifford, B., Raicu, I.: Parallel Scripting for Applications at the Petascale and Beyond. Computer 42(11) (2009)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Ian Gorton
    • 1
  • Adam Wynne
    • 1
  • Yan Liu
    • 1
  1. 1.Pacific Nothwest National LabRichlandUSA

Personalised recommendations