Methods for Optimisation and Acceleration of an Explicit Navier-Stokes Code with Application to Shock/Boundary-Layer Interaction

  • E. Katzer
  • M. Dowling
Part of the Notes on Numerical Fluid Mechanics book series (NNFM, volume 12)


The two parts of this paper discuss techniques for improving the programming efficiency of explicit algorithms on the CRAY-1 vector computer at the Fortran and Assembler levels respectively. The high degree of modularity was essential for both simplifying the application of the techniques of vectorisation and for calculating the maximal execution speed of the algorithm and achieving this speed using assembly language.

At the Fortran level increasing vector lengths by reducing the dimension of arrays achieved a performance improvement of between 10 % and 50 %, the module coded in assembly language a-chieved a further improvement of 37 %. The FORTRAN coded program attained an average speed of 60 MFLOPS.


Functional Unit Memory Access Separation Bubble Vector Length Assembly Language 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1]
    KATZER, E.: Numerische Untersuchung der laminaren Stoß-Grenzschicht-Wechselwirkung . DFVLR-FB 85–34 (1985).Google Scholar
  2. [2]
    GRAHAM, C.G., KOST, F.H.: Shock Boundary Layer Interaction on High Turning Transonic Turbine Cascades. ASME Publ. Nr. 79-GT-37 (1979).Google Scholar
  3. [3]
    MACCORMACK, R.W., BALDWIN, B.S.: A Numerical Method for Solving the Navier-Stokes Equations with Application to Shock-Boundary Layer Interactions. AIAA Paper 75–1 (1975).Google Scholar
  4. [4]
    DETERT, U.: Performance Comparison for CRAY-1/S and CRAY X-MP by Means of Fortran Kernels und User Programs.Google Scholar
  5. [5]
    DETERT, U.: Vector Processing on CRAY-1 and CRAY X-MP Workshop on “Use of Supercomputers in Theoretical Science” at University of Antwerp, July 1984.Google Scholar
  6. [6]
    HOCKNEY and JESSHOPE: Parallel Computers: Architecture, Programming and Algorithms. Bristol, 1981.zbMATHGoogle Scholar
  7. [7]
    HOCKNEY: (r∞, n1/2, s1/2) Measurements on the 2-CPU CRAY X-MP, to appear in “Parallel Computing”, Vol.2, No.1.Google Scholar
  8. [8]
    WIESE, G.: Entwicklung, Laufzeitanalyse und Optimierung von CRAY-Assemlber-Programmen. Diplomarbeit, Inst.Angew.Mathematik, Abt.Rechnentechnik? Technische Universität Braunschweig, 1983.Google Scholar
  9. [9]
    CRAY-1 Computer Systems, S Series Mainframe Reference Manual, HR-0029, CRAY Research Inc., 1982.Google Scholar

Copyright information

© Springer Fachmedien Wiesbaden 1986

Authors and Affiliations

  • E. Katzer
    • 1
  • M. Dowling
    • 2
  1. 1.Institut für AeroelastikDFVLR-AVAGöttingenGermany
  2. 2.Institut für Angewandte MathematikTechnischen Universität BraunschweigBraunschweigGermany

Personalised recommendations