Beruflich Dokumente
Kultur Dokumente
FINAL REPORT
The following reports the results from work on the ASCI Level 3 contract, Agreement No. B513235.
The original Statement of Work (SOW) is annotated to indicate the results status.
Instrumentation / Measurement
Goal: Integrate the TAU performance system with the dynamic instrumentation capabilities offered by
DyninstAPI. Enable TAU performance measurement on the Compaq Alpha Cluster. Improve
PDT program analysis system for Fortran 90 instrumentation.
Tasks
1. INSTR-1: Develop dynamic TAU performance measurement mechanisms for MPI using
DyninstAPI.
Status: Complete. We implemented a technique which spawns a Dyninst mutator with each MPI
generated executable image. The mutator inserts TAU instrumentation in the executable before
starting the MPI process and then waits for the child process to terminate. (This is similar to the
approach used in Dynaprof.) We demonstrated this capability with the SIMPLE hydrodynamics
benchmark in our PDPTA ’01 paper [1]. TAU v2.11 ships with support for DyninstAPI and MPI.
2. INSTR-2: Port the TAU performance measurement system to Compaq Alpha Cluster and
demonstrate with MPI applications.
Status: Complete. TAU supports Compaq (cxx, f90) and KAI (KCC, KAP/Pro) compilers under
Tru64. TAU also supports Compaq Linux clusters. This capability has been demonstrated with the
SAMRAI (Andy Wissinsk, LLNL) and SAGE (Jack Horner, LANL) projects.
3. INSTR-3: Complete PDT F90 implementation.
Status: Complete. TAU’s PDT system now supports F90 as well as C99 and C++. The PDT F90
front end has been validated on F90 test suites from the University of Colorado ELI project and the
PCRC HPF compiler project. A total of 309 programs were tested with no errors reported.
4. INSTR-4: Develop tool for automatic source-level F90 instrumentation and demonstrate on F90
application code.
Status: Complete. The PDT F90 capability has been used to build F90 instrumentation support for
TAU. We have tested the instrumentor partly on the SAGE code and the POP code (Phil Jones,
LANL), and more extensively in the Caltech CACR ASCI/ASAP VTF project (Julian Cummings).
In addition to its use in the TAU F90 instrumentor, the PDT F90 capability is being used in the
CHASM [2] project (Craig Rasmussen, LANL).
Personnel
References
1. S. Shende, A. Malony, and R. Ansell-Bell, "Instrumentation and Measurement Strategies for
Flexible and Portable Empirical Performance Evaluation," Proc. Int'l. Conf. on Parallel and
Distributed Processing Techniques and Applications (PDPTA 2001), June 2001.
4. B. Mohr, A. Malony, S. Shende, and F. Wolf, "Towards a Performance Tool Interface for
OpenMP: An Approach Based on Directive Rewriting," Proc. Third European Workshop on
OpenMP (EWOMP 2001), Sept. 2001.
5. A. Malony, B. Mohr, S. Shende, and F. Wolf, "Design and Prototype of a Performance Tool
Interface for OpenMP," Proc. Los Alamos Computer Science (LACSI) Symp. 2001, Oct. 2001.
6. B. Kuhn, A. Malony, B. Mohr, and S. Shende, "A Performance Tool Interface for OpenMP,"
Report for Accelerated Strategic Computing Initiative (ASCI), ASCI Path Forward program,
Ultrascale Tools Initiative, RTS - Parallel System Performance, submitted by KAI Software, A
Division of Intel America, Inc., Aug. 2001.
8. A. Malony, B. Mohr, and S. Shende, "Performance Technology for Complex Parallel Systems,"
SC 2001 tutorial, Nov. 2001.