Beruflich Dokumente
Kultur Dokumente
Motivations
Numerical approach
Node topology
Code Scaling
IO performances
Conclusions
Outline
Motivations
Separation of scales:
XY
63 Mb
ZY
11 Mb
ZY
XY
Blue Gene/P
4x450 PowerPC
2 Gb RAM (DDR2)
INCITE project (ANL)
PRACE project (Jugene)
Plane to Plane
decomposition
Tier-0
(16 R*8 buffers)
Computational setup & domain decomposition
New parallelization strategy
+
Hybrid OpenMP-MPI
Computational setup & domain decomposition
Global transposes:
4 OpenMP threads
Static Scheduling:
Loop blocking in Y
Tridiagonal LS LU solver
Posix calls
Conclusions