A Recipe for Tuning MPI + OpenMP
• In priority order
  – get the large-scale MPI parallelization right
    • if processes are blocked, performance will be lost
  – get the OpenMP threading right
    • if threads are blocked, performance will be lost
  – get the node performance details right
    • assess memory hierarchy performance (TLB, cache)
    • assess pipeline performance (graduated instructions, …)
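
One rough way to check the first two steps is to compare how long each process (or thread) spends in a region that ends at a synchronization point. The sketch below is a minimal illustration and is not taken from the slides: it assumes the per-rank times have already been collected into an array (for example with `MPI_Wtime` and `MPI_Gather`), and the helper names `load_imbalance` and `idle_fraction` are hypothetical.

```c
#include <stddef.h>

/* Hypothetical helper: ratio of the slowest worker's time to the mean time.
 * 1.0 means perfectly balanced; larger values mean the other workers
 * are blocked waiting for the slowest one. Returns 0.0 for empty input. */
double load_imbalance(const double *times, size_t n)
{
    if (n == 0) {
        return 0.0;
    }
    double max = times[0];
    double sum = 0.0;
    for (size_t i = 0; i < n; ++i) {
        if (times[i] > max) {
            max = times[i];
        }
        sum += times[i];
    }
    double mean = sum / (double)n;
    return mean > 0.0 ? max / mean : 0.0;
}

/* Hypothetical helper: fraction of aggregate worker time spent idle,
 * assuming every worker waits at a barrier for the slowest one.
 * Computed as 1 - mean/max. */
double idle_fraction(const double *times, size_t n)
{
    double imbalance = load_imbalance(times, n);
    return imbalance > 0.0 ? 1.0 - 1.0 / imbalance : 0.0;
}
```

With per-rank times of 1, 1, 2 and 4 seconds, `load_imbalance` returns 2.0 and `idle_fraction` returns 0.5, meaning half of the aggregate processor time is spent blocked. The same measure could be applied to per-thread times (for example from `omp_get_wtime`) once the MPI level looks balanced, which follows the priority order of the recipe.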