-
Notifications
You must be signed in to change notification settings - Fork 50
Benchmark Gather Scatter performance
The plots below show performance of particle (gather and scatter) communication on the ORNL Frontier supercomputer. Both CPU and GPU performance are compared as a function of total particle count per rank (with 8 MPI ranks). These are the only core library benchmarks intended to run with multiple MPI ranks.
Create
refers to building the communication steering vectors (construction of the Halo
) and gather
and scatter
refer to executing the communication (copying particles to new ranks and updating particles copied to new ranks, respectively). Each point represents a single fraction of particles communicated.
Default parameters with the commandline "large" setting were used for these results.
Cabana - A Co-Designed Library for Exascale Particle Simulations