Exploiting multi-scale parallelism for large scale numerical modelling of laser wakefield accelerators
Plasma Physics and Controlled Fusion, 2013•iopscience.iop.org
A new generation of laser wakefield accelerators (LWFA), supported by the extreme
accelerating fields generated in the interaction of PW-Class lasers and underdense targets,
promises the production of high quality electron beams in short distances for multiple
applications. Achieving this goal will rely heavily on numerical modelling to further
understand the underlying physics and identify optimal regimes, but large scale modelling of
these scenarios is computationally heavy and requires the efficient use of state-of-the-art …
accelerating fields generated in the interaction of PW-Class lasers and underdense targets,
promises the production of high quality electron beams in short distances for multiple
applications. Achieving this goal will rely heavily on numerical modelling to further
understand the underlying physics and identify optimal regimes, but large scale modelling of
these scenarios is computationally heavy and requires the efficient use of state-of-the-art …
Abstract
A new generation of laser wakefield accelerators (LWFA), supported by the extreme accelerating fields generated in the interaction of PW-Class lasers and underdense targets, promises the production of high quality electron beams in short distances for multiple applications. Achieving this goal will rely heavily on numerical modelling to further understand the underlying physics and identify optimal regimes, but large scale modelling of these scenarios is computationally heavy and requires the efficient use of state-of-the-art petascale supercomputing systems. We discuss the main difficulties involved in running these simulations and the new developments implemented in the OSIRIS framework to address these issues, ranging from multi-dimensional dynamic load balancing and hybrid distributed/shared memory parallelism to the vectorization of the PIC algorithm. We present the results of the OASCR Joule Metric program on the issue of large scale modelling of LWFA, demonstrating speedups of over 1 order of magnitude on the same hardware. Finally, scalability to over∼ 10 6 cores and sustained performance over∼ 2 P Flops is demonstrated, opening the way for large scale modelling of LWFA scenarios.
iopscience.iop.org