Dynamic resizing of parallel scientific simulations: A case study using LAMMPS. (English)
Allen, Gabrielle (ed.) et al., Computational science ‒ ICCS 2009. 9th international conference Baton Rouge, LA, USA, May 25‒27, 2009. Proceedings, Part I. Berlin: Springer (ISBN 978-3-642-01969-2/pbk). Lecture Notes in Computer Science 5544, 175-184 (2009).
Summary: Large-scale computational science simulations are a dominant component of the workload on modern supercomputers. Efficient use of high-end resources for these large computations is of considerable scientific and economic importance. However, conventional job schedulers limit flexibility in that they are ‘static’, i.e., the number of processors allocated to an application can not be changed at runtime. In earlier work, we described ReSHAPE, a system that eliminates this drawback by supporting dynamic resizability in distributed-memory parallel applications. The goal of this paper is to present a case study highlighting the steps involved in adapting a production scientific simulation code to take advantage of ReSHAPE. LAMMPS, a widely used molecular dynamics code, is the test case. Minor extensions to LAMMPS allow it to be resized using ReSHAPE, and experimental results show that resizing significantly improves overall system utilization as well as performance of an individual LAMMPS job.