Abstract
In this contribution we examine the efficiency and scalability of our Lattice Boltzmann implementation Musubi when OpenMP threads are used within an MPI-parallel computation on Hawk. The Lattice Boltzmann method enables the explicit computation of incompressible flows, and the mesh discretization can be generated automatically, even for complex geometries. The basic Lattice Boltzmann kernel is fairly simple and involves only a few floating-point operations per lattice node. A simple loop over all lattice nodes in each partition of the MPI-parallel setup lends itself to a straightforward loop parallelization with OpenMP. With increasing core counts per compute node, the use of threads on the shared-memory nodes is gaining importance, as it avoids overly small partitions with many outbound communications to neighboring partitions. We briefly discuss the hybrid parallelization of Musubi and investigate how the use of OpenMP threads affects performance when running simulations on the Hawk supercomputer at HLRS.
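To illustrate the kind of loop parallelization described above, the following is a minimal sketch in C and not Musubi's actual kernel. It assumes a D2Q9 stencil, a BGK collision operator, and a structure-of-arrays layout; the function name `bgk_collide`, the array layout, and the relaxation parameter `omega` are hypothetical choices made for this example only. Streaming and the MPI exchange between partitions are omitted.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed D2Q9 stencil: discrete velocities and lattice weights. */
enum { Q = 9 };
static const int cx[Q] = {0, 1, 0, -1, 0, 1, -1, -1, 1};
static const int cy[Q] = {0, 0, 1, 0, -1, 1, 1, -1, -1};
static const double w[Q] = {4.0 / 9.0,
                            1.0 / 9.0, 1.0 / 9.0, 1.0 / 9.0, 1.0 / 9.0,
                            1.0 / 36.0, 1.0 / 36.0, 1.0 / 36.0, 1.0 / 36.0};

/* BGK collision on all nodes of one partition (hypothetical helper).
 * f holds Q * n_nodes values; direction q of node i is f[q * n_nodes + i].
 * Each node is updated independently of all others, so the node loop can
 * be shared among OpenMP threads without any synchronization. */
void bgk_collide(double *f, size_t n_nodes, double omega)
{
    assert(omega > 0.0 && omega < 2.0); /* usual BGK stability range */

    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n_nodes; ++i) {
        const size_t node = (size_t)i;

        /* Macroscopic density and velocity from the distributions. */
        double rho = 0.0, ux = 0.0, uy = 0.0;
        for (int q = 0; q < Q; ++q) {
            const double fq = f[(size_t)q * n_nodes + node];
            rho += fq;
            ux += cx[q] * fq;
            uy += cy[q] * fq;
        }
        ux /= rho;
        uy /= rho;
        const double usq = ux * ux + uy * uy;

        /* Relax every direction towards its second-order equilibrium. */
        for (int q = 0; q < Q; ++q) {
            const double cu = cx[q] * ux + cy[q] * uy;
            const double feq =
                w[q] * rho * (1.0 + 3.0 * cu + 4.5 * cu * cu - 1.5 * usq);
            const size_t idx = (size_t)q * n_nodes + node;
            f[idx] += omega * (feq - f[idx]);
        }
    }
}
```

In a hybrid run, each MPI rank would call such a routine on its own partition, with the threads of that rank sharing the node loop. With GCC or Clang the pragma takes effect when compiling with `-fopenmp`; without that flag it is ignored and the loop runs serially with the same result.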
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Klimach, H., Masilamani, K., Roller, S. (2023). Exploiting Hybrid Parallelism in the LBM Implementation Musubi on Hawk. In: Resch, M.M., Gebert, J., Kobayashi, H., Bez, W. (eds) Sustained Simulation Performance 2021. WSSP 2021. Springer, Cham. https://doi.org/10.1007/978-3-031-18046-0_4
DOI: https://doi.org/10.1007/978-3-031-18046-0_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18045-3
Online ISBN: 978-3-031-18046-0
eBook Packages: Mathematics and Statistics (R0)