Exploiting Hybrid Parallelism in the LBM Implementation Musubi on Hawk

Klimach, Harald; Masilamani, Kannan; Roller, Sabine

doi:10.1007/978-3-031-18046-0_4

Harald Klimach⁵,
Kannan Masilamani⁵ &
Sabine Roller⁵

Included in the following conference series:

Joint Workshop on Sustained Simulation Performance

106 Accesses

Abstract

In this contribution we look into the efficiency and scalability of our Lattice Boltzmann implementation Musubi when using OpenMP threads within an MPI parallel computation on Hawk. The Lattice Boltzmann method enables explicit computation of incompressible flows and the mesh discretization can be automatically generated, even for complex geometries. The basic Lattice Boltzmann kernel is fairly simple and involves only few floating point operations for each lattice node. A simple loop over all lattice nodes in each partition of the MPI parallel setup lends to a straight forward loop parallelization with OpenMP. With increased core counts per compute node, the use of threads on the shared memory nodes is gaining importance, as it avoids overly small partitions with many outbound communications to neighboring partitions. We briefly discuss the hybrid parallelization of Musubi and investigate how the usage of OpenMP threads affects the performance when running simulations on the Hawk supercomputer at HLRS.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

On Portability, Performance and Scalability of an MPI OpenCL Lattice Boltzmann Code

Parallel Implementation of the Hybrid Lattice Boltzmann Method on Graphics Accelerators

Article 01 July 2022

MPC and Coarray Fortran: Alternatives to Classic MPI Implementations on the Examples of Scalable Lattice Boltzmann Flow Solvers

Author information

Authors and Affiliations

DLR e.V., Institut für Softwaremethoden zur Produkt-Virtualisierung, Zwickauer Str. 45, 01069, Dresden, Germany
Harald Klimach, Kannan Masilamani & Sabine Roller

Authors

Harald Klimach
View author publications
You can also search for this author in PubMed Google Scholar
Kannan Masilamani
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Roller
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Harald Klimach .

Editor information

Editors and Affiliations

High-Performance Computing Center, University of Stuttgart, HLRS, Stuttgart, Germany
Michael M. Resch
High-Performance Computing Center, University of Stuttgart, Stuttgart, Germany
Johannes Gebert
Graduate School of Information Sciences, Tohoku University, Aoba-ku, Japan
Hiroaki Kobayashi
NEC High Performance Computing Europe GmbH, Düsseldorf, Germany
Wolfgang Bez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Klimach, H., Masilamani, K., Roller, S. (2023). Exploiting Hybrid Parallelism in the LBM Implementation Musubi on Hawk. In: Resch, M.M., Gebert, J., Kobayashi, H., Bez, W. (eds) Sustained Simulation Performance 2021. WSSP 2021. Springer, Cham. https://doi.org/10.1007/978-3-031-18046-0_4

Download citation

DOI: https://doi.org/10.1007/978-3-031-18046-0_4
Published: 18 February 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18045-3
Online ISBN: 978-3-031-18046-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Exploiting Hybrid Parallelism in the LBM Implementation Musubi on Hawk

Abstract

Chapter PDF

Similar content being viewed by others

On Portability, Performance and Scalability of an MPI OpenCL Lattice Boltzmann Code

Parallel Implementation of the Hybrid Lattice Boltzmann Method on Graphics Accelerators

MPC and Coarray Fortran: Alternatives to Classic MPI Implementations on the Examples of Scalable Lattice Boltzmann Flow Solvers

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Exploiting Hybrid Parallelism in the LBM Implementation Musubi on Hawk

Abstract

Chapter PDF

Similar content being viewed by others

On Portability, Performance and Scalability of an MPI OpenCL Lattice Boltzmann Code

Parallel Implementation of the Hybrid Lattice Boltzmann Method on Graphics Accelerators

MPC and Coarray Fortran: Alternatives to Classic MPI Implementations on the Examples of Scalable Lattice Boltzmann Flow Solvers

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation