Abstract
The DUNE framework provides a PDE toolbox which is both flexible and efficient. Integration of hardware oriented techniques into DUNE will be necessary to maintain performance on modern and future architectures. We present the current effort to add hybrid parallelization to the DUNE grid interface, which up to now only supports MPI parallelization. In current hardware trends, we see a transition from multi-core to many-core architectures, like the Intel PHI. Techniques which worked well on traditional multi-core CPUs don’t scale anymore on many-core systems. We compare different strategies to add a thread parallel layer to DUNE and discuss their scalability and performance.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Discontinuous Galerkin Method
- Cache Line
- Instruction Level Parallelism
- Grid Interface
- Hybrid Parallelization
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
W. Bangerth, C. Burstedde, T. Heister, M. Kronbichler, Algorithms and data structures for massively parallel generic adaptive finite element codes. ACM Trans. Math. Softw. 38(2), 14:1–14:28 (2012)
P. Bastian, M. Blatt, A. Dedner, C. Engwer, R. Klöfkorn, R. Kornhuber, M. Ohlberger, O. Sander, A generic grid interface for parallel and adaptive scientific computing. part II: implementation and tests in DUNE. Computing 82(2–3), 121–138 (2008)
P. Bastian, M. Blatt, A. Dedner, C. Engwer, R. Klöfkorn, M. Ohlberger, O. Sander, A generic grid interface for parallel and adaptive scientific computing. part I: abstract framework. Computing 82(2–3), 103–119 (2008)
A. Ern, A.F. Stephansen, P. Zunino, A discontinuous galerkin method with weighted averages for advection–diffusion equations with locally small and anisotropic diffusivity. IMA J. Numer. Anal. 29(2), 235–256 (2009)
X.S. Hu, R.C. Murphy, S. Dosanjh, K. Olukotun, S. Poole, Hardware/software co-design for high performance computing: challenges and opportunities, in 2010 IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ ISSS), Scottsdale (IEEE, 2010), pp. 63–64
D.E. Keyes, Exaflop/s: the why and the how. Comptes Rendus Mécanique 339(2–3), 70–77 (2011)
R. Klöfkorn, Efficient matrix-free implementation of discontinuous galerkin methods for compressible flow problems, in ALGORITMY 2012, Proceedings of Contributed Papers and Posters, Podbanske, ed. by A. Handlovičová, Z. Minarechová, D. Ševčovič (Publishing House of STU, 2012), pp. 11–21. http://www.iam.fmph.uniba.sk/algoritmy2012/
A. Logg, K.-A. Mardal, G. Wells, Automated Solution of Differential Equations by the Finite Element Method (Springer, Berlin/New York, 2012)
S. Turek, D. Göddeke, C. Becker, S. Buijssen, S. Wobker, FEAST – Realisation of hardware-oriented numerics for HPC simulations with finite elements. Concurr. Comput.: Pract. Experience 22(6), 2247–2265 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Engwer, C., Fahlke, J. (2015). Scalable Hybrid Parallelization Strategies for the DUNE Grid Interface. In: Abdulle, A., Deparis, S., Kressner, D., Nobile, F., Picasso, M. (eds) Numerical Mathematics and Advanced Applications - ENUMATH 2013. Lecture Notes in Computational Science and Engineering, vol 103. Springer, Cham. https://doi.org/10.1007/978-3-319-10705-9_57
Download citation
DOI: https://doi.org/10.1007/978-3-319-10705-9_57
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10704-2
Online ISBN: 978-3-319-10705-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)