Abstract
The significant gap between peak and realized performance of parallel systems motivates the need for performance analysis. In order to predict the performance of a class of parallel multilevel ILU preconditioner (PBILUM), we build two performance prediction models for both the preconditioner construction phase and the solution phase. These models combine theoretical features of the preconditioners with estimates on computation cost, communications overhead, etc. Experimental simulations show that our model predication based on certain reasonable assumptions is close to the simulation results. The models may be used to predict the performance of this class of parallel preconditioners.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Allan, R. J., Heggarty, J., Goodman, M. and Ward, R. R.: Parallel application software on high performance computers. Survey of parallel performance tools and debugger. http://www.cse.clrc.ac.uk/Activity/HPCI, October, 2003.
Blanco, V., Gonzalez, P., Cabaleiro, J. C., Heras, D. B., Pena, T. F., Pombo, J. J. and Rivera, F. F.: AVISPA: Visualizing the performance prediction of parallel iterative solvers, Future Gener. Comput. Syst. 19 (2003), 721–733.
Browne, S., Dongarra, J. and London, K.: Review of performance analysis tools for MPI parallel programs. http://www.cs.utk.edu/~browne/perftools-review, October, 2003.
Gupta, M. M., Manohar, R. P. and Stephenson, J. W.: A single cell high order scheme for the convection–diffusion equation with variable coefficients, Int. J. Numer. Methods Fluids 4 (1984), 641–651.
Krishnan, S. and Kale, L. V.: Automating parallel runtime optimizations using post-mortem analysis, in Proceedings of the 10th ACM International Conference on Supercomputing, Philadelphia, May, 1996.
Kumar, V., Grama, A., Gupta, A. and Karypis, G.: Introduction to Parallel Computing, Benjamin/Cummings, Redwood City, California, 1994.
Labarta, J., Girona, S., Pillet, V., Cortes, T. and Gregoris, L.: Dip: A parallel program development environment. http://www.cepba.upc.es/dimemas, October, 2003.
Saad, Y.: Iterative Methods for Sparse Linear Systems, 2nd edn., SIAM, Philadelphia, 2003.
Saad, Y. ILUM: A multi-elimination ILU preconditioner for general sparse matrices, SIAM J. Sci. Comput. 17(4) (1996), 830–847.
Saad, Y. and Zhang, J.: BILUM: block versions of multielimination and multilevel ILU preconditioner for general sparse linear systems, SIAM J. Sci. Comput. 20(6) (1999), 2103–2121.
Saad, Y. and Zhang, J.: BILUTM: A domain-based multilevel block ILUT preconditioner for general sparse matrices, SIAM J. Matrix Anal. Appl. 21(1) (1999), 279–299.
Shen, C. and Zhang, J.: Parallel two level block ILU preconditioning techniques for solving large sparse linear systems, Paral. Comput. 28(10) (2002), 1451–1475.
Shen, C., Zhang, J. and Wang, K.: Distributed block independent set algorithms and parallel multilevel ILU preconditioners, J. Paral. Dist. Comput. 65 (2005), 331–346.
Shen, C. and Zhang, J.: A fully parallel block independent set algorithm for distributed sparse matrices, Paral. Comput. 29(11-12) (2003), 1685–1699.
VAMPIR: Visualization and Analysis of MPI Programs. http://www.pallas.de, October, 2003.
van de Velde, E. F.: Concurrent Scientific Computing, Springer, Berlin Heidelberg New York, 1994.
Yan, J., Sarukhai, S. and Mehra, P.: Performance measurement, visualization and modeling of parallel and distributed programs using the aims toolkit, Software Prac. Exp. 4(25) (1995), 429–461.
Zhang, J.: An explicit fourth-order compact finite difference scheme for three dimensional convection–diffusion equation, Commun, Numer. Methods Engrg. 14 (1998), 209–218.
Author information
Authors and Affiliations
Corresponding author
Additional information
*The research work of the authors was supported in part by the U.S. National Science Foundation under grants CCR-9988165, CCR-0092532, ACR-0202934, and ACR-234270, by the U.S. Department of Energy Office of Science under grant DE-FG02-02ER45961, by the Kentucky Science & Engineering Foundation under grant KSEF-02-264-RED-002.
Rights and permissions
About this article
Cite this article
Shen, C., Zhang, J. Performance Study and Analysis of Parallel Multilevel Preconditioners. J Math Model Algor 5, 331–352 (2006). https://doi.org/10.1007/s10852-005-9026-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10852-005-9026-x