Abstract
This paper discusses scalable implementations of sparse matrix-vector products, which are crucial for the high-performance solution of large-scale linear systems, on the cc-NUMA machine SGI Altix3700. Three storage formats for sparse matrices are evaluated, and scalability is attained by implementations that account for the page allocation mechanism of the NUMA machine. The influence of the cache and memory-bus architectures on the optimal choice of storage format is examined, and scalable converters between storage formats are shown to facilitate the exploitation of higher-performance storage formats.
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
Cite this paper
Kotakemori, H., Hasegawa, H., Kajiyama, T., Nukada, A., Suda, R., Nishida, A. (2008). Performance Evaluation of Parallel Sparse Matrix–Vector Products on SGI Altix3700. In: Mueller, M.S., Chapman, B.M., de Supinski, B.R., Malony, A.D., Voss, M. (eds) OpenMP Shared Memory Parallel Programming. IWOMP 2005. Lecture Notes in Computer Science, vol 4315. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68555-5_13
Print ISBN: 978-3-540-68554-8
Online ISBN: 978-3-540-68555-5