Summary
This chapter provides an introduction to the use of Graphics Processor Units (GPUs) as parallel computing devices. It describes the architecture, the available functionality and the programming model. Simple examples and references to freely available tools and resources motivate the reader to explore these new possibilities. An overview of the different applications of GPUs demonstrates their wide applicability, yet also highlights limitations of their use. Finally, a glimpse into the future of GPUs sketches the growing prospects of these inexpensive parallel computing devices.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Parallel Computing
- Application Program Interface
- Graphic Card
- Graphic Hardware
- Single Instruction Multiple Data
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Alienware. Alienware’s Video Array. http://www.alienware.com/alx_pages/main_content.aspx.
C. Bajaj, I. Ihm, J. Min, and J. Oh. SIMD optimization of linear expressions for programmable graphics hardware. Computer Graphics Forum, 23(4), Dec 2004.
J. Bolz, I. Farmer, E. Grinspun, and P. Schröder. Sparse matrix solvers on the GPU: Conjugate gradients and multigrid. In Proceedings of SIGGRAPH 2003, 2003.
G. Coombe, M. J. Harris, and A. Lastra. Radiosity on graphics hardware. In Proceedings. Graphics Interface 2004, 2004.
Z. Fan, F. Qiu, A. Kaufman, and S. Yoakum-Stover. GPU cluster for high performance computing. In Proceedings of the ACM/IEEE SuperComputing 2004 (SC’04), Nov 2004.
K. Fatahalian, J. Sugerman, and P. Hanrahan. Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In Graphics Hardware 2004, 2004.
R. Fernando, editor. GPU Gems: Programming Techniques, Tips, and Tricks for Real-Time Graphics. Addison-Wesley Professional, 2004.
J. Fung and S. Mann. Using multiple graphics cards as a general purpose parallel computer: Applications to computer vision. In Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004), volume 1, pages 805–808, 2004.
N. K. Govindaraju, A. Sud, S.-E. Yoon, and D. Manocha. Interactive visibility culling in complex environments using occlusion-switches. In ACM SIGGRAPH Symposium on Interactive 3D Graphics, 2003.
GPGPU-general purpose computation using graphics hardware. http://www.gpgpu.org/.
M. Harris. Real-Time Cloud Simulation and Rendering. PhD thesis, UNC Chapel Hill, Sep. 2003.
M. J. Harris, G. Coombe, T. Scheuermann, and A. Lastra. Physically-based visual simulation on graphics hardware. In Proceedings of Graphics Hardware 2002, pages 109–118, 2002.
R. Hartenstein. Data-stream-based computing: Models and architectural resources. In International Conference on Microelectronics, Devices and Materials (MIDEM 2003), Ptuj, Slovenia, Oct. 2003.
R. Hill, J. Fung, and S. Mann. Reality window manager: A user interface for mediated reality. In Proceedings of the 2004 IEEE International Conference on Image Processing. (ICIP 2004), 2004.
G. Humphreys, M. Houston, R. Ng, R. Frank, S. Ahern, P. D. Kirchner, and J. T. Klosowski. Chromium: a stream-processing framework for interactive rendering on clusters. In SIGGRAPH’02, pages 693–702, 2002.
R. A. Kendall, M. Sosonkina, W. D. Gropp, R. W. Numrich, and T. Sterling. Parallel programming models applicable to cluster computing and beyond. In A. M. Bruaset and A. Tveito, editors, Numerical Solution of Partial Differential Equations on Parallel Computers, volume 51 of Lecture Notes in Computational Science and Engineering, pages 3–54. Springer-Verlag, 2005.
T. Kim and M. Lin. Visual simulation of ice crystal growth. In Proc. ACM SIGGRAPH / Eurographics Symposium on Computer Animation, 2003.
P. Kipfer, M. Segal, and R. Westermann. UberFlow: A GPU-based particle engine. In Graphics Hardware 2004, 2004.
J. Krueger and R. Westermann. Linear algebra operators for GPU implementation of numerical algorithms. ACM Transactions on Graphics (TOG), 22(3):908–916, 2003.
A. Lefohn, J. Kniss, C. Handen, and R. Whitaker. Interactive visualization and deformation of level set surfaces using graphics hardware. In Proc. Visualization, pages 73–82. IEEE CS Press, 2003.
W. Li, X. Wei, and A. Kaufman. Implementing Lattice Boltzmann computation on graphics hardware. The Visual Computer, 2003.
Microsoft. Longhorn Developer Center. http://msdn.microsoft.com/longhorn.
NVIDIA. NVIDIA scalable link interface (SLI). http://www.nvidia.com/page/sli.html.
OpenGL Architectural Review Board (ARB). OpenGL: graphics application programming interface. http://www.opengl.org/.
M. Pharr and R. Fernando, editors. GPU Gems 2: Programming Techniques for High-Performance Graphics and General-Purpose Computation. Addison-Wesley Professional, 2005.
M. Rumpf and R. Strzodka. Level set segmentation in graphics hardware. In Proceedings. ICIP’01, volume 3, pages 1103–1106, 2001.
M. Rumpf and R. Strzodka. Using graphics cards for quantized FEM computations. In Proceedings VIIP’01, pages 193–202, 2001.
R. Samanta, T. Funkhouser, K. Li, and J. P. Singh. Hybrid sort-first and sort-last parallel rendering with a cluster of PCs. In Proceedings of SIGGRAPH/Eurographics Workshop on Graphics Hardware 2000, pages 97–108, 2000.
R. Strzodka, M. Droske, and M. Rumpf. Image registration by a regularized gradient flow-a streaming implementation in DX9 graphics hardware. Computing, 2004. to appear.
R. Strzodka and A. Telea. Generalized distance transforms and skeletons in graphics hardware. In Proceedings of EG/IEEE TCVG Symposium on Visualization VisSym’ 04, 2004.
J. D. Teresco, K. D. Devine, and J. E. Flaherty. Partitioning and dynamic load balancing for the numerical solution of partial differential equations. In A. M. Bruaset and A. Tveito, editors, Numerical Solution of Partial Differential Equations on Parallel Computers, volume 51 of Lecture Notes in Computational Science and Engineering, pages 55–88. Springer-Verlag, 2005.
M. Wilkes. The memory gap (keynote). In Solving the Memory Wall Problem Workshop, 2000. http://www.ece.neu.edu/conf/wall2k/wilkes1.pdf.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rumpf, M., Strzodka, R. (2006). Graphics Processor Units: New Prospects for Parallel Computing. In: Bruaset, A.M., Tveito, A. (eds) Numerical Solution of Partial Differential Equations on Parallel Computers. Lecture Notes in Computational Science and Engineering, vol 51. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31619-1_3
Download citation
DOI: https://doi.org/10.1007/3-540-31619-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29076-6
Online ISBN: 978-3-540-31619-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)