GPU and ROS the Use of General Parallel Processing Architecture for Robot Perception

Dalmedico, Nicolas; Simões Teixeira, Marco Antônio; Barbosa, Higor Santos; de Oliveira, André Schneider; Ramos de Arruda, Lucia Valeria; Neves Jr, Flavio

doi:10.1007/978-3-319-91590-6_12

Nicolas Dalmedico³,
Marco Antônio Simões Teixeira³,
Higor Santos Barbosa³,
André Schneider de Oliveira³,
Lucia Valeria Ramos de Arruda³ &
…
Flavio Neves Jr³

Part of the book series: Studies in Computational Intelligence ((SCI,volume 778))

4843 Accesses
1 Citations

Abstract

This chapter presents a full tutorial on how to get started on performing parallel processing with ROS. The chapter starts with a guide on how to install the complete version of ROS on the Nvidia development boards Tegra K1, Tegra X1 and Tegra X2. The tutorial includes a guide on how to update the development boards with the latest OS, and configuring CUDA, ROS and OpenCV4Tegra so that they are ready to perform the sample packages included in this chapter. The chapter follows with a description on how to install CUDA in a computer with Ubuntu operating system. After that, the integration between ROS and CUDA is covered, with many examples on how to create packages and perform parallel processing over several of the most used ROS message types. The codes and examples presented on this chapter are available in GitHub and can be found under the repository in https://github.com/air-lasca/ros-cuda.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

ROS 2 for RoboCup

Threaded Applications with the roscpp API

The Khepera IV Mobile Robot: Performance Evaluation, Sensory Data and Software Toolbox

References

C.J. Thompson, S. Hahn, M. Oskin, Using modern graphics architectures for general-purpose computing: a framework and analysis, in Proceedings 35th Annual IEEE/ACM International Symposium on Micro Architecture, (MICRO-35) (IEEE, New York, 2002), pp. 306–317
Google Scholar
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, P. Hanrahan, Brook for GPUs: stream computing on graphics hardware. ACM Trans. Graph. (TOG) 23(3), 777–786 (2004). ACM
Article Google Scholar
N.K. Govindaraju, B. Lloyd, W. Wang, M. Lin, D. Manocha, Fast computation of database operations using graphics processors, in Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data (ACM, 2004), pp. 215–226
Google Scholar
Z. Fan, F. Qiu, A. Kaufman, S. Yoakum-Stover, Gpu cluster for high performance computing, in Proceedings of the ACM/IEEE SC2004 Conference on Supercomputing (IEEE, New York, 2004), pp. 47–47
Google Scholar
A. Barak, T. Ben-Nun, E. Levy, A. Shiloh, A package for opencl based heterogeneous computing on clusters with many gpu devices, in 2010 IEEE International Conference on Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS) (IEEE, New York 2010), pp. 1–7
Google Scholar
Nvidia, Compute unified device architecture programming guide, 2007
Google Scholar
V.W. Lee, C. Kim, J. Chhugani, M. Deisher, D. Kim, A.D. Nguyen, N. Satish, M. Smelyanskiy, S. Chennupaty, P. Hammarlund et al., Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU. ACM SIGARCH Comput. Archit. News 38(3), 451–460 (2010)
Article Google Scholar
P. Micikevicius, 3d finite difference computation on GPUS using CUDA, in Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units (ACM, 2009), pp. 79–84
Google Scholar
T. Preis, P. Virnau, W. Paul, J.J. Schneider, Gpu accelerated monte carlo simulation of the 2d and 3d ising model. J. Comput. Phys. 228(12), 4468–4477 (2009)
Article Google Scholar
D. Qiu, S. May, A. Nüchter, GPU-accelerated nearest neighbor search for 3d registration, in International Conference on Computer Vision Systems (Springer, Berlin, 2009), pp. 194–203
Google Scholar
R. Ugolotti, G. Micconi, J. Aleotti, S.Cagnoni, GPU-based point cloud recognition using evolutionary algorithms, in European Conference on the Applications of Evolutionary Computation (Springer, Berlin, 2014), pp. 489–500
Google Scholar
L.M.F. Christino, Aceleração por gpu de serviços em sistemas robóticos focado no processamento de tempo real de nuvem de pontos 3d, Ph.D. dissertation, Universidade de São Paulo
Google Scholar
K.B. Kaldestad, G. Hovland, D.A. Anisi, 3d sensor-based obstacle detection comparing octrees and point clouds using CUDA. Model. Identif. Control 33(4), 123 (2012)
Article Google Scholar
M. Liu, F. Pomerleau, F. Colas, R. Siegwart, Normal estimation for pointcloud using GPU based sparse tensor voting, in 2012 IEEE International Conference on Robotics and Biomimetics (ROBIO) (IEEE, New York, 2012), pp. 91–96
Google Scholar
R.B. Rusu, S. Cousins, 3D is here: Point cloud library (PCL), in 2011 IEEE International Conference on Robotics and Automation (ICRA) (IEEE, New York, 2011), pp. 1–4
Google Scholar
P. Michel, J. Chestnutt, S. Kagami, K. Nishiwaki, J. Kuffner, T. Kanade, GPU-accelerated real-time 3D tracking for humanoid locomotion and stair climbing, in IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007 (IEEE, New York, 2007), pp. 463–469
Google Scholar
P. Henry, M. Krainin, E. Herbst, X. Ren, D. Fox, RGB-D mapping: using kinect-style depth cameras for dense 3D modeling of indoor environments. Int. J. Robot. Res. 31(5), 647–663 (2012)
Article Google Scholar
P. Merrell, A. Akbarzadeh, L. Wang, P. Mordohai, J.-M. Frahm, R. Yang, D. Nistér, M. Pollefeys, Real-time visibility-based fusion of depth maps, in IEEE 11th International Conference on Computer Vision, ICCV 2007 (IEEE, New York, 2007), pp. 1–8
Google Scholar
C. Choi, H.I. Christensen, RGB-D object tracking: A particle filter approach on GPU, in 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE, New York, 2013), pp. 1084–1091
Google Scholar
P.J.S. Leite, J.M.X.N. Teixeira, T.S.M.C. de Farias, V. Teichrieb, J. Kelner, Massively parallel nearest neighbor queries for dynamic point clouds on the GPU, in 21st International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD’09 (IEEE, New York, 2009), pp. 19–25
Google Scholar
JetsonHacks, Jetsonhacks - developing for Nvidia jetson, [Online] (2017), http://www.jetsonhacks.com/
W. Lucetti, Ros hacking for opencv on Nvidia jetson tx1 & jetson tk1, [Online], (2016), http://myzharbot.robot-home.it/blog/software/ros-nvidia-jetson-tx1-jetson-tk1-opencv-ultimate-guide/
C. Zeller, Cuda c/c$++$ basics, Nvidia Corporation, Supercomputing Tutorial, 2011, pp. 9–11
Google Scholar
SICK, Lms200 technical description, [online][retrieved sep. 11, 2014], 2003
Google Scholar
E. Rohmer, S.P. Singh, M. Freese, V-rep: a versatile and scalable robot simulation framework, in 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE, New York, 2013), pp. 1321–1326
Google Scholar

Download references

Acknowledgements

The projects of this chapter were partially funded by National Counsel of Technological and Scientific Development of Brazil (CNPq), by Coordination for the Improvement of Higher Level People (CAPES) and by National Agency of Petroleum, Natural Gas and Biofuels (ANP) together with the Financier of Studies and Projects (FINEP) and Brazilian Ministry of Science and Technology (MCT) through the ANP Human Resources Program for the Petroleum and Gas Sector - PRH-ANP/MCT PRH10-UTFPR. We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Tegra X1 and Tegra K1 development boards used for this chapter.

Author information

Authors and Affiliations

Federal University of Technology - Parana, Av. Sete de Setembro, Curitiba, 3165, Brazil
Nicolas Dalmedico, Marco Antônio Simões Teixeira, Higor Santos Barbosa, André Schneider de Oliveira, Lucia Valeria Ramos de Arruda & Flavio Neves Jr

Authors

Nicolas Dalmedico
View author publications
You can also search for this author in PubMed Google Scholar
Marco Antônio Simões Teixeira
View author publications
You can also search for this author in PubMed Google Scholar
Higor Santos Barbosa
View author publications
You can also search for this author in PubMed Google Scholar
André Schneider de Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Lucia Valeria Ramos de Arruda
View author publications
You can also search for this author in PubMed Google Scholar
Flavio Neves Jr
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nicolas Dalmedico .

Editor information

Editors and Affiliations

College of Computer Science and Information Systems, Prince Sultan University, Riyadh, Saudi Arabia
Anis Koubaa

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Dalmedico, N., Simões Teixeira, M.A., Barbosa, H.S., de Oliveira, A.S., Ramos de Arruda, L.V., Neves Jr, F. (2019). GPU and ROS the Use of General Parallel Processing Architecture for Robot Perception. In: Koubaa, A. (eds) Robot Operating System (ROS). Studies in Computational Intelligence, vol 778. Springer, Cham. https://doi.org/10.1007/978-3-319-91590-6_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-91590-6_12
Published: 06 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91589-0
Online ISBN: 978-3-319-91590-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

GPU and ROS the Use of General Parallel Processing Architecture for Robot Perception

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

ROS 2 for RoboCup

Threaded Applications with the roscpp API

The Khepera IV Mobile Robot: Performance Evaluation, Sensory Data and Software Toolbox

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

GPU and ROS the Use of General Parallel Processing Architecture for Robot Perception

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

ROS 2 for RoboCup

Threaded Applications with the roscpp API

The Khepera IV Mobile Robot: Performance Evaluation, Sensory Data and Software Toolbox

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation