Abstract
In this paper we present an algorithm for the robust 6D pose estimation with an RGB-D camera in harsh and unstructured environments using object detection. While the pose estimation uses clustering and segmentation to find a robust point in multiple frames to track changes in the position of the camera, its functionality is enhanced with Faster-RCNN for classification and detection, providing the operator with information about the object of interest. This work further facilitates the goal of increasing the robot’s autonomy and helping operators to recover 3D reconstructions of the objects to be manipulated with the robot.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Engel, J., Schps, T., Cremers, D.: LSD-SLAM: Large-scale direct monocular SLAM. In: European Conference on Computer Vision, pp. 834–849, Springer, Cham, Berlin (2014)
Engel, J., Koltun, V., Cremers, D.: Direct sparse odometry. IEEE Trans. Pattern Anal. Mach. Intell. (2017)
Hinterstoisser, S., et al.: Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 858–865 (2011)
Prisacariu, V.A., Reid, I.D.: PWP3D: Real-time segmentation and tracking of 3D objects. Int. J. Comput. Vis. 98(3), 335–354 (2012)
Song, S., Xiao, J.: Sliding shapes for 3d object detection in depth images. In: European Conference on Computer Vision, pp. 634–651, Springer, Cham, Berlin (2014)
Di Castro, M., Vera, J.C., Masi, A. Ferre, M.: Novel pose estimation system for precise robotic manipulation in unstructured environment, pp. 50–55 (2017). https://doi.org/10.5220/0006426700500055
Zeng, A., et al.: Multi-view self-supervised deep learning for 6d pose estimation in the amazon picking challenge. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 1386–1383, IEEE (2017)
Song, S., Xiao, J.: Deep sliding shapes for amodal 3D object detection in RGB-D images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 808–816 (2016)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Liu, W., et al.: Ssd: Single shot multibox detector. In: European Conference on Computer Vision, pp. 21–37, Springer, Cham (2016)
Hong, S., You, T., Kwak, S., Han, B.: Online tracking by learning discriminative saliency map with convolutional neural network. In: International Conference on Machine Learning pp. 597–606 (2015)
He, K., Gkioxari, G., Dollr, P., Girshick, R.: Mask r-cnn (2017). arXiv:1703.06870
Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Realtime multi-person 2d pose estimation using part affinity fields (2016). arXiv:1611.08050
Zhang, H., et al.: Stackgan: text to photo-realistic image synthesis with stacked generative adversarial networks (2016). arXiv:1612.03242
Lin, T.Y., et al.: Microsoft coco: common objects in context. In: European Conference on Computer Vision, pp. 740–755, Springer, Cham (2014)
Microsoft: Kinect hardware. https://developer.microsoft.com/en-us/windows/kinect/hardware
Intel: Intel RealSense Camera R200 Datasheet. https://software.intel.com/sites/default/files/managed/d7/a9/realsense-camera-r200-product-datasheet.pdf
Intel: Intel RealSense Camera SR300 Datasheet. https://software.intel.com/sites/default/files/managed/0c/ec/realsense-sr300-product-datasheet-rev-1-0.pdf
Orbbec: Orbbec How our 3D camera works. [Online]. https://orbbec3d.com/products/
Orbbec: Orbbec Astra Pro. https://orbbec3d.com/product-astra-pro/
Lunghi, G., Prades, R.M., Di Castro, M.: An advanced, adaptive and Multimodal Graphical User Interface for Human-robot Teleoperation in Radioactive Scenarios. In: ICINCO (2), pp. 224–231 (2016)
Dworak, A., et al.: Middleware trends and market leaders 2011. In: Conference Proceedings, vol. 111010. No. CERN-ATS-2011-196 (2011)
CERN: Controls Middleware. https://wikis.cern.ch/display/MW/CMW+-+Controls+Middleware/
Di Castro, M., et al.: A dual arm robotic platform control for navigation, inspection and telemanipulation. In: Proceedings of ICALEPCS2017, Barcelona, Spain (2017)
Assmann, R.W., et al.: The final collimation system for the LHC. No. LHC-PROJECT-Report-919 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
di Castro, M., Camarero Vera, J., Ferre, M., Masi, A. (2020). Object Detection and 6D Pose Estimation for Precise Robotic Manipulation in Unstructured Environments. In: Gusikhin, O., Madani, K. (eds) Informatics in Control, Automation and Robotics . ICINCO 2017. Lecture Notes in Electrical Engineering, vol 495. Springer, Cham. https://doi.org/10.1007/978-3-030-11292-9_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-11292-9_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-11291-2
Online ISBN: 978-3-030-11292-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)