Abstract
Visually impaired and blind people face several difficulties in their daily life. This was the primary motivation of this work as to create and assemble an object detector that can assist people with visual impairments using OpenCV and TensorFlow API on Raspberry Pi and provide an audio output for the detected objects using Espeak; Text-to-Speech Synthesizer. Single Shot Detector (SSD) model with MobileNet v2 has been employed to perform the detection with high accuracy and processing speed. The scripts are written in Python which utilizes the model to recognize the objects with boxes and provide class of the objects. The recognized image category is extracted and stored in a text file. The developed system provides aid to a visually impaired person for performing tasks independently using real-time object detection and identification technology. Developed system can successfully provide information about detected object in the form of an audio output to the visually impaired person.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Abbas Q, Ibrahim MEA, Arfan Jaffar M (2019) A comprehensive review of recent advances on deep vision systems. Artif Intell Rev 52(1):39–76
Brady E et al (2013) Visual challenges in the everyday lives of blind people. In: Proceedings of the SIGCHI conference on human factors in computing systems
Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I, Wojna Z, Song Y, Guadarrama S (2016) Speed/accuracy trade-offs for modern convolutional object detectors. arXiv preprint arXiv:1611.10012
Espeak, Available: http://espeak.sourceforge.net/index.html, Accessed:23Jan2020
Google Text-to-Speech—Apps on Google Play
SSD_mobilenet Available: https://github.com/tensorflow/models/tree/master/research/object_detection/models
FasterRCNN_inception. Available:https://github.com/tensorflow/models/tree/master/research/object_detection/models
TensorFlow Object Detection. Available: www.tensorflow.org
Zhang Y, Peng H, Hu P (2017) A report on towards real-time detection and camera triggering
Redmon J, Farhadi A (2016) YOLO9000: better, faster, stronger. arXiv:1612.08242. Available from https://pjreddie.com/darknet/yolov2
Vijaya NM, Kiran G (2017) Automatic surveillance using raspberry pi and arduino. IJESRT
Kirpan OR, Baviskar PI, Khawase SD, Mankar AS, Ramteke KA (2017) Object detection on raspberry pi. Int J Eng Sci Comput 7(3)
Rajalakshmi R, Vishnupriya K, Sathyapriya MS, Vishvaardhini GR (2018) Smart navigation system for the visually impaired using Tensorflow. IJARIIE
Nishajith A, Nivedha J, Nair SS, Mohammed Shaffi J (2018) Smart cap—wearable visual guidance system for blind. In: ICIRCA
https://www.raspberrypi.org/magpi-issues/Beginners_Guide_v1.pdf
Ghoury S, Sungur C, Durdu A (2019) RealTime diseases detection of grape and grape leaves using Faster R-CNN and SSD MobileNet architectures. In: ICATCES
Hui J (2018) SSD object detection: single shot multibox detector for real-time processing. Available:https://medium.com/@jonathan_hui/ssd-object-detection-single-shot-multibox-detector-for-real-time-processing-9bd8deac0e06
Khamparia A, Singh KM (2019) A systematic survey on deep learning architectures and applications. Exp Syst. https://doi.org/10.1111/exsy.12400
Caesar H, Jasper U, Ferrari V (2016) Region-based semantic segmentation with end-to-end training, computer vision–ECCV. Springer International Publishing
Juras E EdjeElectronics TensorFlow-Object-Detection-API-Tutorial-Train-Multiple-Objects-Windows-10. Available: https://github.com/EdjeElectronics/TensorFlow-Object-Detection-API-Tutorial-Train-Multiple-Objects-Windows-10
Stanford Vision Lab (2016) Stanford University, Princeton University. Available at: www.image-net.org)
Juras E (2020) Edje electronics object detection Github with raspberry pi and TensorFlow API. Available at: https://github.com/EdjeElectronics/TensorFlow-Object-Detection-on-the-Raspberry-Pi
(2020) Speech recognition in python (text to speech). [Online]. Available at: https://pythonprogramminglanguage.com/text-to-speech/
Ren S, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks. arXiv:1506.01497
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Naqvi, K., Hazela, B., Mishra, S., Asthana, P. (2021). Employing Real-Time Object Detection for Visually Impaired People. In: Khanna, A., Gupta, D., Pólkowski, Z., Bhattacharyya, S., Castillo, O. (eds) Data Analytics and Management. Lecture Notes on Data Engineering and Communications Technologies, vol 54. Springer, Singapore. https://doi.org/10.1007/978-981-15-8335-3_23
Download citation
DOI: https://doi.org/10.1007/978-981-15-8335-3_23
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8334-6
Online ISBN: 978-981-15-8335-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)