Employing Real-Time Object Detection for Visually Impaired People

Naqvi, Kashish; Hazela, Bramah; Mishra, Sumita; Asthana, Pallavi

doi:10.1007/978-981-15-8335-3_23

Kashish Naqvi⁷,
Bramah Hazela⁸,
Sumita Mishra⁸ &
…
Pallavi Asthana⁸

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 54))

1143 Accesses
10 Citations

Abstract

Visually impaired and blind people face several difficulties in their daily life. This was the primary motivation of this work as to create and assemble an object detector that can assist people with visual impairments using OpenCV and TensorFlow API on Raspberry Pi and provide an audio output for the detected objects using Espeak; Text-to-Speech Synthesizer. Single Shot Detector (SSD) model with MobileNet v2 has been employed to perform the detection with high accuracy and processing speed. The scripts are written in Python which utilizes the model to recognize the objects with boxes and provide class of the objects. The recognized image category is extracted and stored in a text file. The developed system provides aid to a visually impaired person for performing tasks independently using real-time object detection and identification technology. Developed system can successfully provide information about detected object in the form of an audio output to the visually impaired person.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Travel Aid for Visually Impaired: R-Cane

Let the Blind See: An AIIoT-Based Device for Real-Time Object Recognition with the Voice Conversion

Vision Connect: A Smartphone Based Object Detection for Visually Impaired People

References

Abbas Q, Ibrahim MEA, Arfan Jaffar M (2019) A comprehensive review of recent advances on deep vision systems. Artif Intell Rev 52(1):39–76
Google Scholar
Brady E et al (2013) Visual challenges in the everyday lives of blind people. In: Proceedings of the SIGCHI conference on human factors in computing systems
Google Scholar
Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I, Wojna Z, Song Y, Guadarrama S (2016) Speed/accuracy trade-offs for modern convolutional object detectors. arXiv preprint arXiv:1611.10012
Espeak, Available: http://espeak.sourceforge.net/index.html, Accessed:23Jan2020
Google Text-to-Speech—Apps on Google Play
Google Scholar
SSD_mobilenet Available: https://github.com/tensorflow/models/tree/master/research/object_detection/models
FasterRCNN_inception. Available:https://github.com/tensorflow/models/tree/master/research/object_detection/models
TensorFlow Object Detection. Available: www.tensorflow.org
Zhang Y, Peng H, Hu P (2017) A report on towards real-time detection and camera triggering
Google Scholar
Redmon J, Farhadi A (2016) YOLO9000: better, faster, stronger. arXiv:1612.08242. Available from https://pjreddie.com/darknet/yolov2
Vijaya NM, Kiran G (2017) Automatic surveillance using raspberry pi and arduino. IJESRT
Google Scholar
Kirpan OR, Baviskar PI, Khawase SD, Mankar AS, Ramteke KA (2017) Object detection on raspberry pi. Int J Eng Sci Comput 7(3)
Google Scholar
Rajalakshmi R, Vishnupriya K, Sathyapriya MS, Vishvaardhini GR (2018) Smart navigation system for the visually impaired using Tensorflow. IJARIIE
Google Scholar
Nishajith A, Nivedha J, Nair SS, Mohammed Shaffi J (2018) Smart cap—wearable visual guidance system for blind. In: ICIRCA
Google Scholar
https://www.raspberrypi.org/magpi-issues/Beginners_Guide_v1.pdf
https://www.raspberrypi.org/documentation/hardware/camera/
Ghoury S, Sungur C, Durdu A (2019) RealTime diseases detection of grape and grape leaves using Faster R-CNN and SSD MobileNet architectures. In: ICATCES
Google Scholar
Hui J (2018) SSD object detection: single shot multibox detector for real-time processing. Available:https://medium.com/@jonathan_hui/ssd-object-detection-single-shot-multibox-detector-for-real-time-processing-9bd8deac0e06
Khamparia A, Singh KM (2019) A systematic survey on deep learning architectures and applications. Exp Syst. https://doi.org/10.1111/exsy.12400
Caesar H, Jasper U, Ferrari V (2016) Region-based semantic segmentation with end-to-end training, computer vision–ECCV. Springer International Publishing
Google Scholar
Juras E EdjeElectronics TensorFlow-Object-Detection-API-Tutorial-Train-Multiple-Objects-Windows-10. Available: https://github.com/EdjeElectronics/TensorFlow-Object-Detection-API-Tutorial-Train-Multiple-Objects-Windows-10
Stanford Vision Lab (2016) Stanford University, Princeton University. Available at: www.image-net.org)
Juras E (2020) Edje electronics object detection Github with raspberry pi and TensorFlow API. Available at: https://github.com/EdjeElectronics/TensorFlow-Object-Detection-on-the-Raspberry-Pi
(2020) Speech recognition in python (text to speech). [Online]. Available at: https://pythonprogramminglanguage.com/text-to-speech/
Ren S, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks. arXiv:1506.01497

Download references

Author information

Authors and Affiliations

Department of Computer Science & Engineering, Amity University, Lucknow Campus, UP, India
Kashish Naqvi
Department of Computer Science & Engineering and Electronics & Communication Engineering, Amity University, Lucknow Campus, UP, India
Bramah Hazela, Sumita Mishra & Pallavi Asthana

Authors

Kashish Naqvi
View author publications
You can also search for this author in PubMed Google Scholar
Bramah Hazela
View author publications
You can also search for this author in PubMed Google Scholar
Sumita Mishra
View author publications
You can also search for this author in PubMed Google Scholar
Pallavi Asthana
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bramah Hazela .

Editor information

Editors and Affiliations

Maharaja Agrasen Institute of Technology, New Delhi, India
Ashish Khanna
Maharaja Agrasen Institute of Technology, New Delhi, India
Deepak Gupta
Jan Wyzykowski University, Polkowice, Poland
Zdzisław Pólkowski
CHRIST (Deemed to be University), Bengaluru, India
Siddhartha Bhattacharyya
Tijuana Institute of Technology, Tijuana, Mexico
Oscar Castillo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Naqvi, K., Hazela, B., Mishra, S., Asthana, P. (2021). Employing Real-Time Object Detection for Visually Impaired People. In: Khanna, A., Gupta, D., Pólkowski, Z., Bhattacharyya, S., Castillo, O. (eds) Data Analytics and Management. Lecture Notes on Data Engineering and Communications Technologies, vol 54. Springer, Singapore. https://doi.org/10.1007/978-981-15-8335-3_23

Download citation

DOI: https://doi.org/10.1007/978-981-15-8335-3_23
Published: 05 January 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8334-6
Online ISBN: 978-981-15-8335-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Employing Real-Time Object Detection for Visually Impaired People

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Travel Aid for Visually Impaired: R-Cane

Let the Blind See: An AIIoT-Based Device for Real-Time Object Recognition with the Voice Conversion

Vision Connect: A Smartphone Based Object Detection for Visually Impaired People

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Employing Real-Time Object Detection for Visually Impaired People

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Travel Aid for Visually Impaired: R-Cane

Let the Blind See: An AIIoT-Based Device for Real-Time Object Recognition with the Voice Conversion

Vision Connect: A Smartphone Based Object Detection for Visually Impaired People

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation