Skip to main content

Employing Real-Time Object Detection for Visually Impaired People

  • Conference paper
  • First Online:
Data Analytics and Management

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 54))

Abstract

Visually impaired and blind people face several difficulties in their daily life. This was the primary motivation of this work as to create and assemble an object detector that can assist people with visual impairments using OpenCV and TensorFlow API on Raspberry Pi and provide an audio output for the detected objects using Espeak; Text-to-Speech Synthesizer. Single Shot Detector (SSD) model with MobileNet v2 has been employed to perform the detection with high accuracy and processing speed. The scripts are written in Python which utilizes the model to recognize the objects with boxes and provide class of the objects. The recognized image category is extracted and stored in a text file. The developed system provides aid to a visually impaired person for performing tasks independently using real-time object detection and identification technology. Developed system can successfully provide information about detected object in the form of an audio output to the visually impaired person.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Abbas Q, Ibrahim MEA, Arfan Jaffar M (2019) A comprehensive review of recent advances on deep vision systems. Artif Intell Rev 52(1):39–76

    Google Scholar 

  2. Brady E et al (2013) Visual challenges in the everyday lives of blind people. In: Proceedings of the SIGCHI conference on human factors in computing systems

    Google Scholar 

  3. Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I, Wojna Z, Song Y, Guadarrama S (2016) Speed/accuracy trade-offs for modern convolutional object detectors. arXiv preprint arXiv:1611.10012

  4. Espeak, Available: http://espeak.sourceforge.net/index.html, Accessed:23Jan2020

  5. Google Text-to-Speech—Apps on Google Play

    Google Scholar 

  6. SSD_mobilenet Available: https://github.com/tensorflow/models/tree/master/research/object_detection/models

  7. FasterRCNN_inception. Available:https://github.com/tensorflow/models/tree/master/research/object_detection/models

  8. TensorFlow Object Detection. Available: www.tensorflow.org

  9. Zhang Y, Peng H, Hu P (2017) A report on towards real-time detection and camera triggering

    Google Scholar 

  10. Redmon J, Farhadi A (2016) YOLO9000: better, faster, stronger. arXiv:1612.08242. Available from https://pjreddie.com/darknet/yolov2

  11. Vijaya NM, Kiran G (2017) Automatic surveillance using raspberry pi and arduino. IJESRT

    Google Scholar 

  12. Kirpan OR, Baviskar PI, Khawase SD, Mankar AS, Ramteke KA (2017) Object detection on raspberry pi. Int J Eng Sci Comput 7(3)

    Google Scholar 

  13. Rajalakshmi R, Vishnupriya K, Sathyapriya MS, Vishvaardhini GR (2018) Smart navigation system for the visually impaired using Tensorflow. IJARIIE

    Google Scholar 

  14. Nishajith A, Nivedha J, Nair SS, Mohammed Shaffi J (2018) Smart cap—wearable visual guidance system for blind. In: ICIRCA

    Google Scholar 

  15. https://www.raspberrypi.org/magpi-issues/Beginners_Guide_v1.pdf

  16. https://www.raspberrypi.org/documentation/hardware/camera/

  17. Ghoury S, Sungur C, Durdu A (2019) RealTime diseases detection of grape and grape leaves using Faster R-CNN and SSD MobileNet architectures. In: ICATCES

    Google Scholar 

  18. Hui J (2018) SSD object detection: single shot multibox detector for real-time processing. Available:https://medium.com/@jonathan_hui/ssd-object-detection-single-shot-multibox-detector-for-real-time-processing-9bd8deac0e06

  19. Khamparia A, Singh KM (2019) A systematic survey on deep learning architectures and applications. Exp Syst. https://doi.org/10.1111/exsy.12400

  20. Caesar H, Jasper U, Ferrari V (2016) Region-based semantic segmentation with end-to-end training, computer vision–ECCV. Springer International Publishing

    Google Scholar 

  21. Juras E EdjeElectronics TensorFlow-Object-Detection-API-Tutorial-Train-Multiple-Objects-Windows-10. Available: https://github.com/EdjeElectronics/TensorFlow-Object-Detection-API-Tutorial-Train-Multiple-Objects-Windows-10

  22. Stanford Vision Lab (2016) Stanford University, Princeton University. Available at: www.image-net.org)

  23. Juras E (2020) Edje electronics object detection Github with raspberry pi and TensorFlow API. Available at: https://github.com/EdjeElectronics/TensorFlow-Object-Detection-on-the-Raspberry-Pi

  24. (2020) Speech recognition in python (text to speech). [Online]. Available at: https://pythonprogramminglanguage.com/text-to-speech/

  25. Ren S, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks. arXiv:1506.01497

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bramah Hazela .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Naqvi, K., Hazela, B., Mishra, S., Asthana, P. (2021). Employing Real-Time Object Detection for Visually Impaired People. In: Khanna, A., Gupta, D., Pólkowski, Z., Bhattacharyya, S., Castillo, O. (eds) Data Analytics and Management. Lecture Notes on Data Engineering and Communications Technologies, vol 54. Springer, Singapore. https://doi.org/10.1007/978-981-15-8335-3_23

Download citation

Publish with us

Policies and ethics