Skip to main content

A “Bright-on-Dark, Dark-on-Bright” Approach to Multi-lingual Scene Text Detection

  • Conference paper
  • First Online:
Proceedings of International Conference on Frontiers in Computing and Systems

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1255))

  • 784 Accesses

Abstract

Detecting texts from natural scene images is currently becoming a popular trend in the field of information retrieval. Researchers find it interesting due to the challenges faced while processing an image. In this paper, a relatively simple but effective approach is proposed where bright texts on a dark background and dark texts on a bright background are detected in natural scene images. This approach is based on the fact that there is usually stark contrast between the background and foreground. Hence, K-means clustering algorithm is applied on the gray levels of the image where bright and dark gray level clusters are generated. Each of these clusters are then analyzed to extract the text components. This method proves to be robust compared to the existing methods, giving reasonably satisfactory results when evaluated on Multi-lingual standard datasets like KAIST and MLe2e, and an in-house dataset of images having Multi-lingual texts written in English, Bangla and Hindi.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Mukhopadhyay, A., Kumar, S., Chowdhury, S.R., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Multi-Lingual Scene Text Detection Using One-Class Classifier. Int. J. Comput. Vis. Image Proce. (IJCVIP) 9(2), 48–65 (2019)

    Google Scholar 

  2. Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)

    Article  Google Scholar 

  3. Chen, H., Tsai, S.S., Schroth, G., Chen, D.M., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International Conference on Image Processing, pp. 2609–2612. IEEE, Sep 2011

    Google Scholar 

  4. Chakraborty, N., Biswas, S., Mollah, A. F., Basu, S., & Sarkar, R.: Multi-lingual scene text detection by local histogram analysis and selection of optimal area for MSER. In: International Conference on Computational Intelligence, Communications, and Business Analytics, pp. 234–242. Springer, Singapore, July 2018

    Google Scholar 

  5. Panda, S., Ash, S., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Parameter tuning in MSER for text localization in multi-lingual camera-captured scene text images. In: Das A., Nayak J., Naik B., Pati S., Pelusi D. (eds.), Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol. 999. Springer, Singapore (2020)

    Google Scholar 

  6. Dutta, I.N., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Multi-lingual text localization from camera captured images based on foreground homogenity analysis. In: Recent Developments in Machine Learning and Data Analytics, pp. 149–158. Springer, Singapore (2019)

    Google Scholar 

  7. Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2963–2970. IEEE, June 2010

    Google Scholar 

  8. Özgen, A.C., Fasounaki, M., Ekenel, H.K.: Text detection in natural and computer-generated images. In: 2018 26th Signal Processing and Communications Applications Conference (SIU), pp. 1–4. IEEE, May 2018

    Google Scholar 

  9. Agrawal, A., Mukherjee, P., Srivastava, S., Lall, B.: Enhanced characterness for text detection in the wild. In: Proceedings of 2nd International Conference on Computer Vision & Image Processing, pp. 359–369. Springer, Singapore (2018)

    Google Scholar 

  10. Gomez, L., Karatzas, D.: A fine-grained approach to scene text script identification. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp. 192–197. IEEE, Apr 2016

    Google Scholar 

  11. Gomez, L., Nicolaou, A., Karatzas, D.: Improving patch-based scene text script identification with ensembles of conjoined networks. Pattern Recogn. 67(X), 85–96 (2017)

    Google Scholar 

  12. Crete, F., Dolmiere, T., Ladret, P., Nicolas, M.: The blur effect: perception and estimation with a new no-reference perceptual blur metric. In: Human vision and electronic imaging XII, vol. 6492, p. 64920I. International Society for Optics and Photonics, Feb 2007

    Google Scholar 

  13. Fish, D.A., Brinicombe, A.M., Pike, E.R., Walker, J.G.: Blind deconvolution by means of the Richardson-Lucy algorithm. JOSA A 12(1), 58–65 (1995)

    Article  Google Scholar 

  14. Zuiderveld, K.: Contrast limited adaptive histogram equalization. In: Graphics Gems IV, pp. 474–485. Academic Press Professional, Inc, Aug 1994

    Google Scholar 

Download references

Acknowledgements

This work is partially supported by the CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India, PURSE-II and UPE-II, project. SB is partially funded by DBT grant (BT/PR16356/BID/7/596/2016). RS, SB and AFM are partially funded by DST grant (EMR/2016/007213).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Neelotpal Chakraborty .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R. (2021). A “Bright-on-Dark, Dark-on-Bright” Approach to Multi-lingual Scene Text Detection. In: Bhattacharjee, D., Kole, D.K., Dey, N., Basu, S., Plewczynski, D. (eds) Proceedings of International Conference on Frontiers in Computing and Systems. Advances in Intelligent Systems and Computing, vol 1255. Springer, Singapore. https://doi.org/10.1007/978-981-15-7834-2_18

Download citation

Publish with us

Policies and ethics