A “Bright-on-Dark, Dark-on-Bright” Approach to Multi-lingual Scene Text Detection

Chakraborty, Neelotpal; Mollah, Ayatullah Faruk; Basu, Subhadip; Sarkar, Ram

doi:10.1007/978-981-15-7834-2_18

Neelotpal Chakraborty¹⁹,
Ayatullah Faruk Mollah²⁰,
Subhadip Basu¹⁹ &
…
Ram Sarkar¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1255))

784 Accesses

Abstract

Detecting texts from natural scene images is currently becoming a popular trend in the field of information retrieval. Researchers find it interesting due to the challenges faced while processing an image. In this paper, a relatively simple but effective approach is proposed where bright texts on a dark background and dark texts on a bright background are detected in natural scene images. This approach is based on the fact that there is usually stark contrast between the background and foreground. Hence, K-means clustering algorithm is applied on the gray levels of the image where bright and dark gray level clusters are generated. Each of these clusters are then analyzed to extract the text components. This method proves to be robust compared to the existing methods, giving reasonably satisfactory results when evaluated on Multi-lingual standard datasets like KAIST and MLe2e, and an in-house dataset of images having Multi-lingual texts written in English, Bangla and Hindi.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Framework for Multi-lingual Scene Text Detection Using K-means++ and Memetic Algorithms

Multi-lingual Text Localization from Camera Captured Images Based on Foreground Homogenity Analysis

Multi-lingual Scene Text Detection by Local Histogram Analysis and Selection of Optimal Area for MSER

References

Mukhopadhyay, A., Kumar, S., Chowdhury, S.R., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Multi-Lingual Scene Text Detection Using One-Class Classifier. Int. J. Comput. Vis. Image Proce. (IJCVIP) 9(2), 48–65 (2019)
Google Scholar
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)
Article Google Scholar
Chen, H., Tsai, S.S., Schroth, G., Chen, D.M., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International Conference on Image Processing, pp. 2609–2612. IEEE, Sep 2011
Google Scholar
Chakraborty, N., Biswas, S., Mollah, A. F., Basu, S., & Sarkar, R.: Multi-lingual scene text detection by local histogram analysis and selection of optimal area for MSER. In: International Conference on Computational Intelligence, Communications, and Business Analytics, pp. 234–242. Springer, Singapore, July 2018
Google Scholar
Panda, S., Ash, S., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Parameter tuning in MSER for text localization in multi-lingual camera-captured scene text images. In: Das A., Nayak J., Naik B., Pati S., Pelusi D. (eds.), Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol. 999. Springer, Singapore (2020)
Google Scholar
Dutta, I.N., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Multi-lingual text localization from camera captured images based on foreground homogenity analysis. In: Recent Developments in Machine Learning and Data Analytics, pp. 149–158. Springer, Singapore (2019)
Google Scholar
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2963–2970. IEEE, June 2010
Google Scholar
Özgen, A.C., Fasounaki, M., Ekenel, H.K.: Text detection in natural and computer-generated images. In: 2018 26th Signal Processing and Communications Applications Conference (SIU), pp. 1–4. IEEE, May 2018
Google Scholar
Agrawal, A., Mukherjee, P., Srivastava, S., Lall, B.: Enhanced characterness for text detection in the wild. In: Proceedings of 2nd International Conference on Computer Vision & Image Processing, pp. 359–369. Springer, Singapore (2018)
Google Scholar
Gomez, L., Karatzas, D.: A fine-grained approach to scene text script identification. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp. 192–197. IEEE, Apr 2016
Google Scholar
Gomez, L., Nicolaou, A., Karatzas, D.: Improving patch-based scene text script identification with ensembles of conjoined networks. Pattern Recogn. 67(X), 85–96 (2017)
Google Scholar
Crete, F., Dolmiere, T., Ladret, P., Nicolas, M.: The blur effect: perception and estimation with a new no-reference perceptual blur metric. In: Human vision and electronic imaging XII, vol. 6492, p. 64920I. International Society for Optics and Photonics, Feb 2007
Google Scholar
Fish, D.A., Brinicombe, A.M., Pike, E.R., Walker, J.G.: Blind deconvolution by means of the Richardson-Lucy algorithm. JOSA A 12(1), 58–65 (1995)
Article Google Scholar
Zuiderveld, K.: Contrast limited adaptive histogram equalization. In: Graphics Gems IV, pp. 474–485. Academic Press Professional, Inc, Aug 1994
Google Scholar

Download references

Acknowledgements

This work is partially supported by the CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India, PURSE-II and UPE-II, project. SB is partially funded by DBT grant (BT/PR16356/BID/7/596/2016). RS, SB and AFM are partially funded by DST grant (EMR/2016/007213).

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India
Neelotpal Chakraborty, Subhadip Basu & Ram Sarkar
Department of Computer Science and Engineering, Aliah University, Kolkata, 700160, India
Ayatullah Faruk Mollah

Authors

Neelotpal Chakraborty
View author publications
You can also search for this author in PubMed Google Scholar
Ayatullah Faruk Mollah
View author publications
You can also search for this author in PubMed Google Scholar
Subhadip Basu
View author publications
You can also search for this author in PubMed Google Scholar
Ram Sarkar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Neelotpal Chakraborty .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Jadavpur University, Kolkata, West Bengal, India
Debotosh Bhattacharjee
Department of Computer Science and Engineering, Jalpaiguri Government Engineering College, Jalpaiguri, West Bengal, India
Dipak Kumar Kole
Department of Information Technology, Techno India College of Technology, Kolkata, West Bengal, India
Nilanjan Dey
Department of Computer Science and Engineering, Jadavpur University, Kolkata, West Bengal, India
Subhadip Basu
Center of New Technologies, University of Warsaw, Warsaw, Poland
Dariusz Plewczynski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R. (2021). A “Bright-on-Dark, Dark-on-Bright” Approach to Multi-lingual Scene Text Detection. In: Bhattacharjee, D., Kole, D.K., Dey, N., Basu, S., Plewczynski, D. (eds) Proceedings of International Conference on Frontiers in Computing and Systems. Advances in Intelligent Systems and Computing, vol 1255. Springer, Singapore. https://doi.org/10.1007/978-981-15-7834-2_18

Download citation

DOI: https://doi.org/10.1007/978-981-15-7834-2_18
Published: 24 November 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7833-5
Online ISBN: 978-981-15-7834-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

A “Bright-on-Dark, Dark-on-Bright” Approach to Multi-lingual Scene Text Detection

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Framework for Multi-lingual Scene Text Detection Using K-means++ and Memetic Algorithms

Multi-lingual Text Localization from Camera Captured Images Based on Foreground Homogenity Analysis

Multi-lingual Scene Text Detection by Local Histogram Analysis and Selection of Optimal Area for MSER

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A “Bright-on-Dark, Dark-on-Bright” Approach to Multi-lingual Scene Text Detection

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Framework for Multi-lingual Scene Text Detection Using K-means++ and Memetic Algorithms

Multi-lingual Text Localization from Camera Captured Images Based on Foreground Homogenity Analysis

Multi-lingual Scene Text Detection by Local Histogram Analysis and Selection of Optimal Area for MSER

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation