Abstract
Detecting texts from natural scene images is currently becoming a popular trend in the field of information retrieval. Researchers find it interesting due to the challenges faced while processing an image. In this paper, a relatively simple but effective approach is proposed where bright texts on a dark background and dark texts on a bright background are detected in natural scene images. This approach is based on the fact that there is usually stark contrast between the background and foreground. Hence, K-means clustering algorithm is applied on the gray levels of the image where bright and dark gray level clusters are generated. Each of these clusters are then analyzed to extract the text components. This method proves to be robust compared to the existing methods, giving reasonably satisfactory results when evaluated on Multi-lingual standard datasets like KAIST and MLe2e, and an in-house dataset of images having Multi-lingual texts written in English, Bangla and Hindi.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mukhopadhyay, A., Kumar, S., Chowdhury, S.R., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Multi-Lingual Scene Text Detection Using One-Class Classifier. Int. J. Comput. Vis. Image Proce. (IJCVIP) 9(2), 48–65 (2019)
Matas, J., Chum, O., Urban, M., Pajdla, T.: Robust wide-baseline stereo from maximally stable extremal regions. Image Vis. Comput. 22(10), 761–767 (2004)
Chen, H., Tsai, S.S., Schroth, G., Chen, D.M., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 2011 18th IEEE International Conference on Image Processing, pp. 2609–2612. IEEE, Sep 2011
Chakraborty, N., Biswas, S., Mollah, A. F., Basu, S., & Sarkar, R.: Multi-lingual scene text detection by local histogram analysis and selection of optimal area for MSER. In: International Conference on Computational Intelligence, Communications, and Business Analytics, pp. 234–242. Springer, Singapore, July 2018
Panda, S., Ash, S., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Parameter tuning in MSER for text localization in multi-lingual camera-captured scene text images. In: Das A., Nayak J., Naik B., Pati S., Pelusi D. (eds.), Computational Intelligence in Pattern Recognition. Advances in Intelligent Systems and Computing, vol. 999. Springer, Singapore (2020)
Dutta, I.N., Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R.: Multi-lingual text localization from camera captured images based on foreground homogenity analysis. In: Recent Developments in Machine Learning and Data Analytics, pp. 149–158. Springer, Singapore (2019)
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2963–2970. IEEE, June 2010
Özgen, A.C., Fasounaki, M., Ekenel, H.K.: Text detection in natural and computer-generated images. In: 2018 26th Signal Processing and Communications Applications Conference (SIU), pp. 1–4. IEEE, May 2018
Agrawal, A., Mukherjee, P., Srivastava, S., Lall, B.: Enhanced characterness for text detection in the wild. In: Proceedings of 2nd International Conference on Computer Vision & Image Processing, pp. 359–369. Springer, Singapore (2018)
Gomez, L., Karatzas, D.: A fine-grained approach to scene text script identification. In: 2016 12th IAPR Workshop on Document Analysis Systems (DAS), pp. 192–197. IEEE, Apr 2016
Gomez, L., Nicolaou, A., Karatzas, D.: Improving patch-based scene text script identification with ensembles of conjoined networks. Pattern Recogn. 67(X), 85–96 (2017)
Crete, F., Dolmiere, T., Ladret, P., Nicolas, M.: The blur effect: perception and estimation with a new no-reference perceptual blur metric. In: Human vision and electronic imaging XII, vol. 6492, p. 64920I. International Society for Optics and Photonics, Feb 2007
Fish, D.A., Brinicombe, A.M., Pike, E.R., Walker, J.G.: Blind deconvolution by means of the Richardson-Lucy algorithm. JOSA A 12(1), 58–65 (1995)
Zuiderveld, K.: Contrast limited adaptive histogram equalization. In: Graphics Gems IV, pp. 474–485. Academic Press Professional, Inc, Aug 1994
Acknowledgements
This work is partially supported by the CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India, PURSE-II and UPE-II, project. SB is partially funded by DBT grant (BT/PR16356/BID/7/596/2016). RS, SB and AFM are partially funded by DST grant (EMR/2016/007213).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chakraborty, N., Mollah, A.F., Basu, S., Sarkar, R. (2021). A “Bright-on-Dark, Dark-on-Bright” Approach to Multi-lingual Scene Text Detection. In: Bhattacharjee, D., Kole, D.K., Dey, N., Basu, S., Plewczynski, D. (eds) Proceedings of International Conference on Frontiers in Computing and Systems. Advances in Intelligent Systems and Computing, vol 1255. Springer, Singapore. https://doi.org/10.1007/978-981-15-7834-2_18
Download citation
DOI: https://doi.org/10.1007/978-981-15-7834-2_18
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7833-5
Online ISBN: 978-981-15-7834-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)