Abstract
In recent decades, Scene Text Extraction and Recognition (STER) has remarkably attracted many researchers from the area of image processing and computer vision. Most of the methods discussed in this paper are subjected to evaluation and validation based on different feature properties such as size, orientation, color information, edges, texture property, connected components, and geometry of scene text. Combining different features like CNN and region-based methods together have gained importance and increase in performance of STER. Critical analysis of several state-of-the-art techniques along with benchmark datasets has been carried out with standard testing procedures and evaluation metrics in order to rationalize the decision of selecting an efficient algorithm for better and fast scene text extraction and recognition. Findings of this paper are discussed exclusively by specifying open area for future research in the field of STER system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
M.J. Roopa, K. Mahantesh, An impact of frequency domain filtering technique on text localization method useful for text reading from scene images, 2019 4th International Conference (ICEECCOT), Mysuru, India, 2019, pp. 37–43
S.V. Seeri, J.D. Pujari, P.S. Hiremath, Text localization and character extraction in natural scene images using contourlet transform and SVM classifier, I.J. Image, Graph. Signal Process., 36–42 (2016)
S.A. Ali, A.T. Hashim, Wavelet transform based technique for text image localization. Karbala Int. J. Mod. Sci., 138–144 (2016)
T. Kumuda, L. Basavaraj, Hybrid approach to extract text in natural scene images. Int. J. Comput. Appl. 6(4), 1614–1618 (2016)
A. Polzounov, A. Ablavatski, S. Escalera, S. Lu, J. Cai, Wordfence: text detection in natural images with border awareness, in IEEE International Conference on Image Processing, 2017, pp. 1222–1226
X. Zhou, C. Yao, H. Wen, Y. Wang, S. Zhou, W. He, J. Liang, EAST: an efficient and accurate scene text detector, in IEEE Conference on Computer Vision and Pattern Recognition, Proceedings—30th IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 2642–2651
P. He, W. Huang, Y. Qiao, C.C. Loy, X. Tang, Reading scene text in deep convolutional sequences, AAAI Conference on Artificial Intelligence, arXiv:1506.04395, 2016
A.K. Bhunia, G. Kumar, P.P. Roy, R. Balasubramanian, U. Pal, Text recognition in scene image and video frame using color channel selection. Multimed. Tools Appl. 77(7), 8551–8578 (2018)
M. Liao, B. Shi, X. Bai, X. Wang, W. Liu, TextBoxes: a fast text Detector with a single deep neural network, 31st AAAI Conference on Artificial Intelligence, 2016, pp. 4161–4167
V.V. Rampurkar, S.K. Shah, G.J. Chhajed, S.K. Biswash, An approach towards text detection from complex images using morphological techniques, in 2nd International Conference on Inventive Systems and Control, 2018, pp. 969–973
T. Im, D. Coelho, K. Mueller, P. De, Smartphone based approximate localization using user highlighted texts from images. Pervasive Mob. Comput. 46, 1–17 (2018)
Z. Zhang, C. Zhang, W. Shen, C. Yao, W. Liu, X. Bai, Multioriented text detection with fully convolutional networks, in Proc. IEEE conf. Computer Vision and Pattern Recognition, 2016, pp. 4159–4167
G.J. Ansari, J.H. Shah, M. Yasmin, M. Sharif, S.L. Fernandes, A novel machine learning approach for scene text extraction. Futur. Gener. Comput. Syst. 87, 328–340 (2018)
B. Shi, X. Wang, P. Lyu, C. Yao, X. Bai, Robust scene text recognition with automatic rectification, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2016, pp. 4168–4176
X. Zhang, X. Gao, C. Tian, Text detection in natural scene images based on color prior guided MSER. Neurocomputing, 61–71 (2018)
L.M. Francis, N. Sreenath, TEDLESS—text detection using least-square SVM from natural scene. J. King Saud University—Computer Inf. Sci., 287–299 (2020)
M. Liao, B. Shi, X. Bai, TextBoxes ++: a single-shot oriented scene text detector, in IEEE Trans. Image Process. 27, 3676–3690 (2018)
M. Busta, L. Neumann, J. Matas, Deep textspotter: an end-to-end trainable scene text localization and recognition framework, in IEEE International Conference on Computer Vision, 2017-October, 2017, pp. 2223–2231
W. He, X.Y. Zhang, F. Yin, C.L. Liu, Multi-oriented and multi-lingual scene text detection with direct regression. IEEE Trans. Image Process. 27(11), 5406–5419 (2018)
B. Shi, X. Bai, S. Belongie, Detecting oriented text in natural images by linking segments, in Proc. IEEE conf. Computer Vision and Pattern Recognition, 2017, pp. 2550–2558
V. Andreas, M. Tomas, N. Lukas, M. Jiri, B. Serge, COCO-text: dataset and benchmark for text detection and recognition in natural images, arXiv:1601.07140, 2016
Z. Tian, W. Huang, T. He, P. He, Y. Qiao, Detecting text in natural image with connectionist text proposal network, in Proc. European Conference on Computer Vision, 2016, pp. 56–72
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Roopa, M.J., Mahantesh, K. (2021). An Evaluation and Validation of Contemporary Approaches in Scene Text Extraction and Recognition. In: Bhateja, V., Satapathy, S.C., Travieso-González, C.M., Aradhya, V.N.M. (eds) Data Engineering and Intelligent Computing. Advances in Intelligent Systems and Computing, vol 1407. Springer, Singapore. https://doi.org/10.1007/978-981-16-0171-2_5
Download citation
DOI: https://doi.org/10.1007/978-981-16-0171-2_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0170-5
Online ISBN: 978-981-16-0171-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)