A computational model for recognition of multifont word images

Ho, Tin Kam; Hull, Jonathan J.; Srihari, Sargur N.

doi:10.1007/BF02626995

A computational model for recognition of multifont word images

Published: June 1992

Volume 5, pages 157–168, (1992)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Machine Vision and Applications Aims and scope Submit manuscript

A computational model for recognition of multifont word images

Download PDF

Tin Kam Ho Ph.D.¹,
Jonathan J. Hull¹ &
Sargur N. Srihari¹

61 Accesses
17 Citations
6 Altmetric
Explore all metrics

Abstract

A computational model for the recognition of multifont machine-printed word images of highly variable quality is given. The model integrates three word-recognition algorithms, each of which utilizes a different form of shape and context information. The approaches are character-recognition-based, segmentation-based, and word-shape-analysis based. The model overcomes limitations of previous solutions that focus on isolated characters. In an experiment using a lexicon of 33,850 words and a test set of 1,671 highly variable word images, the algorithm achieved a correct rate of 89% at the top choice and 95% in the top ten choices.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Agresti A (1990) Categorical data analysis. Wiley
Aho AV, Kernighan BW, Weinberger PJ (1980) Awk—a pattern scanning and processing language (2nd ed). In: Unix User’s Manual, Supplementary Documents, Regents of the University of California
Baird HS, Graf HP, Jackel LD, Hubbard WE (1989) A VLSI architecture for binary image classification. In: Simon JC (ed) From pixels to features. North-Holland, pp 275–286
Black D (1963) The theory of committees and elections, second edition. Cambridge University Press, London
Google Scholar
Bledsoe WW, Browning I (1959) Pattern recognition and reading by machine. Proceedings of the Eastern Joint Computer Conference 16:225–232
Google Scholar
Casey RG, Nagy G (1982) Recursive segmentation and classification of composite character patterns. Proceedings of the 6th ICPR, Munich, pp 1023–1026
Duda RO, Hart PE (1973) Pattern classification and scene analysis. Addison-Wesley, New York
MATH Google Scholar
Elliman DG, Lancaster IT (1990) A review of segmentation and contextual analysis techniques for text recognition. Pattern Recognition 23(3/4):337–346
Article Google Scholar
Ho TK, Hull JJ, Srihari SN (1990a) Combination of structural classifiers. Pre-Proceedings of the IAPR Syntactic and Structural Pattern Recognition Workshop, New Jersey, June 13–15, pp 123–136
Ho TK, Hull JJ, Srihari SN (1990b) A word shape analysis approach to recognition of degraded word images. Proceedings of the Fourth USPS Advanced Technology Conference, pp 217–231
Ho TK (1992) A theory of multiple classifier systems and its application to visual word recognition. Doctoral Dissertation, SUNY at Buffalo, Department of Computer Science
Hull JJ, Srihari SN (1982) Experiments in text recognition with binary n-gram and Viterbi algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-4(5):520–530
Google Scholar
Hull JJ (1987) Hypothesis testing in a computational theory of visual word recognition. Proceedings of the Sixth National Conference on Artificial Intelligence (AAAI), Seattle, Washington, pp 718–722
Hull JJ (1988) A computational theory of visual word recognition. Doctoral Dissertation, SUNY at Buffalo, Department of Computer Science
Mantas J (1986) An overview of character recognition methodologies. Pattern Recognition 19(6):425–430
Article Google Scholar
McClelland JL, Rumelhart DE (1981) An interactive activation model of context effects in letter perception: Part 1. An account of the basic findings. Psychological Review 88(5):375–407
Article Google Scholar
Mori S, Yamamoto K, Yasuda M (1984) Research on machine recognition of handprinted characters. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-6(4):386–405
Google Scholar
Riseman EM, Ehrich RW (1974) A contextual postprocessing system for error correction using binary n-grams. IEEE Transactions on Computers, C-23(5):480–493
Google Scholar
Rosenbaum WS, Hilliard JJ (1975) Multifont OCR post-processing system. IBM Journal of Research and Development 19:398–421
Article Google Scholar
Rumelhart DE, McClelland JL (1982) An interactive activation model of context effects in letter perception: part 2. The contextual enhancement effect and some tests and extensions of the model. Psychological Review 89(1):60–94
Article Google Scholar
Schuermann J (1978) A multifont word recognition system for postal address reading. IEEE Transactions on Computers, C-27(8):721–732
Google Scholar
Shinghal R, Toussaint GT (1979) Experiments in text recognition with the modified Viterbi algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-1(2):184–193
Article Google Scholar
Tsuji Y, Asai K (1984) Character image segmentation. SPIE Proceedings on Applications of Digital Image Processing VII, 405:2–9
Google Scholar
Wagner RA, Fischer MJ (1974) The string to string correction problem. Journal of ACM 21(1):168–173
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Center for Document Analysis and Recognition, State University of New York at Buffalo, 226 Bell Hall, 14260, Buffalo, NY, USA
Tin Kam Ho Ph.D., Jonathan J. Hull & Sargur N. Srihari

Authors

Tin Kam Ho Ph.D.
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan J. Hull
View author publications
You can also search for this author in PubMed Google Scholar
Sargur N. Srihari
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ho, T.K., Hull, J.J. & Srihari, S.N. A computational model for recognition of multifont word images. Machine Vis. Apps. 5, 157–168 (1992). https://doi.org/10.1007/BF02626995

Download citation

Issue Date: June 1992
DOI: https://doi.org/10.1007/BF02626995

Key Words

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A computational model for recognition of multifont word images

Abstract

Article PDF

Similar content being viewed by others

Multi-font Telugu Text Recognition Using Hidden Markov Models and Akshara Bi-grams

Lexicon-based probabilistic indexing of handwritten text images

Recognition of Off-line Handwritten Uyghur Words Using Bayesian Networks with Grapheme Nodes

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Key Words

Navigation

A computational model for recognition of multifont word images

Abstract

Article PDF

Similar content being viewed by others

Multi-font Telugu Text Recognition Using Hidden Markov Models and Akshara Bi-grams

Lexicon-based probabilistic indexing of handwritten text images

Recognition of Off-line Handwritten Uyghur Words Using Bayesian Networks with Grapheme Nodes

Explore related subjects

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Key Words

Search

Navigation