Abstract
A system architecture is presented for document imaging in an open, distributed environment over networks, where various kinds of imaging devices can be interconnected remotely. The key components are two sets of image processing operations to transform input images to (1) canonical image representations to absorb different visual appearance due to characteristics of imaging devices or image acquisition conditions (normalization), and (2) optimal image representations according to tasks and preferences of individual users (optimization). Images captured through a diversity of input devices can be delivered to remote sites through networks, and then will be used for a variety of tasks such as printing on paper sheets, browsing on displays, and editing. These diversities can be resolved systematically by placing the normalizations at an upper end (routing servers) and the optimizations at a lower end (clients) of the data flow over networks. In view of this architecture, we describe some instances of the normalizations and optimizations associated with a particular task of highly legible printing of scanned document images. Three essential algorithms are mentioned for optimizing document images: adaptive tone mapping with background cleaning, text super-resolution, and text color clustering. The optimization process is mentioned for highly legible printing, along with some other potential applications and tasks.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Breuel, T.M., Janssen, W.C., Popat, K., Baird, H.S.: Paper to PDA. In: Proc.16th Int. Conf. Pattern Recognition, Quebec City, Quebec, Canada, vol. 1, pp. 476–479 (2002)
Hobby, J.D., Ho, T.K.: Enhancing degraded document images via bitmap clustering and averaging. In: Proc. 4th Int. Conf. Document Analysis and Recognition (August 1997)
ISO/IEC 15444-1, Information technology – JPEG 2000 image coding system – Part 1: Core coding system (2000)
ISO/IEC 15444-6, Information technology – JPEG 2000 image coding system – Part 6: Compound image file format (2003)
Jaisimha, M.Y., Riskin, E.A., Ladner, R., Werner, S.: Model-based restoration of document images for OCR. In: Proc. SPIE, vol. 2660, pp. 297–308 (1996)
Koskinen, L., Huttunen, H., Astola, J.T.: Text enhancement method based on soft morphological filters. In: Proc. SPIE, vol. 2181, pp. 243–253 (1994)
Li, H., Kia, O.E., Doermann, D.S.: Text enhancement in digital video. In: Proc. SPIE, vol. 3651, pp. 2–9 (1999)
Matas, J., Kittler, J.: Spatial and feature space clutering: applications in image analysis. In: Proc. Int. Conf. Computer Analysis of Images and Patterns, September 1995, pp. 162–173 (1995)
Nishida, H.: Adaptive inverse halftoning for scanned document images through multiresolution and multiscale analysis. Pattern Recognition 38(2), 251–260 (2005)
Nishida, H.: Restoring high-resolution binary images for text enhancement. In: Proc. Int. Conf. Image Processing, Genova, Italy, September 2005, vol. II, pp. 506–509 (2005)
Ramponi, G., Fontanot, P.: Enhancing document images with a quadratic filter. Signal Processing 33, 23–34 (1993)
Shin, Y.C., Sridhar, R., Demjanenko, V., Palumbo, P.W., Hull, J.J.: Contrast enhancement of mail piece images. In: Proc. SPIE, vol. 1661, pp. 27–37 (1992)
Sobottka, K., Kronenberg, H., Perroud, T., Bunke, H.: Text extraction from colored book and journal covers. Int. J. Document Analysis and Recognition 2, 163–176 (2000)
Thouin, P.D., Chang, C.-I.: A method for restoration of low-resolution document images. Int. J. Document Analysis and Recognition 2, 200–210 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nishida, H. (2006). Networked Document Imaging with Normalization and Optimization. In: Bunke, H., Spitz, A.L. (eds) Document Analysis Systems VII. DAS 2006. Lecture Notes in Computer Science, vol 3872. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11669487_5
Download citation
DOI: https://doi.org/10.1007/11669487_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32140-8
Online ISBN: 978-3-540-32157-6
eBook Packages: Computer ScienceComputer Science (R0)