research-article

Content directed enhancement of degraded document images

Authors:
Sangeet Aggarwal

IIT Delhi, India

IIT Delhi, India
View Profile

,
Sanjeev Kumar

IIT Delhi, India

IIT Delhi, India
View Profile

,
Ritu Garg

IIT Delhi, India

IIT Delhi, India
View Profile

,
Santanu Chaudhury

IIT Delhi, India

IIT Delhi, India
View Profile

DAR '12: Proceeding of the workshop on Document Analysis and RecognitionDecember 2012Pages 55–61https://doi.org/10.1145/2432553.2432564

Published:16 December 2012Publication History

DAR '12: Proceeding of the workshop on Document Analysis and Recognition

Pages 55–61

ABSTRACT

Most of the document pre-processing techniques are parameter dependent. In this paper, we present a novel framework that learns optimal parameters, depending on the nature of the document image content for binarization and text/graphics segmentation. The learning problem has been formulated as an optimization problem using EM algorithm to adaptively learn optimal parameters. Experimental results have established the effectiveness of our approach.

References

J. Banerjee, A. M. Namboodiri, and C. V. Jawahar. Contextual restoration of severely degraded document images. In CVPR, pages 517--524. IEEE, 2009.Google ScholarCross Ref
K. C. Fan, C. H. Liu, and Y. K. Wang. Segmentation and classification of mixed text/graphics/image documents. Pattern Recognition Letters, 15(12): 1201--1209, 1994. Google ScholarDigital Library
R. Cao and C. L. Tan. Text/graphics separation in maps. In Fourth International Workshop on Graphics Recognition Algorithms and Applications, pages 167--177, London, UK, UK, 2002. Springer-Verlag. Google ScholarDigital Library
S. Chowdhury, S. Mandal, A. Das, and B. Chanda. Segmentation of text and graphics from document images. In Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02, pages 619--623, Washington, DC, USA, 2007. IEEE Computer Society. Google ScholarDigital Library
L. A. Fletcher and R. Kasturi. A robust algorithm for text string separation from mixed text/graphics images. IEEE Transaction Pattern Analysis Machine Intelligence, 10(6): 910--918, 1988. Google ScholarDigital Library
B. Gatos, I. Pratikakis, and S. J. Perantonis. Adaptive degraded document image binarization. Pattern Recognition, 39: 317--327, 2006. Google ScholarDigital Library
A. K. Jain and S. Bhattacharjee. Texture segmentation using gabor filters for automatic document processing. Machine Vision and Application, 5: 169--184, 1992. Google ScholarDigital Library
N. Journet, V. Eglin, J. Ramel, and R. Mullot. Text/graphic labelling of ancient printed documents. In Proceedings of International Conference on Document Analysis and Recognition, volume 2, pages 1010--1014, August 2005. Google ScholarDigital Library
S. Kumar, R. Gupta, N. Khanna, S. Chaudhury, and S. D. Joshi. Text extraction and document image segmentation using matched wavelets and mrf model. IEEE Transactions of Image Processing, 16: 2117--2128, August 2007. Google ScholarDigital Library
W. Niblack. An Introduction to Digital Image Processing. Strandberg Publishing Company, 1985. Google ScholarDigital Library
N. Otsu. A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man and Cybernetics, 9: 62--66, 1979.Google ScholarCross Ref
P. P. Roy, J. Llados, and U. Pal. Text/graphics separation in color maps. In Proceedings of the International Conference on Computing: Theory and Applications, pages 545--551, Washington, DC, USA, 2007. IEEE Computer Society. Google ScholarDigital Library
J. Sauvola and M. Pietikainen. Adaptive document image binarization. Pattern Recognition, 33: 225--236, 2000.Google ScholarCross Ref
G. Sharma, R. Garg, and S. Chaudhury. Curvature feature distribution based classification of indian scripts from document images. In Proceedings of the International Workshop on Multilingual OCR, pages 3:1--3:6, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
C. L. Tan and P. O. Ng. Text extraction using pyramid. Pattern Recognition, 31: 63--72, 1998.Google ScholarCross Ref
K. Tombre, S. Tabbone, L. Pélissier, B. Lamiroy, and P. Dosch. Text/graphics separation revisited. In Proceedings of the 5th International Workshop on Document Analysis Systems V, pages 200--211, London, UK, UK, 2002. Springer-Verlag. Google ScholarDigital Library
F. M. Wahl, K. Y. Wong, and R. G. Casey. Block segmentation and text extraction in mixed text/image documents. In Computer Graphics and Image Processing, volume 20, pages 375--390, 1982.Google Scholar
H. Yan. Unified formulation of a class of image thresholding techniques. Pattern Recognition, 29: 2025--2032, 1996.Google ScholarCross Ref

Index Terms

Content directed enhancement of degraded document images
1. Applied computing
  1. Document management and text processing
    1. Document capture
      1. Optical character recognition
2. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
  2. Computer graphics
    1. Image manipulation

Recommendations

Broken and degraded document images binarization

Document image binarization refers to the conversion of a document image into a binary image. For broken and severely degraded document images, binarization is a very challenging process. Unlike the traditional methods that separate the foreground from ...
Read More
Binarization of degraded document images based on contrast enhancement

Because of the different types of document degradation such as uneven illumination, image contrast variation, blur caused by humidity, and bleed-through, degraded document image binarization is still an enormous challenge. This paper presents a new ...
Read More
Parameter-free based two-stage method for binarizing degraded document images

Binarization plays an important role in document image processing, especially in degraded documents. For degraded document images, adaptive binarization methods often incorporate local information to determine the binarization threshold for each ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
DAR '12: Proceeding of the workshop on Document Analysis and Recognition
December 2012
162 pages
ISBN:9781450317979
DOI:10.1145/2432553
Program Chairs:
A. G. Ramakrishnan
IISc, Bangalore
,
Sitaram Ramachandrula
HP Labs, Bangalore
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 16 December 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
document binarization
parameter estimation
text/graphic separation
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 102
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Content directed enhancement of degraded document images

DAR '12: Proceeding of the workshop on Document Analysis and Recognition

ABSTRACT

References

Cited By

Index Terms

Recommendations

Broken and degraded document images binarization

Binarization of degraded document images based on contrast enhancement

Parameter-free based two-stage method for binarizing degraded document images