ABSTRACT
Security applications related to document authentication require an exact match between an authentic copy and the original of a document. This implies that the document analysis algorithms used to compare two documents (original and copy) should produce the same output for both. Such algorithms include the computation of layout descriptors from the segmentation result, as the layout of a document is part of its semantic content. To this end, this paper presents a new layout descriptor that significantly improves on the state of the art. The basis of this descriptor is a Delaunay triangulation of the centroids of the document regions. This triangulation is treated as a graph, and the adjacency matrix of the graph forms the descriptor. While most layout descriptors have a stability of 0% with regard to an exact match, our descriptor has a stability of 74%, which can be brought up to 100% with the use of an appropriate matching algorithm. It also achieves 100% accuracy and retrieval in a document retrieval scheme on a database of 960 document images. Furthermore, this descriptor is extremely efficient: it performs a search in constant time with respect to the size of the document database, and it reduces the size of the database index by a factor of 400.
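The construction described in the abstract (centroids, Delaunay triangulation, adjacency matrix) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the region centroids are already available as integer pixel coordinates, uses a brute-force empty-circumcircle test in place of an optimized triangulation algorithm, and orders regions top to bottom and then left to right so that the matrix rows are comparable between documents. That ordering, the SHA-256 key, and all function names (`layout_descriptor`, `layout_key`, and so on) are assumptions made for illustration only.

```python
from itertools import combinations
import hashlib


def orientation(a, b, c):
    """Twice the signed area of triangle abc (positive if counter-clockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def in_circumcircle(a, b, c, p):
    """True if p lies strictly inside the circle through a, b and c."""
    if orientation(a, b, c) < 0:  # make the triangle counter-clockwise
        b, c = c, b
    ax, ay = a[0] - p[0], a[1] - p[1]
    bx, by = b[0] - p[0], b[1] - p[1]
    cx, cy = c[0] - p[0], c[1] - p[1]
    det = ((ax * ax + ay * ay) * (bx * cy - cx * by)
           - (bx * bx + by * by) * (ax * cy - cx * ay)
           + (cx * cx + cy * cy) * (ax * by - bx * ay))
    return det > 0


def delaunay_edges(points):
    """Brute-force Delaunay edges: keep triangles with an empty circumcircle."""
    edges = set()
    for i, j, k in combinations(range(len(points)), 3):
        a, b, c = points[i], points[j], points[k]
        if orientation(a, b, c) == 0:
            continue  # collinear triple, no circumcircle
        others = (p for m, p in enumerate(points) if m not in (i, j, k))
        if not any(in_circumcircle(a, b, c, p) for p in others):
            edges.update({(i, j), (i, k), (j, k)})  # i < j < k by construction
    return edges


def layout_descriptor(centroids):
    """Adjacency matrix of the Delaunay graph, as a tuple of 0/1 rows."""
    # Assumption: regions are ordered top-to-bottom, then left-to-right.
    points = sorted(centroids, key=lambda p: (p[1], p[0]))
    edges = delaunay_edges(points)
    n = len(points)
    return tuple(
        tuple(1 if (min(r, c), max(r, c)) in edges else 0 for c in range(n))
        for r in range(n)
    )


def layout_key(descriptor):
    """Hypothetical index key: SHA-256 of the flattened adjacency matrix."""
    bits = "".join(str(v) for row in descriptor for v in row)
    return hashlib.sha256(bits.encode("ascii")).hexdigest()


# Centroids of five regions in an original and in a slightly shifted copy.
original = [(10, 10), (90, 12), (50, 50), (12, 95), (88, 90)]
copy = [(11, 9), (90, 13), (51, 50), (12, 94), (87, 91)]

# A dictionary keyed by the hash gives average constant-time lookup.
index = {layout_key(layout_descriptor(original)): ["doc-001"]}
match = index.get(layout_key(layout_descriptor(copy)), [])
print(match)
```

Because the descriptor records only which centroids are neighbours, small shifts in centroid position leave it unchanged as long as the triangulation and the region ordering stay the same. A dictionary keyed by a hash of the matrix is one plausible way to obtain a lookup whose cost does not grow with the number of indexed documents.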
Index Terms
- The Delaunay Document Layout Descriptor