research-article

The Delaunay Document Layout Descriptor

Authors:
Sébastien Eskenazi

Université de La Rochelle, La Rochelle, France

Université de La Rochelle, La Rochelle, France
View Profile

,
Petra Gomez-Krämer

Université de La Rochelle, La Rochelle, France

Université de La Rochelle, La Rochelle, France
View Profile

,
Jean-Marc Ogier

Université de La Rochelle, La Rochelle, France

Université de La Rochelle, La Rochelle, France
View Profile

DocEng '15: Proceedings of the 2015 ACM Symposium on Document EngineeringSeptember 2015Pages 167–175https://doi.org/10.1145/2682571.2797059

Published:08 September 2015Publication History

DocEng '15: Proceedings of the 2015 ACM Symposium on Document Engineering

Pages 167–175

ABSTRACT

Security applications related to document authentication require an exact match between an authentic copy and the original of a document. This implies that the documents analysis algorithms that are used to compare two documents (original and copy) should provide the same output. This kind of algorithm includes the computation of layout descriptors from the segmentation result, as the layout of a document is a part of its semantic content. To this end, this paper presents a new layout descriptor that significantly improves the state of the art. The basic of this descriptor is the use of a Delaunay triangulation of the centroids of the document regions. This triangulation is seen as a graph and the adjacency matrix of the graph forms the descriptor. While most layout descriptors have a stability of 0% with regard to an exact match, our descriptor has a stability of 74% which can be brought up to 100% with the use of an appropriate matching algorithm. It also achieves 100% accuracy and retrieval in a document retrieval scheme on a database of 960 document images. Furthermore, this descriptor is extremely efficient as it performs a search in constant time with respect to the size of the document database and it reduces the size of the index of the database by a factor 400.

References

F. Álvaro. A shape-based layout descriptor for classifying spatial relationships in handwritten math. In Proc. of the 2013 symposium on Document engineering, pages 123--126. ACM, 2013. Google ScholarDigital Library
A. Antonacopoulos, D. Bridson, C. Papadopoulos, and S. Pletschacher. A realistic dataset for performance evaluation of document layout analysis. In Proc. of 10th International Conference on Document Analysis and Recognition (ICDAR), pages 296--300. IEEE, 2009. Google ScholarDigital Library
ANTS. Spécifications techniques des Codes à Barres 2D-Doc. Technical report, ANTS, 2013.Google Scholar
A. D. Bagdanov and M. Worring. First order Gaussian graphs for efficient structure classification. Pattern Recognition, 36:1311--1324, 2003.Google ScholarCross Ref
J. Bryson and P. Gallagher. Secure Hash Standard (SHS), 2012.Google Scholar
F. Cesarini, M. Lastri, S. Marinai, and G. Soda. Encoding of modified X-Y trees for document classification. Proc. of 6th International Conference on Document Analysis and Recognition (ICDAR), 2001. Google ScholarDigital Library
B. B. Chaudhuri. Digital document processing. major directions and recent advances. Springer, 2007. Google ScholarDigital Library
K. Chen, F. Yin, and C.-l. Liu. Hybrid page segmentation with efficient whitespace rectangles extraction and grouping. In Proc. of 12th International Conference on Document Analysis and Recognition (ICDAR), pages 958--962. IEEE, Aug. 2013. Google ScholarDigital Library
B. Coüasnon. DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way. In International Journal on Document Analysis and Recognition (IJDAR), volume 8, pages 111--122. Springer-Verlag, 2006.Google Scholar
Y. Deng and B. S. Manjunath. Unsupervised segmentation of color-texture regions in images and video. Pattern Analysis and Machine Intelligence (PAMI), 23(8):800--810, 2001. Google ScholarDigital Library
F. Esposito, D. Malerba, and G. Semeraro. Multistrategy learning for document recognition. Applied Artificial Intelligence an International Journal, 8(1):33--84, 1994.Google Scholar
A. Gordo and E. Valveny. A rotation invariant page layout descriptor for document classification and retrieval. In Proc. of the 10th International Conference on Document Analysis and Recognition (ICDAR), pages 481--485. IEEE, 2009. Google ScholarDigital Library
T. Kanungo, R. M. Haralick, and I. Phillips. Global and local document degradation models. In Proc. of 2nd International Conference on Document Analysis and Recognition (ICDAR), pages 730--734. IEEE, 1993.Google ScholarCross Ref
E. Kasutani and A. Yamada. The MPEG-7 color layout descriptor: a compact image feature description for high-speed image/video segment retrieval. In Proc. of 2001 International Conference on Image Processing (ICIP), volume 1. IEEE, 2001.Google ScholarCross Ref
K. Kise, A. Sato, and M. Iwata. Segmentation of page images using the area voronoi diagram. Computer Vision and Image Understanding, 70(3):370--382, June 1998. Google ScholarDigital Library
G. Leach and G. Leach. Improving worst-case optimal Delaunay triangulation algorithms. In Proc. of 4th Canadian Conference on Computational Geometry, pages 340--346, 1992.Google Scholar
J. L. J. Liang, D. Doermann, M. Ma, and J. Guo. Page classification through logical labelling. In Proc. of 16th International Conference on Pattern Recognition (ICPR), volume 3, pages 477--480. IEEE, 2002. Google ScholarDigital Library
A. Malvido Garcià. Secure Imprint Generated for Paper Documents (SIGNED). Technical Report December 2010, Bit Oceans, 2013.Google Scholar
T. Nakai, K. Kise, and M. Iwamura. Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval. Lecture Notes in Computer Science (LNCS), 3872:541--552, 2006. Google ScholarDigital Library
R. Rivest. The MD5 message-digest algorithm. Technical report, Internet activities board, 1992. Google ScholarDigital Library

Index Terms

The Delaunay Document Layout Descriptor
1. Applied computing

Recommendations

A Rotation Invariant Page Layout Descriptor for Document Classification and Retrieval
ICDAR '09: Proceedings of the 2009 10th International Conference on Document Analysis and Recognition

Document classification usually requieres of structural features such as the physical layout to obtain good accuracy rates on complex documents. This paper introduces a descriptor of the layout and a distance measure based on the cyclic Dynamic Time ...
Read More
Uyghur Printed Document Image Retrieval Based on SIFT Features

Image retrieval is an attractive topic in the field of information retrieval in electronic library and computer vision. This paper proposed a research in the field of Uyghur document image retrieval that using 128-dimensional SIFT features for Uyghur ...
Read More
Document Layout Analysis Based on Emergent Computation
ICDAR '97: Proceedings of the 4th International Conference on Document Analysis and Recognition

A new method of document layout analysis is proposed for a document reader to be used for reading a wide variety of documents. Emergent computation, which is a key concept of artificial life, is adopted to analyze various complex document structures. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
DocEng '15: Proceedings of the 2015 ACM Symposium on Document Engineering
September 2015
248 pages
ISBN:9781450333078
DOI:10.1145/2682571
General Chair:
Christine Vanoirbeek
EPFL, Switzerland
,
Program Chair:
Pierre Genevès
CNRS, France
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 September 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
classification
delaunay
hashing
layout descriptor
retrieval
stability
Qualifiers
- research-article
Conference

Acceptance Rates
DocEng '15 Paper Acceptance Rate11of31submissions,35%Overall Acceptance Rate178of537submissions,33%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 79
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

The Delaunay Document Layout Descriptor

DocEng '15: Proceedings of the 2015 ACM Symposium on Document Engineering

ABSTRACT

References

Cited By

Index Terms

Recommendations

A Rotation Invariant Page Layout Descriptor for Document Classification and Retrieval

Uyghur Printed Document Image Retrieval Based on SIFT Features

Document Layout Analysis Based on Emergent Computation