skip to main content
10.1145/2682571.2797059acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
research-article

The Delaunay Document Layout Descriptor

Published:08 September 2015Publication History

ABSTRACT

Security applications related to document authentication require an exact match between an authentic copy and the original of a document. This implies that the documents analysis algorithms that are used to compare two documents (original and copy) should provide the same output. This kind of algorithm includes the computation of layout descriptors from the segmentation result, as the layout of a document is a part of its semantic content. To this end, this paper presents a new layout descriptor that significantly improves the state of the art. The basic of this descriptor is the use of a Delaunay triangulation of the centroids of the document regions. This triangulation is seen as a graph and the adjacency matrix of the graph forms the descriptor. While most layout descriptors have a stability of 0% with regard to an exact match, our descriptor has a stability of 74% which can be brought up to 100% with the use of an appropriate matching algorithm. It also achieves 100% accuracy and retrieval in a document retrieval scheme on a database of 960 document images. Furthermore, this descriptor is extremely efficient as it performs a search in constant time with respect to the size of the document database and it reduces the size of the index of the database by a factor 400.

References

  1. F. Álvaro. A shape-based layout descriptor for classifying spatial relationships in handwritten math. In Proc. of the 2013 symposium on Document engineering, pages 123--126. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Antonacopoulos, D. Bridson, C. Papadopoulos, and S. Pletschacher. A realistic dataset for performance evaluation of document layout analysis. In Proc. of 10th International Conference on Document Analysis and Recognition (ICDAR), pages 296--300. IEEE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. ANTS. Spécifications techniques des Codes à Barres 2D-Doc. Technical report, ANTS, 2013.Google ScholarGoogle Scholar
  4. A. D. Bagdanov and M. Worring. First order Gaussian graphs for efficient structure classification. Pattern Recognition, 36:1311--1324, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  5. J. Bryson and P. Gallagher. Secure Hash Standard (SHS), 2012.Google ScholarGoogle Scholar
  6. F. Cesarini, M. Lastri, S. Marinai, and G. Soda. Encoding of modified X-Y trees for document classification. Proc. of 6th International Conference on Document Analysis and Recognition (ICDAR), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. B. B. Chaudhuri. Digital document processing. major directions and recent advances. Springer, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. K. Chen, F. Yin, and C.-l. Liu. Hybrid page segmentation with efficient whitespace rectangles extraction and grouping. In Proc. of 12th International Conference on Document Analysis and Recognition (ICDAR), pages 958--962. IEEE, Aug. 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. B. Coüasnon. DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way. In International Journal on Document Analysis and Recognition (IJDAR), volume 8, pages 111--122. Springer-Verlag, 2006.Google ScholarGoogle Scholar
  10. Y. Deng and B. S. Manjunath. Unsupervised segmentation of color-texture regions in images and video. Pattern Analysis and Machine Intelligence (PAMI), 23(8):800--810, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. F. Esposito, D. Malerba, and G. Semeraro. Multistrategy learning for document recognition. Applied Artificial Intelligence an International Journal, 8(1):33--84, 1994.Google ScholarGoogle Scholar
  12. A. Gordo and E. Valveny. A rotation invariant page layout descriptor for document classification and retrieval. In Proc. of the 10th International Conference on Document Analysis and Recognition (ICDAR), pages 481--485. IEEE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. Kanungo, R. M. Haralick, and I. Phillips. Global and local document degradation models. In Proc. of 2nd International Conference on Document Analysis and Recognition (ICDAR), pages 730--734. IEEE, 1993.Google ScholarGoogle ScholarCross RefCross Ref
  14. E. Kasutani and A. Yamada. The MPEG-7 color layout descriptor: a compact image feature description for high-speed image/video segment retrieval. In Proc. of 2001 International Conference on Image Processing (ICIP), volume 1. IEEE, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  15. K. Kise, A. Sato, and M. Iwata. Segmentation of page images using the area voronoi diagram. Computer Vision and Image Understanding, 70(3):370--382, June 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. G. Leach and G. Leach. Improving worst-case optimal Delaunay triangulation algorithms. In Proc. of 4th Canadian Conference on Computational Geometry, pages 340--346, 1992.Google ScholarGoogle Scholar
  17. J. L. J. Liang, D. Doermann, M. Ma, and J. Guo. Page classification through logical labelling. In Proc. of 16th International Conference on Pattern Recognition (ICPR), volume 3, pages 477--480. IEEE, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Malvido Garcià. Secure Imprint Generated for Paper Documents (SIGNED). Technical Report December 2010, Bit Oceans, 2013.Google ScholarGoogle Scholar
  19. T. Nakai, K. Kise, and M. Iwamura. Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval. Lecture Notes in Computer Science (LNCS), 3872:541--552, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. Rivest. The MD5 message-digest algorithm. Technical report, Internet activities board, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The Delaunay Document Layout Descriptor

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      DocEng '15: Proceedings of the 2015 ACM Symposium on Document Engineering
      September 2015
      248 pages
      ISBN:9781450333078
      DOI:10.1145/2682571

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 September 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      DocEng '15 Paper Acceptance Rate11of31submissions,35%Overall Acceptance Rate178of537submissions,33%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader