Skip to main content
Log in

Junction-based table detection in camera-captured document images

  • Original Paper
  • Published:
International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Abstract

In this paper, we present a method that locates tables and their cells in camera-captured document images. In order to deal with this problem in the presence of geometric and photometric distortions, we develop new junction detection and labeling methods, where junction detection means to find candidates for the corners of cells, and junction labeling is to infer their connectivity. We consider junctions as the intersections of curves, and so we first develop a multiple curve detection algorithm. After the junction detection, we encode the connectivity information (including false detection) between the junctions into 12 labels, and design a cost function reflecting pairwise relationships as well as local observations. The cost function is minimized via the belief propagation algorithm, and we can locate tables and their cells from the inferred labels. Also, in order to handle multiple tables in a single page, we propose a table area detection method. Our method is based on the well-known recursive X-Y cut, however, we modify the method so that we can also deal with curved seams caused by the geometric distortions. For the evaluation of our method, we build a data set that includes a variety of camera-captured table images and make the set publicly available. Experimental results on the set show that our method successfully locates tables and their cells in camera-captured images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  1. Arias, J.F., Kasturi, R.: Efficient extraction of primitives from line drawings composed of horizontal and vertical lines. Mach. Vis. Appl. 10(4), 214–221 (1997)

    Article  Google Scholar 

  2. Avidan, S., Shamir, A.: Seam carving for content-aware image resizing. ACM Trans. Graph. 26(3):10 (2007)

  3. Brown, M.S., Seales, W.B.: Image restoration of arbitrarily warped documents. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1295–1306 (2004)

    Article  Google Scholar 

  4. Cao, H., Ding, X., Liu, C.: A cylindrical surface model to rectify the bound document image. In: IEEE International Conference on Computer Vision, pp. 228–233 (2003)

  5. Cesarini, F., Marinai, S., Sarti, L., Soda, G.: Trainable table location in document images. In: International Conference on Pattern Recognition, pp. 236–240 (2002)

  6. Gatos, B., Danatsas, D., Pratikakis, I., Perantonis, S.J.: Automatic table detection in document images. In: Pattern Recognition and Data Mining, pp. 609–618 (2005)

  7. Ha, J., Haralick, R., Phillips, I.: Recursive x-y cut using bounding boxes of connected components. In: International Conference on Document Analysis and Recognition, pp. 952–955 (1995)

  8. Hartley, R.I., Zisserman, A.: Multiple View Geometry in Computer Vision. Cambridge University Press, Cambridge (2000)

    MATH  Google Scholar 

  9. Hu, J., Kashi, R.S., Lopresti, D.P., Wilfong, G.: Medium-independent table detection. SPIE Doc. Recognit. Retr. VII 3967, 291–302 (2000)

    Google Scholar 

  10. Kieninger, T., Dengel, A.: Applying the t-recs table recognition system to the business letter domain. In: IEEE International Conference on Document Analysis and Recognition, pp. 518–522 (2001)

  11. Kieninger, T.G.: Table structure recognition based on robust block segmentation. In: Photonics West’98 Electronic Imaging, pp. 22–32 (1998)

  12. Koo, H.I.: Segmentation and rectification of pictures in the camera-captured images of printed documents. IEEE Trans. Multimed. 15(3), 647–660 (2013)

    Article  MathSciNet  Google Scholar 

  13. Koo, H.I., Cho, N.I.: State estimation in a document image and its application in text block identification and text line extraction. In: European Conference on Computer Vision, pp. 421–434 (2010)

  14. Koo, H.I., Kim, J., Cho, N.I.: Composition of a dewarped and enhanced document image from two view images. IEEE Trans. Image Process. 18(7), 1551–1562 (2009)

    Article  MathSciNet  Google Scholar 

  15. Liang, J., DeMenthon, D., Doermann, D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30(4), 591–605 (2008)

    Article  Google Scholar 

  16. Mandal, S., Chowdhury, S., Das, A.K., Chanda, B.: A simple and effective table detection system from document images. Int. J. Doc. Anal. Recognit. 8(2–3), 172–182 (2006)

  17. Matas, J., Galambos, C., Kittler, J.: Robust detection of lines using the progressive probabilistic hough transform. Comput. Vis. Image Underst. 78(1), 119–137 (2000)

    Article  Google Scholar 

  18. Nagy, G., Seth, S.: Hierarchical representation of optically scanned documents. In: International Conference on Pattern Recognition, pp. 347–349 (1984)

  19. Pereira Neves, L., Facon, J.: Methodology of automatic extraction of table-form cells. In: XIII Brazilian Symposium on Computer Graphics and Image Processing, pp. 15–21 (2000)

  20. Pilu, M., Pollard, S.: A light-weight text image processing method for handheld embedded cameras. In: British Machine Vision Conference, pp. 547–556 (2002)

  21. Shafait, F.: Document image dewarping contest. In: 2nd International Workshop on Camera-Based Document Analysis and Recognition, pp. 181–188 (2007)

  22. Shafait, F., Smith, R.: Table detection in heterogeneous documents. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, pp. 65–72 (2010)

  23. Stamatopoulos, N., Gatos, B., Pratikakis, I., Perantonis, S.: Goal-oriented rectification of camera-based document images. IEEE Trans. Image Process. 20(4), 910–920 (2011)

    Article  MathSciNet  Google Scholar 

  24. Szeliski, R., Zabih, R., Scharstein, D., Veksler, O., Kolmogorov, V., Agarwala, A., Tappen, M., Rother, C.: A comparative study of energy minimization methods for markov random fields. In: European Conference on Computer Vision, pp. 16–29 (2006)

  25. Tappen, M.F., Freeman, W.T.: Comparison of graph cuts with belief propagation for stereo, using identical mrf parameters. In: IEEE International Conference on Computer Vision, pp. 900–906 (2003)

  26. Taylor, S.L., Fritzson, R., Pastor, J.A.: Extraction of data from preprinted forms. Mach. Vis. Appl. 5(3), 211–222 (1992)

    Article  Google Scholar 

  27. Tian, Y., Narasimhan, S.: Rectification and 3d reconstruction of curved document images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 377–384 (2011)

  28. Yamashita, A., Kawarago, A., Kaneko, T., Miura, K.: Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system. In: International Conference on Pattern Recognition, pp. 482–485 (2004)

  29. Zanibbi, R., Blostein, D., Cordy, R.: A survey of table recognition. Int. J. Doc. Anal. Recognit. 7(1), 1–16 (2004)

    Article  Google Scholar 

  30. Zhang, Z., Liang, X., Ma, Y.: Unwrapping low-rank textures on generalized cylindrical surfaces. In: IEEE International Conference on Computer Vision, pp. 1347–1354 (2011)

  31. Zheng, Y., Liu, C., Ding, X., Pan, S.: Form frame line detection with directional single-connected chain. In: IEEE International Conference on Document Analysis and Recognition, pp. 699–703 (2001)

  32. Saabni, R., El-Sana, J.: Language-Independent Text Lines Extraction Using Seam Carving. In: IEEE International Conference on Document Analysis and Recognition, pp. 563–568 (2011)

Download references

Acknowledgments

This work was supported by the Ministry of Science, ICT and Future Planning, Korea, through the Information Technology Research Center support Program supervised by the National IT Industry Promotion Agency under Grant NIPA-2014-H0301-14-1019.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hyung Il Koo.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Seo, W., Koo, H.I. & Cho, N.I. Junction-based table detection in camera-captured document images. IJDAR 18, 47–57 (2015). https://doi.org/10.1007/s10032-014-0226-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-014-0226-7

Keywords

Navigation