Skip to main content

Analyzing the Content of Business Documents Recognized with a Large Number of Errors Using Modified Levenshtein Distance

  • Chapter
  • First Online:
Cyber-Physical Systems: Intelligent Models and Algorithms

Abstract

The chapter discusses methods for analyzing test material of business documents, lists tasks that use word comparison. The features of the analysis of recognized texts are indicated. A mechanism for identifying recognized words based on textual feature points is described. The advantages and disadvantages of Levenshtein position are listed. Other distances between string objects are described: Jaro-Winkler similarity, multiset metric, MFKC metric. The Levenshtein standard distance is compared to other distances between two string objects. A modification of the Levenshtein position is proposed focused on the features of the recognized characters. Experimental results are presented that demonstrate the effect of using the proposed distance in comparison with the normalized Levenshtein distances. The experiments investigate the extraction of data from the document and the classification of documents. We also compared the time spent on calculating the modified Levenshtein metric and the multiset metric. The proposed method can be applied in a modern CAD system in the recognition component to analyze the information of recognized text documents. Also, the method can be in the system of the analysis of the recognized text using the methods of computational linguistics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kravets, A.G., Salnikova, N.A., Shestopalova, E.L.: Development of a module for predictive modeling of technological development trends. Cyber Phys. Syst. 125–136 (2021). https://doi.org/10.1007/978-3-030-67892-0_11

  2. Sabitov, A., Minnikhanov, R., Dagaeva, M., Katasev, A., Asliamov, T.: Text classification in emergency calls management systems. Cyber Phys. Syst. 199–210 (2021). https://doi.org/10.1007/978-3-030-67892-0_17

  3. Deza, M.M., Deza, E.: Encyclopedia of distances. Springer-Verlag, Berlin (2009)

    Book  Google Scholar 

  4. Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, and reversals. Rep. USSR Acad. Sci. 163(4), 845–848 (1965)

    Google Scholar 

  5. Sankoff, D., Kruskal, J.: Review: time warps, string edits, and macromolecules: the theory and practice of sequence comparison. J. Log. Comput. 11(2), 356–356 (1983). https://doi.org/10.1093/logcom/11.2.356

    Article  Google Scholar 

  6. Yujian, L., Bo, L.: A normalized Levenshtein distance metric. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 1091–1095 (2007). https://doi.org/10.1109/TPAMI.2007.1078

    Article  Google Scholar 

  7. Moysset, B., Kermorvant, C., Wolf, C.: Learning to detect, localize and recognize many text objects in document images from few examples. IJDAR 21, 161–175 (2018). https://doi.org/10.1007/s10032-018-0305-2

    Article  Google Scholar 

  8. Nagy, G.: Document analysis systems that improve with use. IJDAR 23, 13–29 (2020). https://doi.org/10.1007/s10032-019-00344-x

    Article  Google Scholar 

  9. Rusiñol, M., Frinken, V., Karatzas, D., Bagdanov, A.D., Lladós, J.: Multimodal page classification inadministrative document image streams. IJDAR 17(4), 331–341 (2014)

    Google Scholar 

  10. Şecker, ŞE., Altun, O., Ayan, U., Mert, C.: A novel string distance function based on most frequent k characters. Int. J. Mach. Learn. Comput. 4(2), 177–183 (2014). https://doi.org/10.7763/IJMLC.2014.V4.408

    Article  Google Scholar 

  11. Petrovsky, A.B.: Metrics in multiset spaces. J. Intell. Fuzzy Syst. 36(4), 3073–3085 (2019). https://doi.org/10.3233/JIFS-18525

    Article  Google Scholar 

  12. Hjouji, A., EL-Mekkaoui, J., Jourhmane, M.: Image classification by mixed finite element method and orthogonal legendre moments. Pattern Recogn. Image Anal. 30, 655–673 (2020). https://doi.org/10.1134/S1054661820040185

  13. Karkishchenko, A.N., Mnukhin, V.B.: On the metric on images invariant with respect to the monotonic brightness transformation. Patt. Recogn. Image Anal. 30, 359–371 (2020). https://doi.org/10.1134/S1054661820030104

    Article  Google Scholar 

  14. Slavin, O.A.: Using special text points in the recognition of documents. In: Studies in Systems, Decision and Control, vol. 259, pp. 43–53. Springer Nature Switzerland AG (2020). https://doi.org/10.1007/978-3-030-32579-4_4

  15. Andreeva, E., Arlazarov, V.V., Slavin, O., Mishev, A.: Comparison of scanned administrative document images. In: Proceedings of SPIE, 2020: Twelfth International Conference on Machine Vision, ICMV 2019, vol. 11433, pp. 16–18. Amsterdam, Netherlands. (2019). https://doi.org/10.1117/12.2559369

  16. Schmid, C., Mohr, R.: Local gray value invariants for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 19(5), 530–535 (1997). https://doi.org/10.1109/34.589215

    Article  Google Scholar 

  17. Awal, A.M., Ghanmi, N., Sicre, R., Furon, T.: Complex document classification and localization application on identity document images. In: Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, pp. 427–432. (2017). https://doi.org/10.1109/ICDAR.2017.77

  18. Chernyshova, Y.S., Sheshkus, A.V., Arlazarov, V.V.: Two-step CNN framework for text line recognition in camera-captured images. IEEE Access. 8, 32587–32600 (2020). https://doi.org/10.1109/ACCESS.2020.2974051I

    Article  Google Scholar 

  19. Limonova, E.E., Neiman-zade, M.I., Arlazarov, V.L.: Special aspects of matrix operation implementations for low-precision neural network model on the elbrus platform. In: Bulletin of the South Ural State University. Ser. Mathematical Modelling, Programming & Computer Software (Bulletin SUSU MMCS), vol. 13(1), 118–128 (2020). https://doi.org/10.14529/mmp200109

  20. El-Kishky, A., Song, Y., Wang, C., Voss, C. R., Han, J.: Scalable topical phrase mining from text corpora. In: Proc. VLDB Endowment, vol. 8(3), pp. 305–316 (2014). https://doi.org/10.14778/2735508.2735519

  21. Liu, J., Shang, J., Wang, C., Ren, X., Han, J.: Mining quality phrases from massive text corpora. In: Proc. of the 2015 ACM SIGMOD International Conference on Management of Data—SIGMOD vol. 45, pp. 1729–1744. ACM, New York, NY, USA. (2015). https://doi.org/10.1145/2723372.2751523

  22. Limonova, E., Skoryukina, N., Neiman-zade, M.: Fast hamming distance computation for 2D art recognition on VLIW-architecture in case of Elbrus platform. In: Proc. SPIE, Eleventh International Conference on Machine Vision, vol. 11041. Art. ID: 110411N. (2018). https://doi.org/10.1117/12.2523101

Download references

Acknowledgements

The research is carried out with partial financial support of The Russian Foundation for Basic Research (Project 20-07-00934).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Oleg Slavin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Slavin, O., Farsobina, V., Myshev, A. (2022). Analyzing the Content of Business Documents Recognized with a Large Number of Errors Using Modified Levenshtein Distance. In: Kravets, A.G., Bolshakov, A.A., Shcherbakov, M. (eds) Cyber-Physical Systems: Intelligent Models and Algorithms. Studies in Systems, Decision and Control, vol 417. Springer, Cham. https://doi.org/10.1007/978-3-030-95116-0_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-95116-0_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-95115-3

  • Online ISBN: 978-3-030-95116-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics