Skip to main content

Quantitative Analysis of Deep CNNs for Multilingual Handwritten Digit Recognition

  • Conference paper
  • First Online:
Proceedings of International Conference on Trends in Computational and Cognitive Engineering

Abstract

Indian subcontinent is a birthplace of multilingual people, where documents such as job application form, passport, number plate identification, and so forth are composed of text contents written in different languages or scripts. These scripts consist of different Indic numerals in a single document page. Recently, deep convolutional neural networks (CNN) have achieved favorable result in computer vision problems, especially in recognizing handwritten digits but most of the works focuses on only one language, i.e., English or Hindi or Bangla, etc. However, developing a language-invariant method is very important as we live in a global village now. In this work, we have examined the performance of the ten state-of-the-art deep CNN methods for the recognition of handwritten digits using four most common languages in the Indian sub-continent that creates the foundation of a script invariant handwritten digit recognition system. Among the deep CNNs, Inception-v4 performs the best based on accuracy and computation time. Besides, it discusses the limitations of existing techniques and shows future research directions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Pal, U., Chaudhuri, B.B.: Indian script character recognition: a survey. Pattern Recogn. 37(9), 1887–1899 (2004)

    Article  Google Scholar 

  2. Lopez, B., Nguyen, M.A., Walia, A.: Modified mnist (2019)

    Google Scholar 

  3. Chaudhary, M., Mirja, M.H., Mittal, N.: Hindi numeral recognition using neural network. Int. J. Sci. Eng. Res. 5(6), 260–268 (2014)

    Google Scholar 

  4. Pal, U., Chaudhuri, B.B.: Automatic recognition of unconstrained off-line Bangla handwritten numerals. In: International Conference on Multimodal Interfaces. Springer, Berlin, Heidelberg (2000)

    Google Scholar 

  5. Pal, U., Wakabayashi, T., Kimura, F.: A system for off-line Oriya handwritten character recognition using curvature feature. In: 10th International Conference on Information Technology (ICIT), pp. 227–229 (2007)

    Google Scholar 

  6. Bag, S., Chawpatnaik, G.: A modified parallel thinning method for handwritten Oriya character images. In: Proceedings of the 4th International Conference on Frontiers in Intelligent Computing: Theory and Applications (FICTA) 2015. Springer, New Delhi (2016)

    Google Scholar 

  7. Jena, O.P., Pradhan, S.K., Biswal, P.K., Tripathy, A.R.: Odia characters and numerals recognition using hopfield neural network based on Zoning features. Int. J. Recent Technol. Eng. 8(2), 4928–4937 (2019)

    Article  Google Scholar 

  8. Pattanayak, Sudha, S., Pradhan, S.K., Mallik, R.C.: Printed Odia symbols for character recognition: a database study. In: Advanced Computing and Intelligent Engineering, pp. 297–397. Springer, Singapore (2020)

    Google Scholar 

  9. Vohra, U.S., Dwivedi, S.P., Mandoria, H.L.: Study and analysis of multilingual hand written characters recognition using SVM classifier. Oriental J. Comput. Sci. Technol. 9(2), 109–114 (2016)

    Article  Google Scholar 

  10. Reddy, R., Kumar, V., Babu, U.R.: Handwritten Hindi character recognition using deep learning techniques. Department of CSE, Acharya Nagarjuna University, Guntur, India. Int. J. Comput. Sci. Eng. (2019)

    Google Scholar 

  11. Fujisawa, H.: Forty years of research in character and document recognition—an industrial perspective. Pattern Recogn. 41(8), 2435–2446 (2008)

    Article  Google Scholar 

  12. Yadav, C., Bottou, L.: Cold case: the lost mnist digits. In: Advances in Neural Information Processing Systems (2019)

    Google Scholar 

  13. Baldominos, Alejandro, Saez, Y., and Pedro I.: A survey of handwritten character recognition with mnist and emnist. Applied Sciences 9.15, 3169, (2019).

    Google Scholar 

  14. .Alvear-Sandoval, R.F., Sancho-Gómez, J.L., Figueiras-Vidal, A.R.: On improving CNNs performance: the case of MNIST. Information Fusion. 52, 106–109 (2019)

    Google Scholar 

  15. Shamsuddin, M., Razif, S., Abdul-Rahman, Mohamed, A.: Exploratory analysis of MNIST handwritten digit for machine learning modelling. In: International Conference on Soft Computing in Data Science. Springer, Singapore (2018)

    Google Scholar 

  16. Alom, M., Sidike, P., Taha, T., Asari, V.: Handwritten Bangla Digit Recognition Using Deep Learning. ArXiv abs/1705.02680 (2017)

    Google Scholar 

  17. Rabby, Azad, AKM.S.: et al.: Bangla handwritten digit recognition using convolutional neural network. In: Emerging Technologies in Data Mining and Information Security, pp. 111–112. Springer, Singapore (2019)

    Google Scholar 

  18. Sufian, A., et al.: Bdnet: Bengali handwritten numeral digit recognition based on densely connected convolutional neural networks. J. King Saud Univ.-Comput. Inf. Sci. (2020)

    Google Scholar 

  19. Maity, S., et al.: Handwritten Bengali character recognition using deep convolution neural network. In: International Conference on Machine Learning, Image Processing, Network Security and Data Sciences. Springer, Singapore (2020)

    Google Scholar 

  20. Pias, M., Mutasim, A.K., Amin, M.A.: Bangladeshi number plate detection: cascade learning versus deep learning. In: Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing (2017)

    Google Scholar 

  21. Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)

    Article  Google Scholar 

  22. Kienzle, W., Chellapilla, K.: Personalized handwriting recognition via biased regularization. In: Proceedings of the 23rd International Conference on Machine Learning (2006)

    Google Scholar 

  23. Simonyan, K., Zisserman, A.: Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv preprint arXiv: 1409.1556 (2014)

    Google Scholar 

  24. Mohd, S.S., et al.: Offline Signature Verification using Deep Learning Convolutional Neural Network (CNN) Architectures GoogLeNet Inception-v1 and Inception-v3. Procedia Comput. Sci. 161, 475–483 (2019)

    Google Scholar 

  25. Cheng, W., et al.: Pulmonary image classification based on inception-v3 transfer learning model. IEEE Access 7, 146533–146541 (2019)

    Google Scholar 

  26. Wen, L., Xinyu, L., Liang, G.: A transfer convolutional neural network for fault diagnosis based on ResNet-50. Neural Comput. Appl. 1–14 (2019)

    Google Scholar 

  27. Chollet, F.: Xception: Deep learning with depthwise separable convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)

    Google Scholar 

  28. Sevilla, A., Glotin, H.: Audio Bird Classification with Inception-v4 extended with Time and Time-Frequency Attention Mechanisms. CLEF (Working Notes), (2017).

    Google Scholar 

  29. Chen, X., et al.: Visual crowd counting with improved inception-ResNet-A module. In: 2018 IEEE International Conference on Robotics and Biomimetics (ROBIO). IEEE (2018)

    Google Scholar 

  30. Qianqian, Z., Sen, L., Weiming, G.: Research on Vehicle Appearance Component Recognition Based on Mask R-CNN. In: Journal of Physics: Conference Series, vol. 1335, no. 1. IOP Publishing (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Reduanul Haque .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Haque, M.R., Azam, M.G., Milon, S.M., Hossain, M.S., Molla, M.AA., Uddin, M.S. (2021). Quantitative Analysis of Deep CNNs for Multilingual Handwritten Digit Recognition. In: Kaiser, M.S., Bandyopadhyay, A., Mahmud, M., Ray, K. (eds) Proceedings of International Conference on Trends in Computational and Cognitive Engineering. Advances in Intelligent Systems and Computing, vol 1309. Springer, Singapore. https://doi.org/10.1007/978-981-33-4673-4_2

Download citation

Publish with us

Policies and ethics