Skip to main content
Log in

Abstract

Arabic character segmentation is a necessary step in Arabic Optical Character Recognition (OCR). The cursive nature of Arabic script poses challenging problems in Arabic character recognition; however, incorrectly segmented characters will cause misclassifications of characters which in turn may lead to wrong results. Therefore, off-line Arabic character segmentation is a difficult research problem and little research has been achieved in this area in the past few decades. This is due to both the cursive nature of Arabic writing in both printed and handwritten forms and the scarcity of Arabic databases and dictionaries. Most of the character recognition methods used in the recognition of Arabic characters are adopted from available methods used on handwritten Latin and Chinese characters; however, other methods are developed only for Arabic character segmentation. This survey presents the description of the Arabic script characteristics with an overview on OCR systems and a comprehensive review mainly on off-line printed Arabic character segmentation techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Lorigo L.M., Govindaraju V.: Offline Arabic handwriting recognition: a survey. IEEE Trans. Patt. Anal. Mach. Intell. 28(5), 712–724 (2006). doi:10.1109/TPAMI.2006.102Key:citeulike:9240539

    Article  Google Scholar 

  2. Nazif, A.: A System for the Recognition of the Printed Arabic Characters. M.Sc. Thesis, Faculty of Engineering, Cairo University (1975)

  3. Zeki, A.M.: The segmentation problem in Arabic character recognition: The state of the art. First International Conference on Information and Communication Technologies, ICICT, pp. 11–26 (2005)

  4. Alshebeili A., Nabawi A., Mahmoud S.: Arabic character recognition using 1-D slices of the character spectrum. J. Signal Process. 56(1), 59–75 (1997)

    Article  MATH  Google Scholar 

  5. Amin, A.: Segmentation of printed Arabic text. International Conference on Advances in Pattern Recognition, pp. 115–126 (2001)

  6. Wikipedia: http://en.wikipedia.org/wiki/Optical_Character_Recognition (2010). Accessed 27 June 2010

  7. Wikipedia: http://en.wikipedia.org/wiki/Arabic_language (2010). Accessed 23 Nov 2010

  8. John, W.: Major Languages of the World, The New York Times Almanac. p. 492 (2002)

  9. Wikipedia: Arabic Language. http://en.wikipedia.org/wiki/Arabic_language#cite_note-Proch-0 (2010). Accessed 3 Aug 2010

  10. Andaman: http://www.andaman.org/BOOK/reprints/weber/rep-weber.htm (2011). Accessed 2 Jan 2011

  11. Internet World Statistics: http://www.internetworldstats.com/stats7.htm (2010). Accessed 10 Dec 2010

  12. Khorsheed, M.S.: Off-Line Arabic Character Recognition—A Review. Pattern Analysis and Applications, pp. 31–45 (2005)

  13. Looklex Encyclopedia.: http://looklex.com/e.o/arabic_l.htm (2011). Accessed 13 Mar 2011

  14. Al-Badr B., Haralick R.M.: A segmentation-free approach to text recognition with application to Arabic text. Int. J. Document Anal. Recognit. 1(3), 147–166 (1998)

    Article  Google Scholar 

  15. Alginahi, Y.M.: Chapter 1: Preprocessing Techniques in Character Recognition. Character Recognition, Edited by Minoru Mori, ISBN: 978-953-307-105-3, Sciyo, Available from: http://sciyo.com/articles/show/title/preprocessing-techniques-in-character-recognition (2010)

  16. Alginahi, Y.M., Siddiqi, A.: Multi-stage hybrid Arabic/Indian numeral OCR system. Int. J. Comput.Sci. Inform. Secur. ISSN 1947–5500. 8(1), 9–18. http://sites.google.com/site/ijcsis (2010)

  17. Nixon, N., Aguado, A.: Feature Extraction and Image Processing, 2nd edn. ISBN: 978-0-12-372538-7, Elsevier Ltd., London(2008)

  18. AnyDoc Software:www.ocrforanydoc.com. Automate your document processing and data capture. www.anydocsoftware.com/software/products/ocr/pdf/brochure.pdf (2010). Accessed on 10 Nov 2010

  19. Jambi, K.: Design and Implementation of a System for Recognizing Arabic Handwritten Words with Learning Ability. Ph.D. Thesis. Illinois Institute of Technology, Chicago (1991)

  20. Ali, M.O.: A New Pattern Matching Approach to the Recognition of Printed Arabic. Workshop on content visualization and intermedia representations (CVIR’98), University of Montreal, Montreal (1998)

  21. Nawaz, S.N., Sarfraz, M., Zidouri, A., Al-Khatib, W.G.: An approach to off-line Arabic character recognition using neural networks. In: Proceedings of the 10th IEEE International Conference on Electronics, Circuits and Systems (ICECS 2003), vol. 3, pp. 1328–1331 (2003)

  22. Parhami B., Taraghi M.: Automatic recognition of printed Farsi texts. Patt. Recognit. 14(1–6), 395–403 (1981)

    Article  Google Scholar 

  23. Adnan, A., Masini, G.: Machine recognition of Arabic cursive words. SPIE 26th International Symposium on Instrument Display. Application of Digital Image Processing IV, vol. 359, pp. 286–292. San Diego (1982)

  24. Mori S., Nishida H., Yamada H.: Optical Character Recognition. Wiley, New Jersey (1999)

    Google Scholar 

  25. Amin, A., Masini, G.: Machine recognition of multi-font printed Arabic Texts. In: Proceedings of International Conference on Pattern Recognition, Paris, France, pp. 392–395 (1986)

  26. Ymin, A., Aoki, Y.: On the segmentation of multi-font printed Uygur scripts. 13 th International Conference on Pattern Recognition, vol. 3, pp. 215–219 (1996)

  27. Bushofa, B.M.F., Spann, M. Segmentation of Arabic characters using their contour information. 13th International Conference on Digital Signal Processing, vol. 2, pp. 683–686

  28. Romeo-Pakker, K., Miled, H., Lecourtier, Y. A new approach for Latin/Arabic character segmentation. 3rd International Conference on Document Analysis and Recognition, vol. 2, pp. 874– 877

  29. Tellache M., Sid-Ahmed M.A., Abaza B.: Thinning algorithms for Arabic OCR. IEEE Pac. Rim Conf. Commun. Comput. Signal Process. 1, 248–251 (1993)

    Google Scholar 

  30. Altuwaijri M., Bayoumi M.: A new thinning algorithm for Arabic characters using self-organizing neural network. IEEE Int. Symp. Circ. Syst. 3, 1824–1827 (1995)

    Google Scholar 

  31. Altuwaijri M.M., Bayoumi M.A.: A thinning algorithm for Arabic characters using ART2 neural network. IEEE Trans. Circ. Syst. II: Analog Digit. Signal Process. 45(2), 260–264 (1998)

    Article  Google Scholar 

  32. Hamid, A.: A neural network approach for the segmentation of handwritten Arabic text. International Symposium on Innovation in Information and Communication Technology. Amman, Jordan (2001)

  33. Hamid, A., Haraty, R.: A neuro-heuristic approach for segmenting handwritten Arabic text. ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2001), pp. 110–113. Lebanon (2001)

  34. Elgammal, A.M., Ismail, M.A.: A graph-based segmentation and feature extraction framework for Arabic text recognition. 6th International Conference on Document Analysis and Recognition, pp. 622–626 (2001)

  35. Al-Badr, B., Haralick. R.M. Segmentation-free word recognition with application to Arabic In: Proceedings of the Third International Conference on Document Analysis and Recognition, vol. 1, 355–359 (1995)

  36. Timsari, B., Fahimi, H.: Morphological approach to character recognition in machine-printed Persian words. In: Proceeding of SPIE. Document Recognition III. San Jose (1996)

  37. Motawa, D., Amin, A., Sabourin, R.: Segmentation of Arabic cursive script 4th International Conference on Document Analysis and Recognition, vol. 2, 625–628

  38. Gouda, A.M., Rashwan, M.A.: Segmentation of connected Arabic characters using hidden Markov models. IEEE International Conference on Computational Intelligence for Measurement Systems and Applications, CIMSA, pp. 115–119 (2004)

  39. Touj S.M., Ben Amara N., Amiri H.: Segmentation stage of a PHMM-based model for off-line recognition of Arabic handwritten city names. IEEE Int. Conf. Syst. Man Cybernet. vol. 4, 6–9 (2002)

    Google Scholar 

  40. Najoua, B.A., Noureddine, E. A robust approach for Arabic printed character segmentation Third International Conference on Document Analysis and Recognition, vol. 2, 865–868 (1995)

  41. Latifa, H., Daoud, B.: Recognition system for printed multi-font and multi-size Arabic characters. Arab. J. Sci. Eng. 27(1B), 57–72 (2002)

    Google Scholar 

  42. Zheng L., Hassin A.H., Tang X.: A new algorithm for machine printed Arabic character segmentation. Patt. Recognit. Lett. 25(15), 1723–1729 (2004)

    Article  Google Scholar 

  43. Sarfraz, M., Nawaz, S.N., Al-Khuraidly, A.: Off-line Arabic text recognition system. International Conference on Geometric Modeling and Graphics, London, England, pp. 30–36 (2003)

  44. El-Sheikh T.S., Guindi R.M.: Computer recognition of Arabic cursive scripts. Patt. Recognit. 21, 293–302 (1988)

    Article  Google Scholar 

  45. Hashemi, M.R., Fatemi, O., Safavi, R.: Persian cursive script recognition. 3rd International Conference on Document Analysis and Recognition, vol. 2, pp. 869–873. Montreal, Canada (1995)

  46. Fakir M., Hassani M.M: On the recognition of Arabic characters using Hough transform techniques. Malays. J. Comput. Sci. 13(2), 39–47 (2000)

    Google Scholar 

  47. Fakir, M., Hassani, M.M., Sodeyama, C.: Recognition of Arabic characters using Karhunen- Loeve transform and dynamic programming. IEEE International Conference on Systems Man and Cybernetics, vol. 6, pp 12–15. 868–873, (1999)

  48. Lorigo, L., Govindaraju, V. Segmentation and pre-recognition of Arabic handwriting In: Proceedings of the 8th International Conference on Document Analysis and Recognition, vol. 2, pp. 605–609

  49. Zidouri, A., Sarfraz, M., Shahab, S.A., Jafri, S.M.: Adaptive dissection based sub-word segmentation of printed Arabic text. In: Proceedings of the 9th International Conference on Information Visualization, pp. 239–243 (2005)

  50. El-Khaly, F., Sid-Ahmed, M.A.: Machine recognition of optically captured machine printed Arabic text. In: Proceedings of Pattern Recognition, vol. 23, pp.1207–1214 (1990)

  51. Abuhaiba I.S.I.: A discrete Arabic script for better automatic document understanding. Arab. J. Sci. Eng. 28(1B), 77–94 (2003)

    Google Scholar 

  52. Amin A., Mari J.: Machine recognition and correction of printed Arabic text. IEEE Trans. Syst. Man Cybern. SMC 19(5), 1300–1306 (1989)

    Article  Google Scholar 

  53. Al-Badr B., Mahmoud S.: Survey and bibliography of Arabic optical text recognition. Signal Process. 41(1), 49–77 (1995)

    Article  MATH  Google Scholar 

  54. El-Dabi S.S., Ramsis R., Kamel A.: Arabic character recognition system: a statistical approach for recognizing cursive typewritten text. Patt. Recognit. 23(5), 485–495 (1990)

    Article  Google Scholar 

  55. Amin, A.: “Arabic Character Recognition (1997) In: Handbook of Character Recognition and Document Image Analysis—(Chapter 15). Edited by H. Bunke and P. S. P. Wang. World Scientific. Singapore. pp. 397–420

  56. Zahour, A., Taconet, B., Mercy, P., Ramdane, S.: Arabic hand-written text-line extraction. 6th International Conference on Document Analysis and Recognition, pp. 281–285. Seattle, Washington (2001)

  57. Al-Yousefi H., Udpa S.S.: Recognition of Arabic characters. IEEE Trans. Patt. Anal. Mach. Intell. 14(8), 853–857 (1992)

    Article  Google Scholar 

  58. El Gowely, K., Dessouki, I., Nazif, A.: Multi-phase recognition of multi font photoscript Arabic text. 10th International Conference on Pattern Recognition ICPR, vol. 1, pp. 700–702. Atlantic City, New Jersy (1990)

  59. Tolba, M.F., Shaddad, E.: On the automatic reading of printed Arabic characters. IEEE International Conference on Systems Man and Cybernetics, pp. 496–498. Los Angeles (1990)

  60. Cheung, A., Bennamoun, M., Bergmann, N.W.: Implementation of a statistical based Arabic character recognition system. IEEE Region 10 Annual Conference on Speech and Image Technologies for Computing and Telecommunications, vol. 2. pp. 531–534. Brisbane, Australia (1997)

  61. Abuhaiba I., Mahmoud S., Green R.: Recognition of handwritten cursive Arabic characters. IEEE Trans. Patt. Anal. Mach. Intell. 16(6), 664–672 (1994)

    Article  Google Scholar 

  62. Ben Amara, N., Ellouze, N. A Robust approach for Arabic printed character segmentation 3 rd International Conference on Document Analysis and Recognition, vol. 2, 865–868. Montreal (1995)

  63. Abelazim, H., Hashish, M.: Arabic reading machine. 10th Saudi National Computer Conference. Riyadh, Saudi Arabia, pp. 733–743 (1988)

  64. Abelazim H., Hashish M.: Automatic reading of bilingual typewritten text. Proc. CompEuro’ 89(VLSI and Computer Peripherals. Hamburg. 2), 140–144 (1989)

    Google Scholar 

  65. Margner, V.: SARAT-a system for the recognition of Arabic printed text. 11th International Conference on Pattern Recognition Methodology and Systems, vol. 2, Conference B, pp. 561–564 (1992)

  66. Liangrui, P., Changsong, L., Xiaoqing, D., Hua, W.: Multilingual document recognition research and its application in China. Second International Conference on Document Image Analysis for Libraries, pp. 126–132 (2006)

  67. Mehran, R., Pirsiavash, H., Razzazi, F.: A front-end OCR for Omni-font Persian/Arabic cursive printed documents. In: Proceedings of the Digital Imaging Computing: Techniques and Applications, pp. 56–60 (2005)

  68. Sari, T., Souici, S.M.: Off-line handwritten Arabic character segmentation algorithm: ACSA. In: Proceedings of International Workshop Frontiers in Handwriting Recognition, pp. 452–457 (2002)

  69. Lam L., Suen Y.: Thinning methodologies—a comprehensive survey. IEEE Trans. Patt. Anal. Mach. Intell. 14(9), 869–885 (1992)

    Article  Google Scholar 

  70. Cowell, J., Hussain, F.: Thinning Arabic characters for feature extraction. In: Proceedings Fifth International Conference on Information Visualization, pp. 181–185 (2001)

  71. Goraine H., Usher M., Al-Emami S.: Off-line Arabic character recognition. Computer 25(7), 71–74 (1992)

    Article  Google Scholar 

  72. Narima, Z., Messaoud, R., Mouldi, B. Neuro-Markovian hybrid system for handwritten Arabic word recognition In: Proceedings of the 10th IEEE International Conference on Electronics, Circuits and Systems, vol. 2, 878–881

  73. Hosseini, H.M.M., Bouzerdoum, A.: A combined method for Persian and Arabic handwritten digit recognition. Australian and New Zealand Conference on Intelligent Information Systems, pp 80–83 (1996)

  74. Mozaffari, S., Faez, K., Ziaratban, M. Structural decomposition and statistical description of Farsi/Arabic handwritten numeric characters Eighth International Conference on Document Analysis and Recognition, vol. 1, 237–241 (2005)

  75. Blumenstein, M., Verma, B.: A segmentation algorithm used in conjunction with artificial neural networks for the recognition of real-world postal addresses. International Conference on Computational Intelligence and Multimedia Applications (ICCIMA ’97), pp. 155–160. Gold Coast, Australia (1997)

  76. Eastwood, B., Jennings, A., Harvey, A.: A feature based neural network segmenter for handwritten words. In: Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA ’97), pp. 286–290. Gold Coast, Australia (1997)

  77. Lee, S.-W., Lee, D.-J., Park, H.-S.: A new methodology for gray-scale character segmentation and recognition. IEEE Trans. Patt. Anal. Mach. Intell. 1045–1051 (1996)

  78. Han, K., Sethi, I. K.: “Off-line Cursive Handwriting Segmentation”, pp. 894–897. ICDAR ‘95, Montreal, Canada (1995)

  79. Srihari, S. N.: Recognition of handwritten and machine-printed text for postal address interpretation. Patt. Recognit. Lett. 291–302

  80. Xiu, P., Peng, L., Ding, X. (2006) Multi-queue merging scheme and its applications in Arabic script segmentation. In: Proceedings of the Second International Conference on Document Image Analysis of Libraries, pp. 24–29

  81. Zidouri, A.: ORAN: a basis for an Arabic OCR system. International Symposium on Intelligent Multimedia, Video and Speech Processing, pp. 703–706 (2004)

  82. Rabiner, L.R., Levinson, S.E.: A speaker-independent, syntax-directed, connected word recognition system based on hidden Markov model and level building. IEEE Trans. Audio, Speech Signal Process. 33(3) (1985)

  83. Afify, M.A.: Large Vocabulary Continuous Arabic Speech Recognition. Ph.D. Thesis, Faculty of Engineering. Cairo University (1995)

  84. Brugnara, F., Faiavigna, D., Omologo, M.: Automatic Segmentation and Labeling of Speech Based on Hidden Markov Model. Speech Communication 12. North Holland (1993)

  85. Rabiner L.R., Wilpon J.G., Soong F.K.: High performance connected digit recognition using hidden Markov model. IEEE Trans. Audio, Speech Signal Process. 37(8), 1214–1225 (1989)

    Article  Google Scholar 

  86. LaPre, C., Ying, Z., Raphael, C., Schwartz, R., Makhoul, J. Multi-font recognition of printed Arabic using the BBN BYBLOS speech recognition system. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 4, 2136–2139 (1996)

  87. El-Hajj, R., Likforman-Sulem, L., Mokbel, C. Arabic handwriting recognition using baseline dependant features and hidden Markov modeling. Eighth International Conference on Document Analysis and Recognition, vol. 2, pp. 893–897 (2005)

  88. Rashwan, M.A.: A new OCR system similar to ASR system. The 10th International Conference on Computing and Information, ICCI 2000, Kuwait (2000)

  89. Rashwan, M., Fakhr, M., Attia, M., El-Mahallawy, M., Arabic OCR System analogous to HMM-based ASR systems; implementation and evaluation. J. Eng. Appl. Sci. Cairo University, www.Journal.eng.CU.edu.eg, Apr. 2008

  90. www.itp.net/Arabic, “OCR software, group test,“,,

  91. Alma’adeed S., Higgens C., Elliman D.: Recognition of off-line handwritten Arabic words using hidden Markov model approach. 16th International Conference on Pattern Recognition, vol. 3, pp 481–484 (2002)

    Google Scholar 

  92. Bourouba, H., Bedda, M.: Hybrid approach DTW/HMMC for the recognition of the isolated Arabic words. International Conference on Information and Communication Technologies: From Theory to Applications, pp. 481–482 (2004)

  93. Pechwitz, M., Maergner, V.: HMM based approach for handwritten Arabic word recognition using the IFN/ENIT—database. Seventh International Conference on Document Analysis and Recognition pp. 890–894 (2003)

  94. Dehghan M., Faez K., Ahmadi M., Shridhar M.: Holistic handwritten word recognition using discrete HMM and self-organizing feature map. IEEE Int. Conf. Syst. Man Cybern. 4, 2735–2739 (2000)

    Google Scholar 

  95. Bushofa B., Spann M.: Segmentation and recognition of Arabic characters by structural classification. Image Vis. Comput. (IVC) 15(3), 167–179 (1997)

    Article  Google Scholar 

  96. Touj, S., Ben Amara, N., Amiri, H.: Two approaches for Arabic script recognition-based segmentation using the Hough transform. Ninth International Conference on Document Analysis and Recognition, vol. 2, pp. 654–658. ICDAR 2007 (2007)

  97. Broumandnia, A., Shanbehzadeh, J., Nourani, M.: Segmentation of printed Farsi/Arabic words. International Conference on Computer Systems and Applications, pp. 761–766. Amman, Jordon (2007)

  98. Broumandnia A., Shanbehzadeh J.: Fast Zernike wavelet moments for Farsi character recognition. Image Vis. Comput. 25, 717–726 (2007)

    Article  Google Scholar 

  99. Cheung A., Bennamouon M., Bergmann N.W.: An Arabic optical character recognition system using recognition-based segmentation. Patt. Recognit. 34, 215–233 (2001)

    Article  MATH  Google Scholar 

  100. Almuallim H., Yamaguchi S.: A method for recognition of Arabic cursive handwriting. IEEE Trans. Patt. Anal. Mach. Intell. PAMI- 9(5), 715–722 (1987)

    Article  Google Scholar 

  101. Ramsis, R., El-Dabi, S.S., Kamel, A.: Arabic character recognition system, IBM Kuwait Scientific Centre, report No. KSC027 (1988)

  102. Zahour A., Taconet B., Faure A.: Machine recognition of Arabic cursive writing. In: Impedovo, S., Simon, J.C. (eds) From Pixels to Features III: Frontiers in Handwriting Recognition, pp. 289–296. Elsevier Science Publishers B.V., Amsterdam (1992)

    Google Scholar 

  103. Abuhaiba, I.S.I.: Recognition of Off-Line Handwritten Cursive Text,” Ph.D. thesis, Department of Electronic and Electrical Engineering, Loughborough University, Loughborough, UK (1996)

  104. Erlandson, E., Trenkle, J., Vogt, R.: Word-level recognition of multi-font Arabic text using a feature vector matching approach. In: Proceedings of the International Society for Optical Engineers, SPIE, vol. 2660, pp. 63–70 (1996)

  105. Amin, A.: Recognition of printed Arabic text suing machine learning. In: Proceedings of the International Society for Optical Engineers, SPIE, vol. 3305, pp. 63–70 (1998)

  106. Clocksin, W.F., Khorsheed, M.S.: Word recognition in Arabic handwriting. In: Proceedings of International Conference on Artificial Intelligence Applications. ICAIA. Egypt (2000)

  107. Khorsheed, M.S., Clocksin, W.F.: Spectral features for Arabic word recognition. IEEE International Conference on Acoustics. Speech and Signal Processing, ICASP, Turkey (2000)

  108. Khorsheed, M.S., Clocksin, W.F.: Structural features of cursive Arabic script. In: Proceedings Of British Conference on Machine Vision, pp. 422–431 (1999)

  109. Khorsheed M.S., Clocksin W.F.: Multi-font Arabic word recognition using spectral features. 15th International Conference on Pattern Recognition 4, 543–546 (2000)

    Google Scholar 

  110. Tse, E., Bigun, J.: A Base-line character recognition for Syriac-Aramaic. IEEE International Conference on Systems, Man and Cybernetics, pp. 1048–1055 (2007)

  111. Dehghan, M., Faez, K., Ahmadi, M., Shridhar, M.: Off-line unconstrained Farsi handwritten word recognition using fuzzy vector quantization and hidden Markov word models. In: Proceedings of the 15th International Conference on Pattern Recognition, vol. 2, pp. 351–354 (2000)

  112. Nadeem Ahmad Khan: A shape Analysis Model with Application to Character and Word Recognition. Technische Universiteit Eindhoven. Proefschrift. ISBN 90-386-1750-X (2000)

  113. Casey R.G., Lecolinet E.: A survey of methods and strategies in character segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 18(7), 690–706 (1996)

    Article  Google Scholar 

  114. Chen, C.H., DeCurtins, J.L.: Word recognition in a segmentation-free approach to OCR. Second International Conference on Document Analysis and Recognition, pp. 573–576 (1993)

  115. Shaikh, N.A., Shaikh, Z.A., Ali, G.: Segmentation of Arabic text into characters for recognition. IMTIC 2008, CCIS 20, pp. 11–18 (2008)

  116. Zidouri A.: On multiple typeface Arabic script recognition. Res. J. Appl. Sci. Eng. Technol. 2(5), 428–435 (2010)

    Google Scholar 

  117. Shirali-Shahreza, S., Manzuri-Shalmani, M.T., Shirali-Shahreza, M.: A skew resistant method for Persian text recognition. In: Proceedings for the IEEE Symposium on Computational Intelligence in Image and Signal Processing, pp. 115–120 (2007)

  118. Khorsheed M.S.: Off-line recognition of Omnifont Arabic text using the HMM ToolKit (HTK). Patt. Recognit. Lett. 28(12), 1563–1571 (2007)

    Article  Google Scholar 

  119. Boubaker, H., El Baati, A., Kherallah, M., Alimi, A.M., Elabed, H.: Online Arabic handwriting modeling system based on the graphemes segmentation. 20th International Conference on Pattern Recognition (ICPR), pp. 2061–2064 (2010)

  120. Moussa S.B., Zahour A., Benabdelhafid A., Adel M.A.: New features using fractal multi-dimensions for generalised Arabic font recognition. Patt. Recognit. Lett. 31(5), 361–371 (2010)

    Article  Google Scholar 

  121. Slimane, F., Kanoun, S., Alimi, A.M., Ingold, R., Hennebert, J.: Gaussian mixture models for Arabic font recognition. 20th International Conference on Pattern Recognition (ICPR), pp. 2174–2177 (2010)

  122. Khosravi H., Kabir E.: Farsi font recognition based on Sobel-Roberts features. Patt. Recognit. Lett. 31, 75–82 (2010)

    Article  Google Scholar 

  123. Boussellaa, W., Zahour, A., Elabed, H., Benabdelhafid, A., Alimi A.: Unsupervised block covering analysis for text-line segmentation of Arabic ancient handwritten document images. 20 th International Conference on Pattern Recognition (ICPR), pp. 1929–1932 (2010)

  124. Al Hamad H., Abu Zitar R.: Development of an efficient neural-based segmentation technique for Arabic handwriting recognition. Patt. Recognit. 43, 2773–2798 (2010)

    Article  MATH  Google Scholar 

  125. Saeed K., Albakoor M.: Region growing based segmentation algorithm for typewritten and handwritten text recognition. Appl. Soft Comput. 9, 608–617 (2009)

    Article  Google Scholar 

  126. Harouni M., Mohamad D., Rasouli A.: Deductive method for recognition of on-line handwritten Persian/Arabic characters. The 2nd International Conference on Computer and Automation Engineering (ICCAE) 5, 791–795 (2010)

    Article  Google Scholar 

  127. Omer M.A.H., Shilong M.: Recognition online Arabic pattern. 3 rd International Conference on Advanced Computer Theory and Engineering (ICACTE) 6, 18–22 (2010)

    Google Scholar 

  128. Khan, K.U., Haider, I.: Online recognition of multi-stroke handwritten Urdu characters. International Conference on Image Analysis and Signal Processing (IASP), pp. 284–290 (2010)

  129. Shirali-Shahreza, M., Shirali-Shahreza, S.: Persian/Arabic text font estimation using dots. In: Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, pp. 420–425 (2006)

  130. Altec, “Character Recognition Systems Overview”, viewed on 13/2/2012. http://www.alteccenter.org/page.php?pg=filesrepositry/getRepository.php&main_cat=1&sub_cat=24

  131. Ashraf, A., Colin A.H., Mahmoud, K. (2008) A database for Arabic printed character recognition. International Conference on Image Analysis and Recognition, ICIAR 2008, pp. 567–578

  132. Slimane, F., Ingold, R., Kanoun, S., Alimi, M.A., Hennebert, J.: Database and evaluation protocols for Arabic printed text recognition. Internal Research Report (DIUF). University of Fribourg. Switzerland. Obtained from: http://diuf.unifr.ch/diva/APTI/publications.html (2009)

  133. Schlosser, S.: ERIM Arabic Database. Document Processing Research Program, Information and Materials Applications Laboratory. Environmental Research Institute of Michigan (1995)

  134. Slimane, F., Ingold, R., Kanoun, S., Alimi, M.A., Hennebert, J.: A new Arabic printed text image database and evaluation protocols. In: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR 2009), pp. 946–950. Barcelona (Spain) (2009)

  135. Altex, http://www.ALTEC-Center.org, viewed on 13/2/2012

  136. Beesley, K.R.: Arabic finite-state morphological analysis and generation. In: COLING-96 Proceedings, Copenhagen, vol. 1. pp. 89–94 (1996)

  137. Ramzi Abbes, J.D., Hassoun, M.: The architecture of a standard Arabic lexical database. Some figures, ratios and categories from the DIINAR.1 Source program. In: Workshop of Computational Approaches to Arabic Script-based Languages, Geneva, Switzerland (2004)

  138. David Graff K.C., Kong J., Maeda K.: Arabic Gigaword, 2nd edn. Linguistic Data Consortium. University of Pennsylvania, Philadelphia (2006)

    Google Scholar 

  139. Bell, J., Zemanek, P.: Test of two Arabic OCR programs. Manuscr.Orient. Int. J. Orient. Manuscr. Res. 1(3), 55–57. http://www.islamicmanuscripts.info/.../Bell-Zemanek-1995-MO-1-3-Test-O.PDF (1995)

  140. Kanungo, T., Marton, G.E., Bulbul, O.: Performance evaluation of two Arabic OCR products. In: Proceedings of AIPR Workshop on Advances in Computer Assisted Recognition. SPIE vol. 3584, Washington, DC (1998)

  141. Kanungo, T., Marton, G.A., Bulbul, O.: OmniPage versus Sakhr: paired model evaluation of two Arabic OCR products. In: Proceedings of SPIE Conference on Document Recognition, San Jose, CA vol. 3651, pp. 109–120 (1999)

  142. Aramedia, The Best Arabic OCR Technology, viewed on 10th of June, 2010, http://aramedia.com/ocr.htm

  143. Al-Ohali Y., Cheriet M., Suen Ching: Databases for recognition of handwritten Arabic cheques. Patt. Recognit. 36(1), 111–121 (2003)

    Article  MATH  Google Scholar 

  144. Al-Ohali Y.: Handwritten word recognition: application to Arabic cheque processing. Department of Computer Science, Concordia University, Montreal (2002)

    Google Scholar 

  145. Fujisawa, H.: Forty years of research in character and document recognition—an industrial perspective. Patt. Recognit. Vol. 41, pp. 2435–2446 (2008)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yasser M. Alginahi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Alginahi, Y.M. A survey on Arabic character segmentation. IJDAR 16, 105–126 (2013). https://doi.org/10.1007/s10032-012-0188-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-012-0188-6

Keywords

Navigation