Abstract
Arabic character segmentation is a necessary step in Arabic Optical Character Recognition (OCR). The cursive nature of Arabic script poses challenging problems in Arabic character recognition; however, incorrectly segmented characters will cause misclassifications of characters which in turn may lead to wrong results. Therefore, off-line Arabic character segmentation is a difficult research problem and little research has been achieved in this area in the past few decades. This is due to both the cursive nature of Arabic writing in both printed and handwritten forms and the scarcity of Arabic databases and dictionaries. Most of the character recognition methods used in the recognition of Arabic characters are adopted from available methods used on handwritten Latin and Chinese characters; however, other methods are developed only for Arabic character segmentation. This survey presents the description of the Arabic script characteristics with an overview on OCR systems and a comprehensive review mainly on off-line printed Arabic character segmentation techniques.
Similar content being viewed by others
References
Lorigo L.M., Govindaraju V.: Offline Arabic handwriting recognition: a survey. IEEE Trans. Patt. Anal. Mach. Intell. 28(5), 712–724 (2006). doi:10.1109/TPAMI.2006.102Key:citeulike:9240539
Nazif, A.: A System for the Recognition of the Printed Arabic Characters. M.Sc. Thesis, Faculty of Engineering, Cairo University (1975)
Zeki, A.M.: The segmentation problem in Arabic character recognition: The state of the art. First International Conference on Information and Communication Technologies, ICICT, pp. 11–26 (2005)
Alshebeili A., Nabawi A., Mahmoud S.: Arabic character recognition using 1-D slices of the character spectrum. J. Signal Process. 56(1), 59–75 (1997)
Amin, A.: Segmentation of printed Arabic text. International Conference on Advances in Pattern Recognition, pp. 115–126 (2001)
Wikipedia: http://en.wikipedia.org/wiki/Optical_Character_Recognition (2010). Accessed 27 June 2010
Wikipedia: http://en.wikipedia.org/wiki/Arabic_language (2010). Accessed 23 Nov 2010
John, W.: Major Languages of the World, The New York Times Almanac. p. 492 (2002)
Wikipedia: Arabic Language. http://en.wikipedia.org/wiki/Arabic_language#cite_note-Proch-0 (2010). Accessed 3 Aug 2010
Andaman: http://www.andaman.org/BOOK/reprints/weber/rep-weber.htm (2011). Accessed 2 Jan 2011
Internet World Statistics: http://www.internetworldstats.com/stats7.htm (2010). Accessed 10 Dec 2010
Khorsheed, M.S.: Off-Line Arabic Character Recognition—A Review. Pattern Analysis and Applications, pp. 31–45 (2005)
Looklex Encyclopedia.: http://looklex.com/e.o/arabic_l.htm (2011). Accessed 13 Mar 2011
Al-Badr B., Haralick R.M.: A segmentation-free approach to text recognition with application to Arabic text. Int. J. Document Anal. Recognit. 1(3), 147–166 (1998)
Alginahi, Y.M.: Chapter 1: Preprocessing Techniques in Character Recognition. Character Recognition, Edited by Minoru Mori, ISBN: 978-953-307-105-3, Sciyo, Available from: http://sciyo.com/articles/show/title/preprocessing-techniques-in-character-recognition (2010)
Alginahi, Y.M., Siddiqi, A.: Multi-stage hybrid Arabic/Indian numeral OCR system. Int. J. Comput.Sci. Inform. Secur. ISSN 1947–5500. 8(1), 9–18. http://sites.google.com/site/ijcsis (2010)
Nixon, N., Aguado, A.: Feature Extraction and Image Processing, 2nd edn. ISBN: 978-0-12-372538-7, Elsevier Ltd., London(2008)
AnyDoc Software:www.ocrforanydoc.com. Automate your document processing and data capture. www.anydocsoftware.com/software/products/ocr/pdf/brochure.pdf (2010). Accessed on 10 Nov 2010
Jambi, K.: Design and Implementation of a System for Recognizing Arabic Handwritten Words with Learning Ability. Ph.D. Thesis. Illinois Institute of Technology, Chicago (1991)
Ali, M.O.: A New Pattern Matching Approach to the Recognition of Printed Arabic. Workshop on content visualization and intermedia representations (CVIR’98), University of Montreal, Montreal (1998)
Nawaz, S.N., Sarfraz, M., Zidouri, A., Al-Khatib, W.G.: An approach to off-line Arabic character recognition using neural networks. In: Proceedings of the 10th IEEE International Conference on Electronics, Circuits and Systems (ICECS 2003), vol. 3, pp. 1328–1331 (2003)
Parhami B., Taraghi M.: Automatic recognition of printed Farsi texts. Patt. Recognit. 14(1–6), 395–403 (1981)
Adnan, A., Masini, G.: Machine recognition of Arabic cursive words. SPIE 26th International Symposium on Instrument Display. Application of Digital Image Processing IV, vol. 359, pp. 286–292. San Diego (1982)
Mori S., Nishida H., Yamada H.: Optical Character Recognition. Wiley, New Jersey (1999)
Amin, A., Masini, G.: Machine recognition of multi-font printed Arabic Texts. In: Proceedings of International Conference on Pattern Recognition, Paris, France, pp. 392–395 (1986)
Ymin, A., Aoki, Y.: On the segmentation of multi-font printed Uygur scripts. 13 th International Conference on Pattern Recognition, vol. 3, pp. 215–219 (1996)
Bushofa, B.M.F., Spann, M. Segmentation of Arabic characters using their contour information. 13th International Conference on Digital Signal Processing, vol. 2, pp. 683–686
Romeo-Pakker, K., Miled, H., Lecourtier, Y. A new approach for Latin/Arabic character segmentation. 3rd International Conference on Document Analysis and Recognition, vol. 2, pp. 874– 877
Tellache M., Sid-Ahmed M.A., Abaza B.: Thinning algorithms for Arabic OCR. IEEE Pac. Rim Conf. Commun. Comput. Signal Process. 1, 248–251 (1993)
Altuwaijri M., Bayoumi M.: A new thinning algorithm for Arabic characters using self-organizing neural network. IEEE Int. Symp. Circ. Syst. 3, 1824–1827 (1995)
Altuwaijri M.M., Bayoumi M.A.: A thinning algorithm for Arabic characters using ART2 neural network. IEEE Trans. Circ. Syst. II: Analog Digit. Signal Process. 45(2), 260–264 (1998)
Hamid, A.: A neural network approach for the segmentation of handwritten Arabic text. International Symposium on Innovation in Information and Communication Technology. Amman, Jordan (2001)
Hamid, A., Haraty, R.: A neuro-heuristic approach for segmenting handwritten Arabic text. ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2001), pp. 110–113. Lebanon (2001)
Elgammal, A.M., Ismail, M.A.: A graph-based segmentation and feature extraction framework for Arabic text recognition. 6th International Conference on Document Analysis and Recognition, pp. 622–626 (2001)
Al-Badr, B., Haralick. R.M. Segmentation-free word recognition with application to Arabic In: Proceedings of the Third International Conference on Document Analysis and Recognition, vol. 1, 355–359 (1995)
Timsari, B., Fahimi, H.: Morphological approach to character recognition in machine-printed Persian words. In: Proceeding of SPIE. Document Recognition III. San Jose (1996)
Motawa, D., Amin, A., Sabourin, R.: Segmentation of Arabic cursive script 4th International Conference on Document Analysis and Recognition, vol. 2, 625–628
Gouda, A.M., Rashwan, M.A.: Segmentation of connected Arabic characters using hidden Markov models. IEEE International Conference on Computational Intelligence for Measurement Systems and Applications, CIMSA, pp. 115–119 (2004)
Touj S.M., Ben Amara N., Amiri H.: Segmentation stage of a PHMM-based model for off-line recognition of Arabic handwritten city names. IEEE Int. Conf. Syst. Man Cybernet. vol. 4, 6–9 (2002)
Najoua, B.A., Noureddine, E. A robust approach for Arabic printed character segmentation Third International Conference on Document Analysis and Recognition, vol. 2, 865–868 (1995)
Latifa, H., Daoud, B.: Recognition system for printed multi-font and multi-size Arabic characters. Arab. J. Sci. Eng. 27(1B), 57–72 (2002)
Zheng L., Hassin A.H., Tang X.: A new algorithm for machine printed Arabic character segmentation. Patt. Recognit. Lett. 25(15), 1723–1729 (2004)
Sarfraz, M., Nawaz, S.N., Al-Khuraidly, A.: Off-line Arabic text recognition system. International Conference on Geometric Modeling and Graphics, London, England, pp. 30–36 (2003)
El-Sheikh T.S., Guindi R.M.: Computer recognition of Arabic cursive scripts. Patt. Recognit. 21, 293–302 (1988)
Hashemi, M.R., Fatemi, O., Safavi, R.: Persian cursive script recognition. 3rd International Conference on Document Analysis and Recognition, vol. 2, pp. 869–873. Montreal, Canada (1995)
Fakir M., Hassani M.M: On the recognition of Arabic characters using Hough transform techniques. Malays. J. Comput. Sci. 13(2), 39–47 (2000)
Fakir, M., Hassani, M.M., Sodeyama, C.: Recognition of Arabic characters using Karhunen- Loeve transform and dynamic programming. IEEE International Conference on Systems Man and Cybernetics, vol. 6, pp 12–15. 868–873, (1999)
Lorigo, L., Govindaraju, V. Segmentation and pre-recognition of Arabic handwriting In: Proceedings of the 8th International Conference on Document Analysis and Recognition, vol. 2, pp. 605–609
Zidouri, A., Sarfraz, M., Shahab, S.A., Jafri, S.M.: Adaptive dissection based sub-word segmentation of printed Arabic text. In: Proceedings of the 9th International Conference on Information Visualization, pp. 239–243 (2005)
El-Khaly, F., Sid-Ahmed, M.A.: Machine recognition of optically captured machine printed Arabic text. In: Proceedings of Pattern Recognition, vol. 23, pp.1207–1214 (1990)
Abuhaiba I.S.I.: A discrete Arabic script for better automatic document understanding. Arab. J. Sci. Eng. 28(1B), 77–94 (2003)
Amin A., Mari J.: Machine recognition and correction of printed Arabic text. IEEE Trans. Syst. Man Cybern. SMC 19(5), 1300–1306 (1989)
Al-Badr B., Mahmoud S.: Survey and bibliography of Arabic optical text recognition. Signal Process. 41(1), 49–77 (1995)
El-Dabi S.S., Ramsis R., Kamel A.: Arabic character recognition system: a statistical approach for recognizing cursive typewritten text. Patt. Recognit. 23(5), 485–495 (1990)
Amin, A.: “Arabic Character Recognition (1997) In: Handbook of Character Recognition and Document Image Analysis—(Chapter 15). Edited by H. Bunke and P. S. P. Wang. World Scientific. Singapore. pp. 397–420
Zahour, A., Taconet, B., Mercy, P., Ramdane, S.: Arabic hand-written text-line extraction. 6th International Conference on Document Analysis and Recognition, pp. 281–285. Seattle, Washington (2001)
Al-Yousefi H., Udpa S.S.: Recognition of Arabic characters. IEEE Trans. Patt. Anal. Mach. Intell. 14(8), 853–857 (1992)
El Gowely, K., Dessouki, I., Nazif, A.: Multi-phase recognition of multi font photoscript Arabic text. 10th International Conference on Pattern Recognition ICPR, vol. 1, pp. 700–702. Atlantic City, New Jersy (1990)
Tolba, M.F., Shaddad, E.: On the automatic reading of printed Arabic characters. IEEE International Conference on Systems Man and Cybernetics, pp. 496–498. Los Angeles (1990)
Cheung, A., Bennamoun, M., Bergmann, N.W.: Implementation of a statistical based Arabic character recognition system. IEEE Region 10 Annual Conference on Speech and Image Technologies for Computing and Telecommunications, vol. 2. pp. 531–534. Brisbane, Australia (1997)
Abuhaiba I., Mahmoud S., Green R.: Recognition of handwritten cursive Arabic characters. IEEE Trans. Patt. Anal. Mach. Intell. 16(6), 664–672 (1994)
Ben Amara, N., Ellouze, N. A Robust approach for Arabic printed character segmentation 3 rd International Conference on Document Analysis and Recognition, vol. 2, 865–868. Montreal (1995)
Abelazim, H., Hashish, M.: Arabic reading machine. 10th Saudi National Computer Conference. Riyadh, Saudi Arabia, pp. 733–743 (1988)
Abelazim H., Hashish M.: Automatic reading of bilingual typewritten text. Proc. CompEuro’ 89(VLSI and Computer Peripherals. Hamburg. 2), 140–144 (1989)
Margner, V.: SARAT-a system for the recognition of Arabic printed text. 11th International Conference on Pattern Recognition Methodology and Systems, vol. 2, Conference B, pp. 561–564 (1992)
Liangrui, P., Changsong, L., Xiaoqing, D., Hua, W.: Multilingual document recognition research and its application in China. Second International Conference on Document Image Analysis for Libraries, pp. 126–132 (2006)
Mehran, R., Pirsiavash, H., Razzazi, F.: A front-end OCR for Omni-font Persian/Arabic cursive printed documents. In: Proceedings of the Digital Imaging Computing: Techniques and Applications, pp. 56–60 (2005)
Sari, T., Souici, S.M.: Off-line handwritten Arabic character segmentation algorithm: ACSA. In: Proceedings of International Workshop Frontiers in Handwriting Recognition, pp. 452–457 (2002)
Lam L., Suen Y.: Thinning methodologies—a comprehensive survey. IEEE Trans. Patt. Anal. Mach. Intell. 14(9), 869–885 (1992)
Cowell, J., Hussain, F.: Thinning Arabic characters for feature extraction. In: Proceedings Fifth International Conference on Information Visualization, pp. 181–185 (2001)
Goraine H., Usher M., Al-Emami S.: Off-line Arabic character recognition. Computer 25(7), 71–74 (1992)
Narima, Z., Messaoud, R., Mouldi, B. Neuro-Markovian hybrid system for handwritten Arabic word recognition In: Proceedings of the 10th IEEE International Conference on Electronics, Circuits and Systems, vol. 2, 878–881
Hosseini, H.M.M., Bouzerdoum, A.: A combined method for Persian and Arabic handwritten digit recognition. Australian and New Zealand Conference on Intelligent Information Systems, pp 80–83 (1996)
Mozaffari, S., Faez, K., Ziaratban, M. Structural decomposition and statistical description of Farsi/Arabic handwritten numeric characters Eighth International Conference on Document Analysis and Recognition, vol. 1, 237–241 (2005)
Blumenstein, M., Verma, B.: A segmentation algorithm used in conjunction with artificial neural networks for the recognition of real-world postal addresses. International Conference on Computational Intelligence and Multimedia Applications (ICCIMA ’97), pp. 155–160. Gold Coast, Australia (1997)
Eastwood, B., Jennings, A., Harvey, A.: A feature based neural network segmenter for handwritten words. In: Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA ’97), pp. 286–290. Gold Coast, Australia (1997)
Lee, S.-W., Lee, D.-J., Park, H.-S.: A new methodology for gray-scale character segmentation and recognition. IEEE Trans. Patt. Anal. Mach. Intell. 1045–1051 (1996)
Han, K., Sethi, I. K.: “Off-line Cursive Handwriting Segmentation”, pp. 894–897. ICDAR ‘95, Montreal, Canada (1995)
Srihari, S. N.: Recognition of handwritten and machine-printed text for postal address interpretation. Patt. Recognit. Lett. 291–302
Xiu, P., Peng, L., Ding, X. (2006) Multi-queue merging scheme and its applications in Arabic script segmentation. In: Proceedings of the Second International Conference on Document Image Analysis of Libraries, pp. 24–29
Zidouri, A.: ORAN: a basis for an Arabic OCR system. International Symposium on Intelligent Multimedia, Video and Speech Processing, pp. 703–706 (2004)
Rabiner, L.R., Levinson, S.E.: A speaker-independent, syntax-directed, connected word recognition system based on hidden Markov model and level building. IEEE Trans. Audio, Speech Signal Process. 33(3) (1985)
Afify, M.A.: Large Vocabulary Continuous Arabic Speech Recognition. Ph.D. Thesis, Faculty of Engineering. Cairo University (1995)
Brugnara, F., Faiavigna, D., Omologo, M.: Automatic Segmentation and Labeling of Speech Based on Hidden Markov Model. Speech Communication 12. North Holland (1993)
Rabiner L.R., Wilpon J.G., Soong F.K.: High performance connected digit recognition using hidden Markov model. IEEE Trans. Audio, Speech Signal Process. 37(8), 1214–1225 (1989)
LaPre, C., Ying, Z., Raphael, C., Schwartz, R., Makhoul, J. Multi-font recognition of printed Arabic using the BBN BYBLOS speech recognition system. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 4, 2136–2139 (1996)
El-Hajj, R., Likforman-Sulem, L., Mokbel, C. Arabic handwriting recognition using baseline dependant features and hidden Markov modeling. Eighth International Conference on Document Analysis and Recognition, vol. 2, pp. 893–897 (2005)
Rashwan, M.A.: A new OCR system similar to ASR system. The 10th International Conference on Computing and Information, ICCI 2000, Kuwait (2000)
Rashwan, M., Fakhr, M., Attia, M., El-Mahallawy, M., Arabic OCR System analogous to HMM-based ASR systems; implementation and evaluation. J. Eng. Appl. Sci. Cairo University, www.Journal.eng.CU.edu.eg, Apr. 2008
www.itp.net/Arabic, “OCR software, group test,“,,
Alma’adeed S., Higgens C., Elliman D.: Recognition of off-line handwritten Arabic words using hidden Markov model approach. 16th International Conference on Pattern Recognition, vol. 3, pp 481–484 (2002)
Bourouba, H., Bedda, M.: Hybrid approach DTW/HMMC for the recognition of the isolated Arabic words. International Conference on Information and Communication Technologies: From Theory to Applications, pp. 481–482 (2004)
Pechwitz, M., Maergner, V.: HMM based approach for handwritten Arabic word recognition using the IFN/ENIT—database. Seventh International Conference on Document Analysis and Recognition pp. 890–894 (2003)
Dehghan M., Faez K., Ahmadi M., Shridhar M.: Holistic handwritten word recognition using discrete HMM and self-organizing feature map. IEEE Int. Conf. Syst. Man Cybern. 4, 2735–2739 (2000)
Bushofa B., Spann M.: Segmentation and recognition of Arabic characters by structural classification. Image Vis. Comput. (IVC) 15(3), 167–179 (1997)
Touj, S., Ben Amara, N., Amiri, H.: Two approaches for Arabic script recognition-based segmentation using the Hough transform. Ninth International Conference on Document Analysis and Recognition, vol. 2, pp. 654–658. ICDAR 2007 (2007)
Broumandnia, A., Shanbehzadeh, J., Nourani, M.: Segmentation of printed Farsi/Arabic words. International Conference on Computer Systems and Applications, pp. 761–766. Amman, Jordon (2007)
Broumandnia A., Shanbehzadeh J.: Fast Zernike wavelet moments for Farsi character recognition. Image Vis. Comput. 25, 717–726 (2007)
Cheung A., Bennamouon M., Bergmann N.W.: An Arabic optical character recognition system using recognition-based segmentation. Patt. Recognit. 34, 215–233 (2001)
Almuallim H., Yamaguchi S.: A method for recognition of Arabic cursive handwriting. IEEE Trans. Patt. Anal. Mach. Intell. PAMI- 9(5), 715–722 (1987)
Ramsis, R., El-Dabi, S.S., Kamel, A.: Arabic character recognition system, IBM Kuwait Scientific Centre, report No. KSC027 (1988)
Zahour A., Taconet B., Faure A.: Machine recognition of Arabic cursive writing. In: Impedovo, S., Simon, J.C. (eds) From Pixels to Features III: Frontiers in Handwriting Recognition, pp. 289–296. Elsevier Science Publishers B.V., Amsterdam (1992)
Abuhaiba, I.S.I.: Recognition of Off-Line Handwritten Cursive Text,” Ph.D. thesis, Department of Electronic and Electrical Engineering, Loughborough University, Loughborough, UK (1996)
Erlandson, E., Trenkle, J., Vogt, R.: Word-level recognition of multi-font Arabic text using a feature vector matching approach. In: Proceedings of the International Society for Optical Engineers, SPIE, vol. 2660, pp. 63–70 (1996)
Amin, A.: Recognition of printed Arabic text suing machine learning. In: Proceedings of the International Society for Optical Engineers, SPIE, vol. 3305, pp. 63–70 (1998)
Clocksin, W.F., Khorsheed, M.S.: Word recognition in Arabic handwriting. In: Proceedings of International Conference on Artificial Intelligence Applications. ICAIA. Egypt (2000)
Khorsheed, M.S., Clocksin, W.F.: Spectral features for Arabic word recognition. IEEE International Conference on Acoustics. Speech and Signal Processing, ICASP, Turkey (2000)
Khorsheed, M.S., Clocksin, W.F.: Structural features of cursive Arabic script. In: Proceedings Of British Conference on Machine Vision, pp. 422–431 (1999)
Khorsheed M.S., Clocksin W.F.: Multi-font Arabic word recognition using spectral features. 15th International Conference on Pattern Recognition 4, 543–546 (2000)
Tse, E., Bigun, J.: A Base-line character recognition for Syriac-Aramaic. IEEE International Conference on Systems, Man and Cybernetics, pp. 1048–1055 (2007)
Dehghan, M., Faez, K., Ahmadi, M., Shridhar, M.: Off-line unconstrained Farsi handwritten word recognition using fuzzy vector quantization and hidden Markov word models. In: Proceedings of the 15th International Conference on Pattern Recognition, vol. 2, pp. 351–354 (2000)
Nadeem Ahmad Khan: A shape Analysis Model with Application to Character and Word Recognition. Technische Universiteit Eindhoven. Proefschrift. ISBN 90-386-1750-X (2000)
Casey R.G., Lecolinet E.: A survey of methods and strategies in character segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 18(7), 690–706 (1996)
Chen, C.H., DeCurtins, J.L.: Word recognition in a segmentation-free approach to OCR. Second International Conference on Document Analysis and Recognition, pp. 573–576 (1993)
Shaikh, N.A., Shaikh, Z.A., Ali, G.: Segmentation of Arabic text into characters for recognition. IMTIC 2008, CCIS 20, pp. 11–18 (2008)
Zidouri A.: On multiple typeface Arabic script recognition. Res. J. Appl. Sci. Eng. Technol. 2(5), 428–435 (2010)
Shirali-Shahreza, S., Manzuri-Shalmani, M.T., Shirali-Shahreza, M.: A skew resistant method for Persian text recognition. In: Proceedings for the IEEE Symposium on Computational Intelligence in Image and Signal Processing, pp. 115–120 (2007)
Khorsheed M.S.: Off-line recognition of Omnifont Arabic text using the HMM ToolKit (HTK). Patt. Recognit. Lett. 28(12), 1563–1571 (2007)
Boubaker, H., El Baati, A., Kherallah, M., Alimi, A.M., Elabed, H.: Online Arabic handwriting modeling system based on the graphemes segmentation. 20th International Conference on Pattern Recognition (ICPR), pp. 2061–2064 (2010)
Moussa S.B., Zahour A., Benabdelhafid A., Adel M.A.: New features using fractal multi-dimensions for generalised Arabic font recognition. Patt. Recognit. Lett. 31(5), 361–371 (2010)
Slimane, F., Kanoun, S., Alimi, A.M., Ingold, R., Hennebert, J.: Gaussian mixture models for Arabic font recognition. 20th International Conference on Pattern Recognition (ICPR), pp. 2174–2177 (2010)
Khosravi H., Kabir E.: Farsi font recognition based on Sobel-Roberts features. Patt. Recognit. Lett. 31, 75–82 (2010)
Boussellaa, W., Zahour, A., Elabed, H., Benabdelhafid, A., Alimi A.: Unsupervised block covering analysis for text-line segmentation of Arabic ancient handwritten document images. 20 th International Conference on Pattern Recognition (ICPR), pp. 1929–1932 (2010)
Al Hamad H., Abu Zitar R.: Development of an efficient neural-based segmentation technique for Arabic handwriting recognition. Patt. Recognit. 43, 2773–2798 (2010)
Saeed K., Albakoor M.: Region growing based segmentation algorithm for typewritten and handwritten text recognition. Appl. Soft Comput. 9, 608–617 (2009)
Harouni M., Mohamad D., Rasouli A.: Deductive method for recognition of on-line handwritten Persian/Arabic characters. The 2nd International Conference on Computer and Automation Engineering (ICCAE) 5, 791–795 (2010)
Omer M.A.H., Shilong M.: Recognition online Arabic pattern. 3 rd International Conference on Advanced Computer Theory and Engineering (ICACTE) 6, 18–22 (2010)
Khan, K.U., Haider, I.: Online recognition of multi-stroke handwritten Urdu characters. International Conference on Image Analysis and Signal Processing (IASP), pp. 284–290 (2010)
Shirali-Shahreza, M., Shirali-Shahreza, S.: Persian/Arabic text font estimation using dots. In: Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, pp. 420–425 (2006)
Altec, “Character Recognition Systems Overview”, viewed on 13/2/2012. http://www.alteccenter.org/page.php?pg=filesrepositry/getRepository.php&main_cat=1&sub_cat=24
Ashraf, A., Colin A.H., Mahmoud, K. (2008) A database for Arabic printed character recognition. International Conference on Image Analysis and Recognition, ICIAR 2008, pp. 567–578
Slimane, F., Ingold, R., Kanoun, S., Alimi, M.A., Hennebert, J.: Database and evaluation protocols for Arabic printed text recognition. Internal Research Report (DIUF). University of Fribourg. Switzerland. Obtained from: http://diuf.unifr.ch/diva/APTI/publications.html (2009)
Schlosser, S.: ERIM Arabic Database. Document Processing Research Program, Information and Materials Applications Laboratory. Environmental Research Institute of Michigan (1995)
Slimane, F., Ingold, R., Kanoun, S., Alimi, M.A., Hennebert, J.: A new Arabic printed text image database and evaluation protocols. In: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR 2009), pp. 946–950. Barcelona (Spain) (2009)
Altex, http://www.ALTEC-Center.org, viewed on 13/2/2012
Beesley, K.R.: Arabic finite-state morphological analysis and generation. In: COLING-96 Proceedings, Copenhagen, vol. 1. pp. 89–94 (1996)
Ramzi Abbes, J.D., Hassoun, M.: The architecture of a standard Arabic lexical database. Some figures, ratios and categories from the DIINAR.1 Source program. In: Workshop of Computational Approaches to Arabic Script-based Languages, Geneva, Switzerland (2004)
David Graff K.C., Kong J., Maeda K.: Arabic Gigaword, 2nd edn. Linguistic Data Consortium. University of Pennsylvania, Philadelphia (2006)
Bell, J., Zemanek, P.: Test of two Arabic OCR programs. Manuscr.Orient. Int. J. Orient. Manuscr. Res. 1(3), 55–57. http://www.islamicmanuscripts.info/.../Bell-Zemanek-1995-MO-1-3-Test-O.PDF (1995)
Kanungo, T., Marton, G.E., Bulbul, O.: Performance evaluation of two Arabic OCR products. In: Proceedings of AIPR Workshop on Advances in Computer Assisted Recognition. SPIE vol. 3584, Washington, DC (1998)
Kanungo, T., Marton, G.A., Bulbul, O.: OmniPage versus Sakhr: paired model evaluation of two Arabic OCR products. In: Proceedings of SPIE Conference on Document Recognition, San Jose, CA vol. 3651, pp. 109–120 (1999)
Aramedia, The Best Arabic OCR Technology, viewed on 10th of June, 2010, http://aramedia.com/ocr.htm
Al-Ohali Y., Cheriet M., Suen Ching: Databases for recognition of handwritten Arabic cheques. Patt. Recognit. 36(1), 111–121 (2003)
Al-Ohali Y.: Handwritten word recognition: application to Arabic cheque processing. Department of Computer Science, Concordia University, Montreal (2002)
Fujisawa, H.: Forty years of research in character and document recognition—an industrial perspective. Patt. Recognit. Vol. 41, pp. 2435–2446 (2008)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Alginahi, Y.M. A survey on Arabic character segmentation. IJDAR 16, 105–126 (2013). https://doi.org/10.1007/s10032-012-0188-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-012-0188-6