ABSTRACT
Text detection in natural scene images is an open and challenging problem due to the significant variations of the appearance of the text itself and its interaction with the context. In this paper, we present a novel text detection method based on robust localization and adaptive growing of seed text components. The method consists of two main ingredients. First, convolutional neural network is exploited to localize seed candidate characters from the maximally stable extremal regions of the image with learned discriminative deep convolutional features. Next, an iterative and adaptive growing algorithm is employed to grow from seed characters to search for other degraded text components in same text line based on their conformity to the seed, and an associative quality is learned to measure the conformity combining both the geometric and appearance constraints between two neighbouring text components. The effectiveness of the proposed method is demonstrated by the state-of-the-art results achieved on the public datasets.
- X. Chen and A. L. Yuille. Detecting and reading text in natural scenes. In CVPR, pages II-366-II-373 Vol.2, 2004. Google ScholarDigital Library
- B. Epshtein, E. Ofek, and Y. Wexler. Detecting text in natural scenes with stroke width transform. In CVPR, pages 2963--2970, 2010.Google ScholarCross Ref
- W. Huang, Y. Qiao, and X. Tang. Robust scene text detection with convolution neural network induced mser trees. In ECCV, pages 497--511, 2014.Google ScholarCross Ref
- M. Jaderberg, A. Vedaldi, and A. Zisserman. Deep features for text spotting. In ECCV, pages 512--528, 2014.Google ScholarCross Ref
- D. Karatzas, F. Shafait, S. Uchida, M. Iwamura, L. G. i Bigorda, S. R. Mestre, J. Mas, D. F. Mota, J. A. Almazan, and L. P. de las Heras. ICDAR 2013 robust reading competition. In ICDAR, pages 1484--1493, 2013. Google ScholarDigital Library
- H. I. Koo and D. H. Kim. Scene text detection via connected component clustering and nontext filtering. IEEE Trans. Image Processing, 22(6):2296--2305, June 2013. Google ScholarDigital Library
- S. M. Lucas. ICDAR 2005 text locating competition results. In ICDAR, pages 80--84 Vol.1, 2005. Google ScholarDigital Library
- S. M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong, and R. Young. ICDAR 2003 robust reading competitions. In ICDAR, pages 682--687, 2003. Google ScholarDigital Library
- R. Minetto, N. Thome, M. Cord, J. Stolfi, F. Precioso, J. Guyomard, and N. Leite. Text detection and recognition in urban scenes. In ICCVW, pages 227--234, 2011.Google ScholarCross Ref
- A. Mishra, K. Alahari, and C. V. Jawahar. Top-down and bottom-up cues for scene text recognition. In CVPR, pages 2687--2694, 2012. Google ScholarDigital Library
- A. Mosleh, N. Bouguila, and A. B. Hamza. Image text detection using a bandlet-based edge detector and stroke width transform. In BMVC, pages 63.1--63.12, 2012.Google ScholarCross Ref
- L. Neumann and J. Matas. Real-time scene text localization and recognition. In CVPR, pages 3538--3545, 2012. Google ScholarDigital Library
- L. Neumann and J. Matas. Scene text localization and recognition with oriented stroke detection. In ICCV, pages 97--104, 2013. Google ScholarDigital Library
- Y.-F. Pan, X. Hou, and C.-L. Liu. A hybrid approach to detect and localize texts in natural scene images. IEEE Trans. Image Processing, 20(3):800--813, March 2011. Google ScholarDigital Library
- A. Shahab, F. Shafait, and A. Dengel. ICDAR 2011 robust reading competition challenge 2: Reading text in scene images. In ICDAR, pages 1491--1496, 2011. Google ScholarDigital Library
- K. Wang, B. Babenko, and S. Belongie. End-to-end scene text recognition. In ICCV, pages 1457--1464, 2011. Google ScholarDigital Library
- K. Wang and S. Belongie. Word spotting in the wild. In ECCV, pages 591--604, 2010. Google ScholarDigital Library
- T. Wang, D. J. Wu, A. Coates, and A. Y. Ng. End-to-end text recognition with convolutional neural networks. In ICPR, pages 3304--3308, 2012.Google Scholar
- X. Wang, Y. Song, and Y. Zhang. Natural scene text detection with multi-channel connected component segmentation. In ICDAR, pages 1375--1379, 2013. Google ScholarDigital Library
- X.-C. Yin, X. Yin, K. Huang, and H.-W. Hao. Robust text detection in natural scene images. IEEE Trans. PAMI, 36(5):970--983, May 2014.Google ScholarCross Ref
- J. Zhang and R. Kasturi. A novel text detection system based on character and link energies. IEEE Trans. Image Processing, 23(9):4187--4198, Sep. 2014.Google Scholar
Index Terms
- Robust Seed Localization and Growing with Deep Convolutional Features for Scene Text Detection
Recommendations
Multi-Lingual Scene Text Detection Using One-Class Classifier
The main purpose of scene text recognition is to detect texts in a given image. The problem of text detection and recognition in such images has gained great attention in recent years due to rising demand of several applications like visual based ...
An enhanced text detection technique for the visually impaired to read text
An enhanced text detection technique (ETDT) is proposed, which is expected to aid the visually impaired to overcome their reading challenges. This work enhances the edge-preserving maximally stable extremal regions (eMSER) algorithm using the pyramid ...
Scene text detection method research based on maximally stable extremal regions
Text information is an important basis for people to understand the natural scene image. At first, an edge-enhanced maximally stable extremal regions (MSER) text detection method based on weighted guided filtering and histograms of oriented gradients (HOG)...
Comments