research-article

Robust Seed Localization and Growing with Deep Convolutional Features for Scene Text Detection

Authors:
Hailiang Xu

Nanjing University, Nanjing, China

Nanjing University, Nanjing, China
View Profile

,
Feng Su

Nanjing University, Nanjing, China

Nanjing University, Nanjing, China
View Profile

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia RetrievalJune 2015Pages 387–394https://doi.org/10.1145/2671188.2749370

Published:22 June 2015Publication History

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

Pages 387–394

ABSTRACT

Text detection in natural scene images is an open and challenging problem due to the significant variations of the appearance of the text itself and its interaction with the context. In this paper, we present a novel text detection method based on robust localization and adaptive growing of seed text components. The method consists of two main ingredients. First, convolutional neural network is exploited to localize seed candidate characters from the maximally stable extremal regions of the image with learned discriminative deep convolutional features. Next, an iterative and adaptive growing algorithm is employed to grow from seed characters to search for other degraded text components in same text line based on their conformity to the seed, and an associative quality is learned to measure the conformity combining both the geometric and appearance constraints between two neighbouring text components. The effectiveness of the proposed method is demonstrated by the state-of-the-art results achieved on the public datasets.

References

X. Chen and A. L. Yuille. Detecting and reading text in natural scenes. In CVPR, pages II-366-II-373 Vol.2, 2004. Google ScholarDigital Library
B. Epshtein, E. Ofek, and Y. Wexler. Detecting text in natural scenes with stroke width transform. In CVPR, pages 2963--2970, 2010.Google ScholarCross Ref
W. Huang, Y. Qiao, and X. Tang. Robust scene text detection with convolution neural network induced mser trees. In ECCV, pages 497--511, 2014.Google ScholarCross Ref
M. Jaderberg, A. Vedaldi, and A. Zisserman. Deep features for text spotting. In ECCV, pages 512--528, 2014.Google ScholarCross Ref
D. Karatzas, F. Shafait, S. Uchida, M. Iwamura, L. G. i Bigorda, S. R. Mestre, J. Mas, D. F. Mota, J. A. Almazan, and L. P. de las Heras. ICDAR 2013 robust reading competition. In ICDAR, pages 1484--1493, 2013. Google ScholarDigital Library
H. I. Koo and D. H. Kim. Scene text detection via connected component clustering and nontext filtering. IEEE Trans. Image Processing, 22(6):2296--2305, June 2013. Google ScholarDigital Library
S. M. Lucas. ICDAR 2005 text locating competition results. In ICDAR, pages 80--84 Vol.1, 2005. Google ScholarDigital Library
S. M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong, and R. Young. ICDAR 2003 robust reading competitions. In ICDAR, pages 682--687, 2003. Google ScholarDigital Library
R. Minetto, N. Thome, M. Cord, J. Stolfi, F. Precioso, J. Guyomard, and N. Leite. Text detection and recognition in urban scenes. In ICCVW, pages 227--234, 2011.Google ScholarCross Ref
A. Mishra, K. Alahari, and C. V. Jawahar. Top-down and bottom-up cues for scene text recognition. In CVPR, pages 2687--2694, 2012. Google ScholarDigital Library
A. Mosleh, N. Bouguila, and A. B. Hamza. Image text detection using a bandlet-based edge detector and stroke width transform. In BMVC, pages 63.1--63.12, 2012.Google ScholarCross Ref
L. Neumann and J. Matas. Real-time scene text localization and recognition. In CVPR, pages 3538--3545, 2012. Google ScholarDigital Library
L. Neumann and J. Matas. Scene text localization and recognition with oriented stroke detection. In ICCV, pages 97--104, 2013. Google ScholarDigital Library
Y.-F. Pan, X. Hou, and C.-L. Liu. A hybrid approach to detect and localize texts in natural scene images. IEEE Trans. Image Processing, 20(3):800--813, March 2011. Google ScholarDigital Library
A. Shahab, F. Shafait, and A. Dengel. ICDAR 2011 robust reading competition challenge 2: Reading text in scene images. In ICDAR, pages 1491--1496, 2011. Google ScholarDigital Library
K. Wang, B. Babenko, and S. Belongie. End-to-end scene text recognition. In ICCV, pages 1457--1464, 2011. Google ScholarDigital Library
K. Wang and S. Belongie. Word spotting in the wild. In ECCV, pages 591--604, 2010. Google ScholarDigital Library
T. Wang, D. J. Wu, A. Coates, and A. Y. Ng. End-to-end text recognition with convolutional neural networks. In ICPR, pages 3304--3308, 2012.Google Scholar
X. Wang, Y. Song, and Y. Zhang. Natural scene text detection with multi-channel connected component segmentation. In ICDAR, pages 1375--1379, 2013. Google ScholarDigital Library
X.-C. Yin, X. Yin, K. Huang, and H.-W. Hao. Robust text detection in natural scene images. IEEE Trans. PAMI, 36(5):970--983, May 2014.Google ScholarCross Ref
J. Zhang and R. Kasturi. A novel text detection system based on character and link energies. IEEE Trans. Image Processing, 23(9):4187--4198, Sep. 2014.Google Scholar

Index Terms

Robust Seed Localization and Growing with Deep Convolutional Features for Scene Text Detection
1. Information systems
  1. Information retrieval
    1. Document representation

Recommendations

Multi-Lingual Scene Text Detection Using One-Class Classifier

The main purpose of scene text recognition is to detect texts in a given image. The problem of text detection and recognition in such images has gained great attention in recent years due to rising demand of several applications like visual based ...
Read More
An enhanced text detection technique for the visually impaired to read text

An enhanced text detection technique (ETDT) is proposed, which is expected to aid the visually impaired to overcome their reading challenges. This work enhances the edge-preserving maximally stable extremal regions (eMSER) algorithm using the pyramid ...
Read More
Scene text detection method research based on maximally stable extremal regions

Text information is an important basis for people to understand the natural scene image. At first, an edge-enhanced maximally stable extremal regions (MSER) text detection method based on weighted guided filtering and histograms of oriented gradients (HOG)...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval
June 2015
700 pages
ISBN:9781450332743
DOI:10.1145/2671188
General Chairs:
Alex Hauptmann
Carnegie Mellon University, USA
,
Chong-Wah Ngo
City University of Hong Kong, China
,
Xiangyang Xue
Fudan University, China
,
Program Chairs:
Yu-Gang Jiang
Fudan University, China
,
Cees Snoek
University of Amsterdam and Qualcomm Research Netherlands
,
Nuno Vasconcelos
University of California, San Diego, USA
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 22 June 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
convolutional neural network
mser
natural scene image
swt
text detection
Qualifiers
- research-article
Conference

Acceptance Rates
ICMR '15 Paper Acceptance Rate48of127submissions,38%Overall Acceptance Rate254of830submissions,31%
More
Upcoming Conference
ICMR '24

Sponsor:

sigmm

International Conference on Multimedia Retrieval

June 10 - 14, 2024

Phuket , Thailand
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 240
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Robust Seed Localization and Growing with Deep Convolutional Features for Scene Text Detection

ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Multi-Lingual Scene Text Detection Using One-Class Classifier

An enhanced text detection technique for the visually impaired to read text

Scene text detection method research based on maximally stable extremal regions