Keyword spotting via word shape recognition
30 March 1995
Jeff L. DeCurtins, Edward C. Chen
Proceedings Volume 2422, Document Recognition II; (1995) https://doi.org/10.1117/12.205829
Event: IS&T/SPIE's Symposium on Electronic Imaging: Science and Technology, 1995, San Jose, CA, United States
Abstract
With the advent of on-line access to very large collections of document images, electronic classification into areas of interest has become possible. A first approach to classification might be to run OCR on each document and then analyze the resulting ASCII text. But if the quality of a document is poor, the format is unconstrained, or time is critical, complete OCR of each image is not appropriate. An alternative approach is to use word shape recognition (as opposed to individual character recognition) and then classify documents by the presence or absence of selected keywords. Word shape recognition not only provides a more robust collection of features but also eliminates the need for character segmentation (a leading cause of error in OCR). In this paper we describe a system we have developed for detecting isolated words, word portions, and multi-word phrases in images of documents. It is designed to be used with large, changeable keyword sets and very large document sets. The system provides for automated training of desired keywords and for the creation of indexing filters to speed matching.
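The abstract does not say which shape features or indexing filters the system uses, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes each word has already been cropped to a binary image (rows of 0/1 pixels, 1 = ink), describes the word by its upper and lower contour profiles, and uses an aspect-ratio check as a stand-in for a cheap filter that runs before the more expensive shape comparison. All function names, features, and thresholds here are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Sequence, Tuple

Image = Sequence[Sequence[int]]  # rows of 0/1 pixels, 1 = ink (assumed input format)


def profiles(word: Image) -> Tuple[List[float], List[float]]:
    """Per-column gap from the top and from the bottom to the nearest ink pixel,
    as a fraction of image height. Columns without ink get 1.0."""
    height, width = len(word), len(word[0])
    upper, lower = [], []
    for x in range(width):
        ink_rows = [y for y in range(height) if word[y][x]]
        if ink_rows:
            upper.append(ink_rows[0] / height)
            lower.append((height - 1 - ink_rows[-1]) / height)
        else:
            upper.append(1.0)
            lower.append(1.0)
    return upper, lower


def resample(values: List[float], n: int) -> List[float]:
    """Nearest-neighbour resampling to n samples, so words of different
    widths yield vectors of the same length."""
    m = len(values)
    return [values[min(m - 1, (i * m) // n)] for i in range(n)]


def shape_vector(word: Image, n: int = 16) -> List[float]:
    """Fixed-length word shape descriptor: upper profile followed by lower profile."""
    upper, lower = profiles(word)
    return resample(upper, n) + resample(lower, n)


def aspect_ratio(word: Image) -> float:
    return len(word[0]) / len(word)


def distance(a: List[float], b: List[float]) -> float:
    """Euclidean distance between two shape descriptors."""
    return sum((p - q) ** 2 for p, q in zip(a, b)) ** 0.5


def spot(word: Image, templates: Dict[str, Image],
         ratio_tol: float = 0.25, max_dist: float = 0.5) -> List[str]:
    """Return the names of keyword templates whose shape matches the word image."""
    vec, ratio = shape_vector(word), aspect_ratio(word)
    hits = []
    for name, tmpl in templates.items():
        # Cheap filter first: skip templates with a very different aspect ratio.
        if abs(aspect_ratio(tmpl) - ratio) > ratio_tol * ratio:
            continue
        # Full comparison only for templates that survive the filter.
        if distance(vec, shape_vector(tmpl)) <= max_dist:
            hits.append(name)
    return hits


def contains_keyword(word_images: Sequence[Image], templates: Dict[str, Image]) -> bool:
    """Flag a document (a list of word images) if any word matches any keyword."""
    return any(spot(w, templates) for w in word_images)
```

In this sketch no character segmentation takes place: each word image is compared as a whole against the keyword templates, and `contains_keyword` reduces a document to the presence or absence of the selected keywords. A practical system would need far richer features and trained thresholds than the fixed values assumed here.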
© (1995) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jeff L. DeCurtins and Edward C. Chen "Keyword spotting via word shape recognition", Proc. SPIE 2422, Document Recognition II, (30 March 1995); https://doi.org/10.1117/12.205829
CITATIONS
Cited by 37 scholarly publications.
KEYWORDS
Image segmentation
Optical character recognition
Sensors
Image processing
Feature extraction
Image classification
Detector development

RELATED CONTENT

Locally adaptive document skew detection
Proceedings of SPIE (April 03 1997)
Word recognition in a segmentation-free approach to OCR
Proceedings of SPIE (February 25 1994)
Key-text spotting in documentary videos using Adaboost
Proceedings of SPIE (February 17 2006)
Script identification of handwritten word images
Proceedings of SPIE (January 19 2009)