article

Free Access

Automatic Document Classification Part II . Additional Experiments

Authors:
Harold Borko

System Development Corporation, Santa Monica, California

System Development Corporation, Santa Monica, California
View Profile

,
Myrna Bernick

System Development Corporation, Santa Monica, California

System Development Corporation, Santa Monica, California
View Profile

Authors Info & Claims

Journal of the ACM Volume 11 Issue 2pp 138–151https://doi.org/10.1145/321217.321219

Published:01 April 1964Publication History

Journal of the ACM

Abstract

This study reports the results of a series of experiments in the techniques of automatic document classification. Two different classification schedules are compared along with two methods of automatically classifying documents into categories. It is concluded that, while there is no significant difference in the predictive efficiency between the Bayesian and the Factor Score methods, automatic document classification is enhanced by the use of a factor-analytically-derived classification schedule. Approximately 55 percent of the document were automatically and correctly classified.

References

1 Institute Radio Engineers, 1959. Abstracts of current computer literature. IRE Trans. EC-8, 1, 2, and 3.Google Scholar
2 BORKO, I-I. The construction of an empirically based mathematically derived classification system. Proc. Spring Joint Comput. Conf. 21 (1962), 279-289.Google Scholar
3 BORKO, H., AND BERNICK, M. Automatic document classification. J. ACM, 10 (1963), 151-102. Google Scholar
4 FRUCHTER, B., AND JENNINGS, E. Factor analysis no. 1. In H. BORKO (Ed.), Computer Applications in the Behavioral Sciences, Prentice-Hall, Englewood Cliffs, N. J., 1962.Google Scholar
5 HARMAN, H. H. Modern Factor Analysis. U. of Chicago Press, Chicago, Ill., 1960.Google Scholar
6 LUHN, H. F. A statistical approach to mechanized encoding and searching of literary information. IBM J. Res. Develop. i (1957), 309-317.Google Scholar
7 MARON, M. E. Automatic indexing: an experimental inquiry. J. ACM 8 (1961), 407-4t7. Google Scholar
8 OLNAY, J. C. FEAT, an inventory program for information retrieval. FN-4018, System Development Corp., Santa Moniea, Calif., 1960.Google Scholar

Index Terms

Automatic Document Classification Part II . Additional Experiments

Recommendations

Automatic document classification: natural language processing, statistical analysis, and expert system techniques used together
SIGIR '92: Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval

In this paper we describe an automated method of classifying research project descriptions: a human expert classifies a sample set of projects into a set of disjoint and pre-defined classes, and then the computer learns from this sample how to classify ...
Read More
Automatic office document classification and information extraction
Read More
Semi-automatic document classification: exploiting document difficulty
ECIR'12: Proceedings of the 34th European conference on Advances in Information Retrieval

There are circumstances where classification is required only if a certain condition, such a specific level of quality, is met. This paper investigates a semi-automatic solution where only the predictions for the documents which are more likely to be ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

Journal of the ACM Volume 11, Issue 2
April 1964
135 pages
ISSN:0004-5411
EISSN:1557-735X
DOI:10.1145/321217
Issue’s Table of Contents

Copyright © 1964 ACM
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 April 1964
Published in jacm Volume 11, Issue 2

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 19
  Total Citations
  View Citations
- 678
  Total Downloads
- Downloads (Last 12 months)43
- Downloads (Last 6 weeks)6
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Automatic Document Classification Part II . Additional Experiments

Journal of the ACM

Abstract

References

Cited By

Index Terms

Recommendations

Automatic document classification: natural language processing, statistical analysis, and expert system techniques used together

Automatic office document classification and information extraction

Semi-automatic document classification: exploiting document difficulty

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Automatic Document Classification Part II . Additional Experiments

Journal of the ACM

Abstract

References

Cited By

Index Terms

Recommendations

Automatic document classification: natural language processing, statistical analysis, and expert system techniques used together

Automatic office document classification and information extraction

Semi-automatic document classification: exploiting document difficulty

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media