On Relevance, Probabilistic Indexing and Information Retrieval

Authors:
M. E. Maron, The RAND Corporation, Santa Monica, California
J. L. Kuhns, Ramo-Wooldridge, Canoga Park, California

Journal of the ACM, Volume 7, Issue 3, pp. 216–244
https://doi.org/10.1145/321033.321035

Published: 01 July 1960

Abstract

This paper reports on a novel technique for literature indexing and searching in a mechanized library system. The notion of relevance is taken as the key concept in the theory of information retrieval, and a comparative concept of relevance is explicated in terms of the theory of probability. The resulting technique, called “Probabilistic Indexing,” allows a computing machine, given a request for information, to make a statistical inference and derive a number (called the “relevance number”) for each document, which is a measure of the probability that the document will satisfy the given request. The result of a search is an ordered list of those documents that satisfy the request, ranked according to their probable relevance.
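The abstract does not state how the relevance number is computed, so the following Python sketch is only an illustration of the general idea of ranking by probable relevance. It assumes a Bayes-style score for a single-term request: each document carries a weight for each index term (read here as an estimate of the probability that a user wanting the document would use that term) and a prior probability of being wanted. The function name `rank_by_relevance`, the `weights` and `prior` tables, and the sample values are all hypothetical.

```python
from typing import Dict, List, Tuple


def rank_by_relevance(
    term: str,
    weights: Dict[str, Dict[str, float]],
    prior: Dict[str, float],
) -> List[Tuple[str, float]]:
    """Rank documents for a single-term request.

    Assumption (not taken from the abstract): the score of a document is
    proportional to prior[doc] * weights[doc][term], normalized over all
    documents indexed under the term.
    """
    scores: Dict[str, float] = {}
    for doc, doc_weights in weights.items():
        w = doc_weights.get(term, 0.0)
        if w > 0.0:  # only documents indexed under the requested term
            scores[doc] = prior.get(doc, 0.0) * w

    total = sum(scores.values())
    if total == 0.0:
        return []

    # Normalize so the scores over the selected documents sum to 1.
    ranked = [(doc, s / total) for doc, s in scores.items()]
    ranked.sort(key=lambda pair: pair[1], reverse=True)
    return ranked


# Hypothetical index: document -> {index term: weight}.
weights = {
    "d1": {"retrieval": 0.8, "indexing": 0.4},
    "d2": {"retrieval": 0.2},
    "d3": {"indexing": 0.9},
}
# Hypothetical prior probability that each document is the one wanted.
prior = {"d1": 0.5, "d2": 0.3, "d3": 0.2}

for doc, score in rank_by_relevance("retrieval", weights, prior):
    print(f"{doc}: {score:.3f}")
```

Under these made-up numbers, `d1` is listed ahead of `d2`, and `d3` is not selected because it is not indexed under the requested term. The output is an ordered list rather than an unordered set of matches, which is the contrast with conventional (all-or-nothing) indexing that the abstract draws.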

The paper goes on to show that whereas in a conventional library system the cross-referencing (“see” and “see also”) is based solely on the “semantical closeness” between index terms, statistical measures of closeness between index terms can be defined and computed. Thus, given an arbitrary request consisting of one (or many) index term(s), a machine can elaborate on it to increase the probability of selecting relevant documents that would not otherwise have been selected.
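As a rough illustration of a statistical measure of closeness between index terms, the Python sketch below estimates how often one term is assigned to a document given that another term is, and uses that estimate to elaborate a request. The abstract does not say which measures the paper defines, so the choice of a conditional co-occurrence frequency, the names `term_closeness` and `elaborate_request`, the threshold, and the sample index are all assumptions made for this example.

```python
from typing import Dict, Set


def term_closeness(index: Dict[str, Set[str]], given: str, other: str) -> float:
    """Fraction of documents indexed under `given` that are also indexed
    under `other` (an assumed closeness measure, estimated from counts)."""
    with_given = [terms for terms in index.values() if given in terms]
    if not with_given:
        return 0.0
    both = sum(1 for terms in with_given if other in terms)
    return both / len(with_given)


def elaborate_request(
    index: Dict[str, Set[str]], request: Set[str], threshold: float
) -> Set[str]:
    """Add every term whose closeness to some request term meets the threshold."""
    vocabulary: Set[str] = set().union(*index.values())
    expanded = set(request)
    for term in request:
        for candidate in vocabulary - request:
            if term_closeness(index, term, candidate) >= threshold:
                expanded.add(candidate)
    return expanded


# Hypothetical index: document -> set of assigned index terms.
index = {
    "d1": {"retrieval", "indexing"},
    "d2": {"retrieval", "indexing", "probability"},
    "d3": {"retrieval"},
    "d4": {"logic"},
}

print(term_closeness(index, "retrieval", "indexing"))
print(sorted(elaborate_request(index, {"retrieval"}, threshold=0.5)))
```

With this sample index, two of the three documents indexed under "retrieval" are also indexed under "indexing", so a threshold of 0.5 adds "indexing" to the request but not "probability" or "logic". Documents indexed only under the added term then become candidates for selection, which is the effect the abstract describes.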

Finally, the paper suggests an interpretation of the whole library problem as one where the request is considered as a clue on the basis of which the library system makes a concatenated statistical inference in order to provide as an output an ordered list of those documents which most probably satisfy the information needs of the user.

Index Terms

1. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing
2. Theory of computation
  1. Models of computation
    1. Probabilistic computation

Published in

Journal of the ACM, Volume 7, Issue 3
July 1960
97 pages
ISSN: 0004-5411
EISSN: 1557-735X
DOI: 10.1145/321033

Copyright © 1960 ACM

Publisher
Association for Computing Machinery
New York, NY, United States
