DOI: 10.5555/1654758.1654763

Measuring aboutness of an entity in a text

Published: 09 June 2006

ABSTRACT

In many information retrieval and selection tasks it is valuable to score how much a text is about a certain entity and to compute how much the text discusses the entity with respect to a certain viewpoint. In this paper we are interested in giving an aboutness score to a text, when the input query is a person name and we want to measure the aboutness with respect to the biographical data of that person. We present a graph-based algorithm and compare its results with other approaches.

References

  1. Angheluta, R., Jeuniaux, P., Mitra, R. and Moens, M.-F. (2004). Clustering algorithms for noun phrase coreference resolution. In Proceedings JADT 2004, 7èmes Journées internationales d'Analyse statistique des Données Textuelles. Louvain-La-Neuve, Belgium.
  2. Beghtol, C. (1986). Bibliographic classification theory and text linguistics: Aboutness analysis, intertextuality and the cognitive act of classifying documents. Journal of Documentation, 42(2): 84--113.
  3. Cardie, C. and Wagstaff, K. (1999). Noun phrase coreference as clustering. In Proceedings of the Joint Conference on Empirical Methods in NLP and Very Large Corpora.
  4. Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19: 61--74.
  5. Erkan, G. and Radev, D. R. (2004). LexRank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research, 22: 457--479.
  6. Givón, T. (2001). Syntax. An Introduction. Amsterdam: John Benjamins.
  7. Kleinberg, J. M. (1998). Authoritative sources in a hyperlinked environment. In Proceedings 9th ACM-SIAM Symposium on Discrete Algorithms (pp. 668--677).
  8. Mihalcea, R. and Tarau, P. (2004). TextRank: Bringing order into texts. In Proceedings of EMNLP (pp. 404--411).
  9. Soergel, D. (1994). Indexing and retrieval performance: The logical evidence. Journal of the American Society for Information Science, 45(8): 589--599.
  10. Van Dijk, T. A. and Kintsch, W. (1983). Strategies of Discourse Comprehension. New York: Academic Press.
  11. Zha, H. (2002). Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 113--120). New York: ACM.
Published in

TextGraphs-1: Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing
June 2006, 115 pages
Publisher: Association for Computational Linguistics, United States
