skip to main content
10.1145/2009916.2010019acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Collective entity linking in web text: a graph-based method

Authors Info & Claims
Published:24 July 2011Publication History

ABSTRACT

Entity Linking (EL) is the task of linking name mentions in Web text with their referent entities in a knowledge base. Traditional EL methods usually link name mentions in a document by assuming them to be independent. However, there is often additional interdependence between different EL decisions, i.e., the entities in the same document should be semantically related to each other. In these cases, Collective Entity Linking, in which the name mentions in the same document are linked jointly by exploiting the interdependence between them, can improve the entity linking accuracy.

This paper proposes a graph-based collective EL method, which can model and exploit the global interdependence between different EL decisions. Specifically, we first propose a graph-based representation, called Referent Graph, which can model the global interdependence between different EL decisions. Then we propose a collective inference algorithm, which can jointly infer the referent entities of all name mentions by exploiting the interdependence captured in Referent Graph. The key benefit of our method comes from: 1) The global interdependence model of EL decisions; 2) The purely collective nature of the inference algorithm, in which evidence for related EL decisions can be reinforced into high-probability decisions. Experimental results show that our method can achieve significant performance improvement over the traditional EL methods.

References

  1. Adafre, S. F. & de Rijke, M. 2005. Discovering missing links in Wikipedia. In: Proceedings of the 3rd international workshop on Link discovery. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Artiles, J., Sekine, S. & Gonzalo, J. 2008. Web people search. In: Proceedings of LREC, vol. 8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Bunescu, R. & Pasca, M. 2006. Using encyclopedic knowledge for named entity disambiguation. In: Proceedings of EACL, vol. 6.Google ScholarGoogle Scholar
  4. Cucerzan, S. 2007. Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of EMNLP-CoNLL.Google ScholarGoogle Scholar
  5. Dredze, M., McNamee, P., Rao, D., Gerber, A. & Finin, T. 2010. Entity Disambiguation for Knowledge Base Population. In: Proceedings of COLING. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Fader, A., Soderland, S., Etzioni, O. & Center, T. 2009. Scaling Wikipedia-based named entity disambiguation to arbitrary web text. In: Proceedings of Wiki-AI at IJCAI.Google ScholarGoogle Scholar
  7. Gabrilovich, E. and Markovich, S. 2007. Computing Semantic Relatedness using Wikipedia-based Explicit Semantic Analysis. In: Proceedings of the IJCAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Gbel, F. & Jagers, A. A. 1974. Random walks on graphs. In: Stochastic processes and their applications, vol. 2, no. 4, pp. 311--336.Google ScholarGoogle Scholar
  9. Han, X. & Zhao, J. 2009.Named Entity Disambiguation by leveraging Wikipedia semantic knowledge. In: Proceedings of CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Han, X. & Zhao, J. 2010. Structural semantic relatedness: a knowledge-based method to named entity disambiguation. In: Proceedings of the 49th ACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kulkarni, S., Singh, A., Ramakrishnan, G. & Chakrabarti, S. 2009. Collective annotation of Wikipedia entities in web text. In: Proceedings of the 15th ACM SIGKDD. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Li, X., Morie, P. & Roth, D. 2004. Identification and tracing of ambiguous names: Discriminative and generative approaches. In: Proceedings of AAAI, pp. 419--424. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. McNamee, P. & Dang, H. T. 2009. Overview of the TAC 2009 Knowledge Base Population Track. In: Proceeding of Text Analysis Conference.Google ScholarGoogle Scholar
  14. Milne, D. & Witten, I. H. 2008. Learning to link with Wikipedia. In: Proceedings of the 17th ACM CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Milne, D., et al. 2006. Mining Domain-Specific Thesauri from Wikipedia: A case study. In: Proceedings of WI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Medelyan, O., Witten, I. H. & Milne, D. 2008. Topic indexing with Wikipedia. In: Proceedings of the AAAI WikiAI workshop.Google ScholarGoogle Scholar
  17. Mihalcea, R. & Csomai, A. 2007. Wikify!: linking documents to encyclopedic knowledge. In: Proceedings of the sixteenth ACM CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Pedersen, T., Purandare, A. & Kulkarni, A. 2005. Name discrimination by clustering similar contexts. In: Proceedings of CICLing. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Strube, M. and Ponzetto, S. P. 2006. WikiRelate! Computing Semantic Relatedness Using Wikipedia. In: Proceedings of AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Taher H. Haveliwala. 2003. Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search. IEEE Transactions on Knowledge and Data Engineering. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Tong, H., Faloutsos, C. & Pan, J. Y. 2007. Fast random walk with restart and its applications, Data Mining. In: Proceedings of ICDM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Zhang, W., Su, J., Tan, Chew Lim & Wang, W. T. 2010. Entity Linking Leveraging Automatically Generated Annotation. In: Proceedings of the 23rd COLING. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Zheng, Z., Li, F., Huang, M. & Zhu, X. 2010. Learning to Link Entities with Knowledge Base. In: The Proceedings of NAACL. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Zhou, Y., Nie, L., Rouhani-Kalleh, O., Vasile, F. & Gaffney, S. 2010. Resolving Surface Forms to Wikipedia Topics. In: Proceedings of the 23rd COLING. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Hu, J., Fang, L., Cao, Y., et al. 2008. Enhancing Text Clustering by Leveraging Wikipedia Semantics. In Proceedings of SIGIR. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Collective entity linking in web text: a graph-based method

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '11: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
      July 2011
      1374 pages
      ISBN:9781450307574
      DOI:10.1145/2009916

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 July 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader