Abstract
Semantic annotation aims at linking parts of rough data (e.g., text, video, or image) to known entities in the Linked Open Data (LOD) space. When several entities could be linked to a given object, a Named-Entity Disambiguation (NED) problem must be solved. While disambiguation has been extensively studied in Natural Language Understanding (NLU), NED is less ambitious—it does not aim to the meaning of a whole phrase, just to correctly link objects to entities—and at the same time more peculiar since the target must be LOD-entities. Inspired by semantic similarity in NLU, this paper illustrates a way to solve disambiguation based on Common Subsumers of pairs of RDF resources related to entities recognized in the text. The inference process proposed for resolving ambiguities leverages on the DBpedia structured semantics. We apply it to a TV-program description enrichment use case, illustrating its potential in correcting errors produced by automatic text annotators (such as errors in assigning entity types and entity URIs), and in extracting a description of the main topics of a text in form of commonalities shared by its entities.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Alfonseca, E., Manandhar, S.: An unsupervised method for general named entity recognition and automated concept discovery. In: Proc. of the 1st Int. Conf. on General WordNet, Mysore, India, pp. 34–43 (2002)
Bellot, P., Bonnefoy, L., Bouvier, V., Duvert, F., Kim, Y.M.: Large scale text mining approaches for information retrieval and extraction. In: Innovations in Intelligent Machines-4, pp. 3–45. Springer (2014)
Bunescu, R.C., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: Proc. of the 11th Conf. of the European Chapter of the Association for Computational Linguistics (EACL-06), vol. 6, pp. 9–16 (2006)
Cambria, E., White, B.: Jumping NLP curves: A review of natural language processing research. IEEE Computational Intelligence Magazine 9(2), 48–57 (2014)
Chen, L., Ortona, S., Orsi, G., Benedikt, M.: Aggregating semantic annotators. Proc. of the VLDB Endowment 6(13), 1486–1497 (2013)
Chieu, H.L., Ng, H.T.: Named entity recognition: a maximum entropy approach using global information. In: Proc. of the 19th Int. Conf. on Computational linguistics, vol. 1, pp. 1–7. ACL (2002)
Cimiano, P., Völker, J.: Towards large-scale, open-domain and ontology-based named entity classification. In: Proc. of the Int. Conf. on Recent Advances in Natural Language Processing (RANLP) (2005)
Colucci, S., Donini, F.M., Di Sciascio, E.: Common subsumbers in RDF. In: Baldoni, M., Baroglio, C., Boella, G., Micalizio, R. (eds.) AI*IA 2013. LNCS, vol. 8249, pp. 348–359. Springer, Heidelberg (2013)
Colucci, S., Giannini, S., Donini, F.M., Di Sciascio, E.: A deductive approach to the identification and description of clusters in Linked Open Data. In: Proc. of the 21th European Conf. on Artificial Intelligence (ECAI 2014). IOS Press (2014)
Cucerzan, S.: TAC entity linking by performing full-document entity extraction and disambiguation. In: Proc. of the Text Analysis Conference, vol. 2011 (2011)
Dredze, M., McNamee, P., Rao, D., Gerber, A., Finin, T.: Entity disambiguation for knowledge base population. In: Proc. of the 23rd Int. Conf. on Computational Linguistics, pp. 277–285. ACL, Beijing, August 2010
Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named-entity extraction from the web: An experimental study. Artificial Intelligence 165(1), 91–134 (2005)
Fetahu, B., Dietze, S., Pereira Nunes, B., Antonio Casanova, M., Taibi, D., Nejdl, W.: What’s all the data about?: creating structured profiles of linked data on the web. In: Proc. of the Companion Publication of the 23rd Int. Conf. on World Wide Web Companion, pp. 261–262. International World Wide Web Conferences Steering Committee (2014)
Gangemi, A.: A comparison of knowledge extraction tools for the semantic web. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 351–366. Springer, Heidelberg (2013)
Hayes, P.: RDF Semantics, W3C Recommendation (2004). http://www.w3.org/TR/2004/REC-rdf-mt-20040210/
Hellmann, S., Lehmann, J., Auer, S., Brümmer, M.: Integrating NLP using linked data. In: Alani, H., et al. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 98–113. Springer, Heidelberg (2013)
Hobbs, J.R., Stickel, M., Martin, P., Edwards, D.: Interpretation as abduction. In: Proc. of the 26th Annual Meeting on Association for Computational Linguistics, pp. 95–103. ACL (1988)
Hoffart, J., Yosef, M.A., Bordino, Ilaria Fürstenau and H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., Weikum, G.: Robust disambiguation of named entities in text. In: Proc. of the Conf. on Empirical Methods in Natural Language Processing, pp. 782–792. ACL, Edinburgh, July 2011
Maccatrozzo, V.: Burst the filter bubble: using semantic web to enable serendipity. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part II. LNCS, vol. 7650, pp. 391–398. Springer, Heidelberg (2012)
Mihalcea, R., Csomai, A.: Wikify!: linking documents to encyclopedic knowledge. In: Proc. of the 16th ACM Conf. on Information and Knowledge Management, pp. 233–242. ACM (2007)
Milne, D., Witten, I.H.: Learning to link with wikipedia. In: Proc. of the 17th ACM Conf. on Information and Knowledge Management, pp. 509–518. ACM (2008)
Moro, A., Raganato, A., Navigli, R.: Entity linking meets word sense disambiguation: a unified approach. Transactions of the Association for Computational Linguistics 2, 231–244 (2014)
Nakashole, N., Theobald, M., Weikum, G.: Scalable knowledge harvesting with high precision and high recall. In: Proc. of the Fourth ACM Int. Conf. on Web Search and Data Mining, pp. 227–236. ACM, Hong Kong, February 2011
Navigli, R., Ponzetto, S.P.: BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence 193, 217–250 (2012)
Navigli, R., Velardi, P.: Structural semantic interconnections: a knowledge-based approach to word sense disambiguation. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(7), 1075–1086 (2005)
Niu, C., Li, W., Ding, J., Srihari, R.K.: A bootstrapping approach to named entity classification using successive learners. In: Proc. of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 335–342. ACL (2003)
Prokofyev, R., Demartini, G., Cudré-Mauroux, P.: Effective named entity recognition for idiosyncratic web collections. In: Proc. of the 23rd Int. Conf. on World Wide Web, pp. 397–408. International World Wide Web Conferences Steering Committee (2014)
Rizzo, G., Erp, M.V., Troncy, R.: Benchmarking the extraction and disambiguation of named entities on the semantic web. In: Proc. of the 9th Int. Conf. on Language Resources and Evaluation (LREC 2014). European Language Resources Association (ELRA), Reykjavik, May 2014
Studer, R., Burghart, C., Stojanovic, N., Thanh, T., Zacharias, V.: New dimensions in semantic knowledge management. In: Towards the Internet of Services: The THESEUS Research Program, pp. 37–50. Springer (2014)
Van Erp, M., Rizzo, G., Troncy, R.: Learning with the web: spotting named entities on the intersection of NERD and machine learning. In: # MSM, pp. 27–30. Citeseer (2013)
Zhang, L., Pan, Y., Zhang, T.: Focused named entity recognition using machine learning. In: Proc. of the 27th Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, pp. 281–288. ACM (2004)
Zheng, Z., Si, X., Li, F., Chang, E.Y., Zhu, X.: Entity disambiguation with freebase. In: Proc. of the The 2012 IEEE/WIC/ACM Int. Joint Conf. on Web Intelligence and Intelligent Agent Technology, vol. 01, pp. 82–89. IEEE Computer Society (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Giannini, S., Colucci, S., Donini, F.M., Di Sciascio, E. (2015). A Logic-Based Approach to Named-Entity Disambiguation in the Web of Data. In: Gavanelli, M., Lamma, E., Riguzzi, F. (eds) AI*IA 2015 Advances in Artificial Intelligence. AI*IA 2015. Lecture Notes in Computer Science(), vol 9336. Springer, Cham. https://doi.org/10.1007/978-3-319-24309-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-24309-2_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24308-5
Online ISBN: 978-3-319-24309-2
eBook Packages: Computer ScienceComputer Science (R0)