Knowledge Sources for WSD

Agirre, Eneko; Stevenson, Mark

doi:10.1007/978-1-4020-4809-8_8

Eneko Agirre⁵ &
Mark Stevenson⁶

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 33))

921 Accesses
6 Citations

This chapter explores the different sources of linguistic knowledge that can be employed by WSD systems. These are more abstract than the features used by WSD algorithms, which are encoded at the algorithmic level and normally extracted from a lexical resource or corpora. The chapter begins by listing a comprehensive set of knowledge sources with examples of their application and then explains whether this linguistic knowledge may be found in corpora, lexical knowledge bases or machine readable dictionaries. An analysis of knowledge sources used in actual WSD systems is then presented. It has been observed that the best results are often obtained by combining knowledge sources and the chapter concludes by analyzing experiments on the effect of different knowledge sources which have implications about the effectiveness of each.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agirre, Eneko & David Martínez. 2001a. Knowledge sources for word sense disambiguation. Proceedings of the Fourth International Conference on Text Speech and Dialogue (TSD), Plzen, Czech Republic.
Google Scholar
Agirre, Eneko & David Martínez. 2001b. Learning class-to-class selectional preferences. Proceedings of the ACL/EACL Workshop on Computational Natural Language Learning (CoNLL), Toulouse, France.
Google Scholar
Agirre, Eneko & German Rigau. 1996. Word sense disambiguation using conceptual density. Proceedings of the 16th International Conference on Computational Linguistics (COLING), Copenhagen, Denmark.
Google Scholar
ALPAC. 1966. Languages and Machines: Computers in Translation and Linguistics. National Research Council Publication 1416, Washington, USA.
Google Scholar
Bar-Hillel, Yehoshua. 1964. Language and Information Addison-Wesley, New York, USA.
Google Scholar
Bateman, John A., Robert Kasper, Johanna Moore & Richard A. Whitney. 1990. A General Organization of Knowledge for Natural Language Processing: The PENMAN Upper Model. Technical report, USC/Information Sciences Institute, Marina del Rey, USA.
Google Scholar
Boguraev, Branimir. 1979. Automatic Resolution of Linguistic Ambiguities. Ph.D. Thesis, Computer Laboratory, University of Cambridge, Cambridge, UK.
Google Scholar
Brill, Eric. 1995. Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics, 21 (4):543-566.
Google Scholar
Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra & Robert L. Mercer. 1991. Word sense disambiguation using statistical methods. Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL), Berkeley, USA, 264-270.
Chapter Google Scholar
Bruce, Rebecca & Louise Guthrie. 1992. Genus disambiguation: A study in weighted preference. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 1187-1191.
Chapter Google Scholar
Carroll, John & Ted Briscoe. 2001. High precision extraction of grammatical relations. Proceedings of the 7th ACL/SIGPARSE International Workshop on Parsing Technologies, Beijing, China, 78-89.
Google Scholar
Charniak, Eugene. 1983. Marker Passing: A Theory of Contextual Influence in Language Comprehension. Cognitive Science 7.
Google Scholar
Chapman, Robert L. 1977. Roget’s International Thesaurus, Fourth Edition. Harper and Row, New York, USA.
Google Scholar
Cowie, Jim, Louise Guthrie & Joe Guthrie. 1992. Lexical disambiguation using simulated annealing. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 359-365.
Chapter Google Scholar
Cruse, David. 1998. Lexical Semantics. Cambridge University Press, Cambridge, UK.
Google Scholar
Daelemans, Walter, Jakub Zavrel, Ko van der Sloot & Antal van den Bosch. 1999. TiMBL: Tilburg Memory Based Learner, Version 2.0, Reference Guide. ILK Technical Report 99-01, Tilburg University, The Netherlands.
Google Scholar
Decadt, Bart, Véronique Hoste, Walter Daelemans, & Antal van den Bosch. 2004. GAMBL, Genetic Algorithm Optimization of Memory-Based WSD. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 108-112.
Google Scholar
Dang, Hoa Trang & Martha Palmer. 2002. Combining contextual features for word sense disambiguation. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, USA.
Google Scholar
Duda, Richard & Peter E. Hart. 1973. Pattern Classification and Scene Analysis. New York: Wiley.
Google Scholar
Elworthy, David. 1994. Does Baum-Welch re-estimation help taggers? Proceedings of the 4th Conference on Applied Natural Language Processing, Stuttgart, Germany, 53-58.
Chapter Google Scholar
Fellbaum, Christiane. 1998. WordNet: An Electronic Lexical Database. Massachusetts and London: The MIT Press.
Google Scholar
Fernández, David, Julio Gonzalo & Felisa Verdejo. 2001. The UNED systems at Senseval-2. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 75-78.
Google Scholar
Fillmore, Charles. 1971. Types of lexical information. Semantics: An interdisciplinary reader in philosophy, linguistics, and psychology. Cambridge: Cambridge University Press, 370-392.
Google Scholar
Florian, Radu, Silviu Cucerzan, Charles Schafer & David Yarowsky. 2002. Classifier combination for word sense disambiguation. Journal of Natural Language Engineering, 8(4): 327-341.
Article Google Scholar
Freund, Yoav & Robert E. Schapire. 1996. Experiments with a new boosting algorithm. Proceedings of the 13th International Conference on Machine Learning, Bari, Italy, 148-156.
Google Scholar
Gale, William, Kenneth W. Church & David Yarowsky. 1993. A method for disambiguating word senses in a large corpus. Computers and the Humanities, 26 (5): 415-439.
Google Scholar
Hirst, Graeme. 1987. Semantic Interpretation and the Resolution of Ambiguity. Cambridge, UK: Cambridge University Press.
Google Scholar
Kilgarriff, Adam. 1997. Putting frequencies in the dictionary. International Journal of Lexicography, 10(2): 135-155
Article Google Scholar
Kriedler, Charles. 1998. Introducing English Semantics. London and New York: Routledge.
Google Scholar
Lee, Yoong K. & Hwee Tou Ng. 2002. An empirical evaluation of knowledge sources and learning algorithms for word sense disambiguation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, USA, 41-48.
Chapter Google Scholar
Lee, Yoong K., Hwee Tou Ng & Tee Kiah Chia. 2004. Supervised word sense disambiguation with support vector machines and multiple knowledge sources. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 137-140.
Google Scholar
Lesk, Michael. 1986. Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. Proceedings of SIGDOC-86: 5th International Conference on Systems Documentation, Toronto, Canada, 24-26.
Chapter Google Scholar
Lin, Dekang. 1993. Principle based parsing without overgeneration. Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics (ACL), Columbus, USA, 112-120.
Chapter Google Scholar
Lin, Dekang. 1997. Using syntactic dependency as local context to resolve word sense ambiguity. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL), Madrid, 64-71.
Google Scholar
Magnini, Bernardo, Carlo Strapparava, Giovani Pezzulo & Alfio Gliozzo. 2001. Using domain information for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, France, 111-114.
Google Scholar
Martínez, David, Eneko Agirre & Lluis Márquez. 2002. Syntactic features for high precision word sense disambiguation. Proceedings of the 19th International Conference on Computational Linguistics (COLING), Taipei, Taiwan.
Google Scholar
McCarthy, Diana, John Carroll & Judita Preiss. 2001. Disambiguating noun and verb senses using automatically acquired selectional preferences. Proceedings of the ACL/EACL Senseval-2 Workshop, Toulouse, France.
Google Scholar
McCarthy, Diana, Rob Koeling, Julie Weeds & John Carroll. 2004. Using automatically acquired predominant senses for word sense disambiguation. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 151-158.
Google Scholar
McRoy, Susan W. 1992. Using multiple knowledge sources for word sense disambiguation. Computational Linguistics, 18(1): 1-30.
Google Scholar
Mihalcea, Rada & Dan Moldovan. 2001. Pattern learning and active feature selection for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.
Google Scholar
Mihalcea, Rada & Ehsanul Faruque. 2004. SenseLearner: Minimally supervised word sense disambiguation for all words in open text. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 155-158.
Google Scholar
Miller, George. A., Claudia Leacock, Randee Tengi & Ross. T. Bunker. 1993. A semantic concordance. Proceedings of the ARPA Workshop on Human Language Technology, 303-308.
Google Scholar
Montoyo, Andres & Armando Suárez. 2001. The University of Alicante word sense disambiguation system. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.
Google Scholar
Ng, Hwee Tou & Hiang B. Lee. 1996. Integrating multiple knowledge sources for word sense disambiguation: An exemplar-based approach. Proceedings of the 34th Meeting of the Association for Computational Linguistics (ACL), Santa Cruz, CA, USA, 40-47.
Chapter Google Scholar
Patwardhan, Siddharth, Satanjeev Banerjee & Ted Pedersen. 2003. Using measures of semantic relatedness for word sense disambiguation. Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico.
Google Scholar
Pedersen, Ted. 2002. Assessing system agreement and instance difficulty in the lexical sample tasks of Senseval-2. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, PA, USA.
Google Scholar
Preiss, Judita. 2001. Anaphora resolution with word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 143-146.
Google Scholar
Procter, Paul, ed. 1978. Longman Dictionary of Contemporary English. Harlow, UK: Longman Group.
Google Scholar
Quinlan, J. Ross. 1993. C4.5: Programs for Machine Learning. San Francisco: Morgan Kaufmann.
Google Scholar
Resnik, Philip. 1997. Selectional preferences and word sense disambiguation. Proceedings of the ACL/SIGLEX Workshop on Tagging Text with Lexical Semantics: What, Why and How?, Washington, DC, USA, 52-57.
Google Scholar
Resnik, Philip & David Yarowsky. 1997. A perspective on word sense disambiguation algorithms and their evaluation. Proceedings of the ACL/SIGLEX Workshop Tagging Texts with Lexical Semantics: What, Why and How?, Washingtonn, DC, USA, 79-86.
Google Scholar
Sag, Ivan A., Timothy Baldwin, Francis Bond, Ann Copestake & Dan Flickinger. 2002. Multiword expressions: A pain in the neck for NLP. Proceedings of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico, 1-15.
Google Scholar
Small, Steven L. 1980. Word Expert Parsing: A Theory of Distributed Wordbased Natural Language Understanding. Ph.D. Thesis, Department of Computer Science, University of Maryland, USA.
Google Scholar
Strapparava, Carlo, Alfio Gliozzo, & Claudio Giuliano. 2004. Pattern abstraction and term similarity for word sense disambiguation: IRST at Senseval-3. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 229-233.
Google Scholar
Stevenson, Mark. 2003. Word Sense Disambiguation: The Case for Combination of Knowledge Sources. Stanford, USA: CSLI Publications.
Google Scholar
Stevenson, Mark & Yorick Wilks. 2001. The interaction of knowledge sources in word sense disambiguation. Computational Linguistics, 27(3): 321-349.
Article Google Scholar
Vapnik, Vladimir. 1995. The Nature of Statistical Learning Theory. New York, USA: Springer-Verlag.
Google Scholar
Wilks, Yorick. 1975. A preferential pattern-seeking semantics for natural language inference. Artificial Intelligence, 6: 53-74.
Article Google Scholar
Wilks, Yorick. 1978. Making preferences more active. Artificial Intelligence 11(3): 197-223.
Article Google Scholar
Wilks, Yorick & Mark Stevenson. 1998. The grammar of sense: Using part of speech tags as a first step in semantic disambiguation. Journal of Natural Language Engineering, 4(2): 135-144.
Article Google Scholar
Yarowsky, David. 1992. Word-sense disambiguation using statistical models of Roget’s categories trained on large corpora. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 454-460.
Google Scholar
Yarowsky, David. 1996. Three Algorithms for Lexical Ambiguity Resolution, Ph.D. Thesis, School of Computer and Information Science, University of Pennsylvania, USA.
Google Scholar
Yarowsky, David, Silviu Cucerzan, Radu Florian, Charles Schafer & Richard Wicentowski. 2001. The Johns Hopkins Senseval-2 system description. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 163-166.
Google Scholar
Yarowsky, David & Radu Florian. 2002. Evaluating sense disambiguation across diverse parameter spaces. Journal of Natural Language Engineering, 8(2): 293-310.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of the Basque Country, Manuel de Lardizabal 1, E-20018, Donostia, Basque Country, Spain
Associate Professor Eneko Agirre
Department of Computer Science, University of Sheffield, S1 4DP, Sheffield, UK
Lecturer Mark Stevenson

Authors

Associate Professor Eneko Agirre
View author publications
You can also search for this author in PubMed Google Scholar
Lecturer Mark Stevenson
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of the Basque Country, Manuel de Lardizabal 1, E-20018, Donostia, Basque Country, Spain
Eneko Agirre
Sharp Laboratories of Europe Limited, Oxford Science Park, OX4 4GB, Oxford, UK
Philip Edmonds

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Agirre, E., Stevenson, M. (2007). Knowledge Sources for WSD. In: Agirre, E., Edmonds, P. (eds) Word Sense Disambiguation. Text, Speech and Language Technology, vol 33. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-4809-8_8

Download citation

DOI: https://doi.org/10.1007/978-1-4020-4809-8_8
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4808-1
Online ISBN: 978-1-4020-4809-8
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)

Publish with us

Policies and ethics