Skip to main content

Knowledge Sources for WSD

  • Chapter
Word Sense Disambiguation

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 33))

This chapter explores the different sources of linguistic knowledge that can be employed by WSD systems. These are more abstract than the features used by WSD algorithms, which are encoded at the algorithmic level and normally extracted from a lexical resource or corpora. The chapter begins by listing a comprehensive set of knowledge sources with examples of their application and then explains whether this linguistic knowledge may be found in corpora, lexical knowledge bases or machine readable dictionaries. An analysis of knowledge sources used in actual WSD systems is then presented. It has been observed that the best results are often obtained by combining knowledge sources and the chapter concludes by analyzing experiments on the effect of different knowledge sources which have implications about the effectiveness of each.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Agirre, Eneko & David Martínez. 2001a. Knowledge sources for word sense disambiguation. Proceedings of the Fourth International Conference on Text Speech and Dialogue (TSD), Plzen, Czech Republic.

    Google Scholar 

  • Agirre, Eneko & David Martínez. 2001b. Learning class-to-class selectional preferences. Proceedings of the ACL/EACL Workshop on Computational Natural Language Learning (CoNLL), Toulouse, France.

    Google Scholar 

  • Agirre, Eneko & German Rigau. 1996. Word sense disambiguation using conceptual density. Proceedings of the 16th International Conference on Computational Linguistics (COLING), Copenhagen, Denmark.

    Google Scholar 

  • ALPAC. 1966. Languages and Machines: Computers in Translation and Linguistics. National Research Council Publication 1416, Washington, USA.

    Google Scholar 

  • Bar-Hillel, Yehoshua. 1964. Language and Information Addison-Wesley, New York, USA.

    Google Scholar 

  • Bateman, John A., Robert Kasper, Johanna Moore & Richard A. Whitney. 1990. A General Organization of Knowledge for Natural Language Processing: The PENMAN Upper Model. Technical report, USC/Information Sciences Institute, Marina del Rey, USA.

    Google Scholar 

  • Boguraev, Branimir. 1979. Automatic Resolution of Linguistic Ambiguities. Ph.D. Thesis, Computer Laboratory, University of Cambridge, Cambridge, UK.

    Google Scholar 

  • Brill, Eric. 1995. Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics, 21 (4):543-566.

    Google Scholar 

  • Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra & Robert L. Mercer. 1991. Word sense disambiguation using statistical methods. Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL), Berkeley, USA, 264-270.

    Chapter  Google Scholar 

  • Bruce, Rebecca & Louise Guthrie. 1992. Genus disambiguation: A study in weighted preference. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 1187-1191.

    Chapter  Google Scholar 

  • Carroll, John & Ted Briscoe. 2001. High precision extraction of grammatical relations. Proceedings of the 7th ACL/SIGPARSE International Workshop on Parsing Technologies, Beijing, China, 78-89.

    Google Scholar 

  • Charniak, Eugene. 1983. Marker Passing: A Theory of Contextual Influence in Language Comprehension. Cognitive Science 7.

    Google Scholar 

  • Chapman, Robert L. 1977. Roget’s International Thesaurus, Fourth Edition. Harper and Row, New York, USA.

    Google Scholar 

  • Cowie, Jim, Louise Guthrie & Joe Guthrie. 1992. Lexical disambiguation using simulated annealing. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 359-365.

    Chapter  Google Scholar 

  • Cruse, David. 1998. Lexical Semantics. Cambridge University Press, Cambridge, UK.

    Google Scholar 

  • Daelemans, Walter, Jakub Zavrel, Ko van der Sloot & Antal van den Bosch. 1999. TiMBL: Tilburg Memory Based Learner, Version 2.0, Reference Guide. ILK Technical Report 99-01, Tilburg University, The Netherlands.

    Google Scholar 

  • Decadt, Bart, Véronique Hoste, Walter Daelemans, & Antal van den Bosch. 2004. GAMBL, Genetic Algorithm Optimization of Memory-Based WSD. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 108-112.

    Google Scholar 

  • Dang, Hoa Trang & Martha Palmer. 2002. Combining contextual features for word sense disambiguation. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, USA.

    Google Scholar 

  • Duda, Richard & Peter E. Hart. 1973. Pattern Classification and Scene Analysis. New York: Wiley.

    Google Scholar 

  • Elworthy, David. 1994. Does Baum-Welch re-estimation help taggers? Proceedings of the 4th Conference on Applied Natural Language Processing, Stuttgart, Germany, 53-58.

    Chapter  Google Scholar 

  • Fellbaum, Christiane. 1998. WordNet: An Electronic Lexical Database. Massachusetts and London: The MIT Press.

    Google Scholar 

  • Fernández, David, Julio Gonzalo & Felisa Verdejo. 2001. The UNED systems at Senseval-2. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 75-78.

    Google Scholar 

  • Fillmore, Charles. 1971. Types of lexical information. Semantics: An interdisciplinary reader in philosophy, linguistics, and psychology. Cambridge: Cambridge University Press, 370-392.

    Google Scholar 

  • Florian, Radu, Silviu Cucerzan, Charles Schafer & David Yarowsky. 2002. Classifier combination for word sense disambiguation. Journal of Natural Language Engineering, 8(4): 327-341.

    Article  Google Scholar 

  • Freund, Yoav & Robert E. Schapire. 1996. Experiments with a new boosting algorithm. Proceedings of the 13th International Conference on Machine Learning, Bari, Italy, 148-156.

    Google Scholar 

  • Gale, William, Kenneth W. Church & David Yarowsky. 1993. A method for disambiguating word senses in a large corpus. Computers and the Humanities, 26 (5): 415-439.

    Google Scholar 

  • Hirst, Graeme. 1987. Semantic Interpretation and the Resolution of Ambiguity. Cambridge, UK: Cambridge University Press.

    Google Scholar 

  • Kilgarriff, Adam. 1997. Putting frequencies in the dictionary. International Journal of Lexicography, 10(2): 135-155

    Article  Google Scholar 

  • Kriedler, Charles. 1998. Introducing English Semantics. London and New York: Routledge.

    Google Scholar 

  • Lee, Yoong K. & Hwee Tou Ng. 2002. An empirical evaluation of knowledge sources and learning algorithms for word sense disambiguation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, USA, 41-48.

    Chapter  Google Scholar 

  • Lee, Yoong K., Hwee Tou Ng & Tee Kiah Chia. 2004. Supervised word sense disambiguation with support vector machines and multiple knowledge sources. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 137-140.

    Google Scholar 

  • Lesk, Michael. 1986. Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. Proceedings of SIGDOC-86: 5th International Conference on Systems Documentation, Toronto, Canada, 24-26.

    Chapter  Google Scholar 

  • Lin, Dekang. 1993. Principle based parsing without overgeneration. Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics (ACL), Columbus, USA, 112-120.

    Chapter  Google Scholar 

  • Lin, Dekang. 1997. Using syntactic dependency as local context to resolve word sense ambiguity. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL), Madrid, 64-71.

    Google Scholar 

  • Magnini, Bernardo, Carlo Strapparava, Giovani Pezzulo & Alfio Gliozzo. 2001. Using domain information for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, France, 111-114.

    Google Scholar 

  • Martínez, David, Eneko Agirre & Lluis Márquez. 2002. Syntactic features for high precision word sense disambiguation. Proceedings of the 19th International Conference on Computational Linguistics (COLING), Taipei, Taiwan.

    Google Scholar 

  • McCarthy, Diana, John Carroll & Judita Preiss. 2001. Disambiguating noun and verb senses using automatically acquired selectional preferences. Proceedings of the ACL/EACL Senseval-2 Workshop, Toulouse, France.

    Google Scholar 

  • McCarthy, Diana, Rob Koeling, Julie Weeds & John Carroll. 2004. Using automatically acquired predominant senses for word sense disambiguation. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 151-158.

    Google Scholar 

  • McRoy, Susan W. 1992. Using multiple knowledge sources for word sense disambiguation. Computational Linguistics, 18(1): 1-30.

    Google Scholar 

  • Mihalcea, Rada & Dan Moldovan. 2001. Pattern learning and active feature selection for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.

    Google Scholar 

  • Mihalcea, Rada & Ehsanul Faruque. 2004. SenseLearner: Minimally supervised word sense disambiguation for all words in open text. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 155-158.

    Google Scholar 

  • Miller, George. A., Claudia Leacock, Randee Tengi & Ross. T. Bunker. 1993. A semantic concordance. Proceedings of the ARPA Workshop on Human Language Technology, 303-308.

    Google Scholar 

  • Montoyo, Andres & Armando Suárez. 2001. The University of Alicante word sense disambiguation system. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.

    Google Scholar 

  • Ng, Hwee Tou & Hiang B. Lee. 1996. Integrating multiple knowledge sources for word sense disambiguation: An exemplar-based approach. Proceedings of the 34th Meeting of the Association for Computational Linguistics (ACL), Santa Cruz, CA, USA, 40-47.

    Chapter  Google Scholar 

  • Patwardhan, Siddharth, Satanjeev Banerjee & Ted Pedersen. 2003. Using measures of semantic relatedness for word sense disambiguation. Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico.

    Google Scholar 

  • Pedersen, Ted. 2002. Assessing system agreement and instance difficulty in the lexical sample tasks of Senseval-2. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, PA, USA.

    Google Scholar 

  • Preiss, Judita. 2001. Anaphora resolution with word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 143-146.

    Google Scholar 

  • Procter, Paul, ed. 1978. Longman Dictionary of Contemporary English. Harlow, UK: Longman Group.

    Google Scholar 

  • Quinlan, J. Ross. 1993. C4.5: Programs for Machine Learning. San Francisco: Morgan Kaufmann.

    Google Scholar 

  • Resnik, Philip. 1997. Selectional preferences and word sense disambiguation. Proceedings of the ACL/SIGLEX Workshop on Tagging Text with Lexical Semantics: What, Why and How?, Washington, DC, USA, 52-57.

    Google Scholar 

  • Resnik, Philip & David Yarowsky. 1997. A perspective on word sense disambiguation algorithms and their evaluation. Proceedings of the ACL/SIGLEX Workshop Tagging Texts with Lexical Semantics: What, Why and How?, Washingtonn, DC, USA, 79-86.

    Google Scholar 

  • Sag, Ivan A., Timothy Baldwin, Francis Bond, Ann Copestake & Dan Flickinger. 2002. Multiword expressions: A pain in the neck for NLP. Proceedings of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico, 1-15.

    Google Scholar 

  • Small, Steven L. 1980. Word Expert Parsing: A Theory of Distributed Wordbased Natural Language Understanding. Ph.D. Thesis, Department of Computer Science, University of Maryland, USA.

    Google Scholar 

  • Strapparava, Carlo, Alfio Gliozzo, & Claudio Giuliano. 2004. Pattern abstraction and term similarity for word sense disambiguation: IRST at Senseval-3. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 229-233.

    Google Scholar 

  • Stevenson, Mark. 2003. Word Sense Disambiguation: The Case for Combination of Knowledge Sources. Stanford, USA: CSLI Publications.

    Google Scholar 

  • Stevenson, Mark & Yorick Wilks. 2001. The interaction of knowledge sources in word sense disambiguation. Computational Linguistics, 27(3): 321-349.

    Article  Google Scholar 

  • Vapnik, Vladimir. 1995. The Nature of Statistical Learning Theory. New York, USA: Springer-Verlag.

    Google Scholar 

  • Wilks, Yorick. 1975. A preferential pattern-seeking semantics for natural language inference. Artificial Intelligence, 6: 53-74.

    Article  Google Scholar 

  • Wilks, Yorick. 1978. Making preferences more active. Artificial Intelligence 11(3): 197-223.

    Article  Google Scholar 

  • Wilks, Yorick & Mark Stevenson. 1998. The grammar of sense: Using part of speech tags as a first step in semantic disambiguation. Journal of Natural Language Engineering, 4(2): 135-144.

    Article  Google Scholar 

  • Yarowsky, David. 1992. Word-sense disambiguation using statistical models of Roget’s categories trained on large corpora. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 454-460.

    Google Scholar 

  • Yarowsky, David. 1996. Three Algorithms for Lexical Ambiguity Resolution, Ph.D. Thesis, School of Computer and Information Science, University of Pennsylvania, USA.

    Google Scholar 

  • Yarowsky, David, Silviu Cucerzan, Radu Florian, Charles Schafer & Richard Wicentowski. 2001. The Johns Hopkins Senseval-2 system description. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 163-166.

    Google Scholar 

  • Yarowsky, David & Radu Florian. 2002. Evaluating sense disambiguation across diverse parameter spaces. Journal of Natural Language Engineering, 8(2): 293-310.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer

About this chapter

Cite this chapter

Agirre, E., Stevenson, M. (2007). Knowledge Sources for WSD. In: Agirre, E., Edmonds, P. (eds) Word Sense Disambiguation. Text, Speech and Language Technology, vol 33. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-4809-8_8

Download citation

Publish with us

Policies and ethics