透過您的圖書館登入
IP:3.129.210.17
  • 期刊
  • OpenAccess

Automatic Pronominal Anaphora Resolution in English Texts

並列摘要


Anaphora is a common phenomenon in discourses as well as an important research issue in the applications of natural language processing. In this paper, anaphora resolution is achieved by employing WordNet ontology and heuristic rules. The proposed system identifies both intra-sentential and inter-sentential antecedents of anaphors. Information about animacy is obtained by analyzing the hierarchical relations of nouns and verbs in the surrounding. context. The identification of animacy entities and pleonastic-it usage in English discourses are employed to promote resolution accuracy. Traditionally, anaphora resolution systems have relied on syntactic, semantic or pragmatic clues to identify the antecedent of an anaphor. Our proposed method makes use of WordNet ontology to identify animate entities as well as essential gender information. In the animacy agreement module, the property is identified by the hypernym relation between entities and their unique beginners defined in WordNet. In addition, the verb of the entity is also an important clue used to reduce the uncertainty. An experiment was conducted using a balanced corpus to resolve the pronominal anaphora phenomenon. The methods proposed in [Lappin and Leass, 94] and [Mitkov, 01] focus on the corpora with only inanimate pronouns such as ”it” or ”its”. Thus the results of intra-sentential and inter-sentential anaphora distribution are different. In an experiment using Brown corpus, we found that the distribution proportion of intra-sentential anaphora is about 60%. Seven heuristic rules are applied in our system; five of them are preference rules, and two are constraint rules. They are derived from syntactic, semantic, pragmatic conventions and from the analysis of training data. A relative measurement indicates that about 30% of the errors can be eliminated by applying heuristic module.

並列關鍵字

無資料

參考文獻


Baldwin, Breck(1997).Proceedings of the ACL'97/EACL'97 Workshop on Operational factors in practical, robust anaphora resolution.
Barkema, H.(1997).The TOSCA-ICLE Tagset: Tagging manual.
Branimir Christopher, Christopher(1996).Proceedings of the 16th International Conference on Computational Linguistics.
Bontcheva, Kalina,Dimitrov, Marin,Maynard, Diana,Tablan, Valentin(2002).Proceedings of TRAITEMENT AUTOMATIQUE DES LANGUES NATURELLES (TALN).
Cardie, Claire,Wagstaff, Kiri(1999).Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Very Large Corpora.

被引用紀錄


Lin, Y. T. (2009). 以共同指向決策方案的自動多選題產生系統 [master's thesis, National Taiwan University]. Airiti Library. https://doi.org/10.6342/NTU.2009.01275
Chen, Y. C. (2005). 中文零代詞解析與應用 [doctoral dissertation, Tatung University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0081-0607200917233352

延伸閱讀