ABSTRACT
We describe the Hindi Discourse Relation Bank project, aimed at developing a large corpus annotated with discourse relations. We adopt the lexically grounded approach of the Penn Discourse Treebank, and describe our classification of Hindi discourse connectives, our modifications to the sense classification of discourse relations, and some cross-linguistic comparisons based on some initial annotations carried out so far.
- Rafiya Begum, Samar Husain, Arun Dhwaj, Dipti Misra Sharma, Lakshmi Bai, and Rajeev Sangal. 2008. Dependency annotation scheme for Indian languages. Proc. of IJCNLP-2008.Google Scholar
- Lucie Mladová, Šárka Zikánová and Eva Hajičová. 2008. From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank. Proc. of LREC-2008.Google Scholar
- Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind Joshi, and Bonnie Webber. 2008. The Penn Discourse TreeBank 2.0. Proc. of LREC-2008.Google Scholar
- Rashmi Prasad, Samar Husain, Dipti Mishra Sharma, and Aravind Joshi. 2008. Towards an Annotated Corpus of Discourse Relations in Hindi. Proc. of IJCNLP-2008.Google Scholar
- Eve Sweetser. 1990. From etymology to pragmatics: Metaphorical and cultural aspects of semantic structure. Cambridge University Press.Google Scholar
- Nianwen Xue. 2005. Annotating Discourse Connectives in the Chinese Treebank. Proc. of the ACL Workshop on Frontiers in Corpus Annotation II: Pie in the Sky. Google ScholarDigital Library
- Deniz Zeyrek and Bonnie Webber. 2008. A Discourse Resource for Turkish: Annotating Discourse Connectives in the METU Corpus. Proc. of IJCNLP-2008.Google Scholar
Index Terms
- The Hindi Discourse Relation Bank
Recommendations
Cross-Lingual Transfer for Hindi Discourse Relation Identification
Text, Speech, and DialogueAbstractDiscourse relations between two textual spans in a document attempt to capture the coherent structure which emerges in language use. Automatic classification of these relations remains a challenging task especially in case of implicit discourse ...
Using a Discourse Bank and a Lexicon for the Automatic Identification of Discourse Connectives
Computational Processing of the Portuguese LanguageAbstractWe describe two new resources that have been prepared for European Portuguese and how they are used for discourse parsing: the Portuguese subpart of the TED-MDB corpus, a multilingual corpus of TED Talks that has been annotated in the PDTB style, ...
Towards Building Vietnamese Discourse Treebank
SoICT '17: Proceedings of the 8th International Symposium on Information and Communication TechnologyDiscourse analysis is an important natural language processing task. There are many discourse parsers in many languages, such as English and Chinese, constructing discourse trees from text documents for further semantic analysis. However, there is no ...
Comments