ABSTRACT
In this paper we describe the memory-based machine learning system that we submitted to the BioNLP Shared Task on Event Extraction. We modeled the event extraction task using an approach that has been previously applied to other natural language processing tasks like semantic role labeling or negation scope finding. The results obtained by our system (30.58 F-score in Task 1 and 29.27 in Task 2) suggest that the approach and the system need further adaptation to the complexity involved in extracting biomedical events.
- S. Ananiadou and J. McNaught. 2006. Text Mining for Biology and Biomedicine. Artech House Books, London. Google ScholarDigital Library
- S. Buchholz and E. Marsi. 2006. CoNLL-X shared task on multilingual dependency parsing. In Proc. of the X CoNLL Shared Task, New York. SIGNLL. Google ScholarDigital Library
- M. Bundschus, M. Dejori, M. Stetter, V. Tresp, and H-P Kriegel. 2008. Extraction of semantic biomedical relations from text using conditional random fields. BMC Bioinformatics, 9.Google Scholar
- W. C. Chou, R. T. H. Tsai, Y-S. Su, W. Ku, T-Y Sung, and W-L Hsu. 2006. A semi-automatic method for annotating a biomedical proposition bank. In Proc. of ACL Workshop on Frontiers in Linguistically Annotated Corpora 2006, pages 5--12. Google ScholarDigital Library
- N. Collier, H. S. Park, N. Ogata, Y. Tateisi, C. Nobata, T. Sekimizu, H. Imai, and J. Tsujii. 1999. The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers. In Proc. of EACL 1999. Google ScholarDigital Library
- T. M. Cover and P. E. Hart. 1967. Nearest neighbor pattern classification. Institute of Electrical and Electronics Engineers Transactions on Information Theory, 13:21--27.Google ScholarDigital Library
- W. Daelemans and A. van den Bosch. 2005. Memory-based language processing. Cambridge University Press, Cambridge, UK. Google ScholarDigital Library
- W. Daelemans, A. Van den Bosch, and J. Zavrel. 1999. Forgetting exceptions is harmful in language learning. Machine Learning, Special issue on Natural Language Learning, 34:11--41. Google ScholarDigital Library
- W. Daelemans, J. Zavrel, K. Van der Sloot, and A. Van den Bosch. 2007. TiMBL: Tilburg memory based learner, version 6.1, reference guide. Technical Report Series 07--07, ILK, Tilburg, The Netherlands.Google Scholar
- J. D. Kim, T. Ohta, and J. Tsujii. 2008. Corpus annotation for mining biomedical events from literature. BMC Bioinformatics, 9:10.Google ScholarCross Ref
- M. Krallinger and A. Valencia. 2005. Text-mining and information-retrieval services for molecular biology. Genome Biology, 6:224.Google ScholarCross Ref
- M. Krallinger, F. Leitner, C. Rodriguez-Penagos, and A. Valencia. 2008a. Overview of the protein-protein interaction annotation extraction task of BioCreative II. Genome Biology, 9(Suppl 2):S4.Google ScholarCross Ref
- M. Krallinger, A. Valencia, and L. Hirschman. 2008b. Linking genes to literature: text mining, information extraction, and retrieval applications for biology. Genome Biology, 9(Suppl 2):S8.Google ScholarCross Ref
- R. Morante and W. Daelemans. 2009. A metalearning approach to processing the scope of negation. In Proceedings of CoNLL 2009, Boulder, Colorado. Google ScholarDigital Library
- R. Morante, W. Daelemans, and V. Van Asch. 2008. A combined memory-based semantic role labeler of English. In Proc. of the CoNLL 2008, pages 208--212, Manchester, UK. Google ScholarDigital Library
- C. Nédellec. 2005. Learning language in logic -- genic interaction extraction challenge. In Proc. of Learning Language in Logic Workshop 2005, pages 31--37, Bonn.Google Scholar
- S. Pyysalo, F. Ginter, J. Heimonen, J. Björne, J. Boberg, J. Järvinen, and T. Salakoski. 2007. BioInfer: a corpus for information extraction in the biomedical domain. BMC Bioinformatics, 8(50).Google Scholar
- K. Sagae and J. Tsujii. 2007. Dependency parsing and domain adaptation with lr models and parser ensembles. In Proc. of CoNLL 2007 Shared Task, EMNLP-CoNLL, pages 82--94, Prague. ACL.Google Scholar
- Y. Sasaki, P. Thompson, P. Cotter, J. McNaught, and S. Ananiadou. 2008. Event frame extraction based on a gene regulation corpus. In Proc. of Coling 2008, pages 761--768. Google ScholarDigital Library
- T. Wattarujeekrit, P. K. Shah, and N. Collier. 2004. PASBio: predicate-argument structures for event extraction in molecular biology. BMC Bioinformatics, 5:155.Google ScholarCross Ref
- A. Yakushiji, Y. Tateisi, Y. Miyao, and J. Tsujii. 2001. Event extraction from biomedical papers using a full parser. In Pac Symp Biocomput.Google Scholar
Index Terms
- A memory-based learning approach to event extraction in biomedical texts
Recommendations
Generalizing biomedical event extraction
BioNLP Shared Task '11: Proceedings of the BioNLP Shared Task 2011 WorkshopWe present a system for extracting biomedical events (detailed descriptions of biomolecular interactions) from research articles. This system was developed for the BioNLP'11 Shared Task and extends our BioNLP'09 Shared Task winning Turku Event ...
A pattern approach for biomedical event annotation
BioNLP Shared Task '11: Proceedings of the BioNLP Shared Task 2011 WorkshopWe describe our approach for the GENIA Event Extraction in the Main Task of BioNLP Shared Task 2011. There are two important parts in our method: Event Trigger Annotation and Event Extraction. We use rules and dictionary to annotate event triggers. ...
A semi-supervised learning framework for biomedical event extraction based on hidden topics
ObjectivesScientists have devoted decades of efforts to understanding the interaction between proteins or RNA production. The information might empower the current knowledge on drug reactions or the development of certain diseases. Nevertheless, due to ...
Comments