Definition
Natural language processing is the analysis of linguistic data, most commonly in the form of textual data such as documents or publications, using computational methods. The goal of natural language processing is generally to build a representation of the text that adds structure to the unstructured natural language, by taking advantage of insights from linguistics. This structure can be syntactic in nature, capturing the grammatical relationships among constituents of the text, or more semantic, capturing the meaning conveyed by the text.
Natural language processing is used in systems biology to develop applications that integrate information extracted from the literature with other sources of biological data (see Applied Text Mining).
Characteristics
The typical natural language processing system consists...
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Cohen KB, Verspoor K, Johnson H, Roeder C, Ogren P, Baumgartner W Jr., White E, Tipney H, Hunter L (2011) High-precision biological event extraction: Effects of system and data. Comput Intell 27(4)
Dai H-J, Lai P-T, Tsai RT-H (2010) Multistage gene normalization and svm-based ranking for protein interactor extraction in full-text articles. IEEE/ACM Trans Comput Biol Bioinformatics 7(3):412–420
Ferrucci D, Lally A, Verspoor K (eds) (2009) Unstructured information management architecture (UIMA) Version 1.0. OASIS Standard, 2 Mar 2009
Hakenberg J, Leaman R, Ha Vo N, Jonnalagadda S, Sullivan R, Miller C, Tari L, Baral C, Gonzalez G (2010) Efficient extraction of protein–protein interactions from full-text articles. IEEE/ACM Trans Comput Biol Bioinformatics 7(3):481–494
Hirschman L, Yeh A, Blaschke C, Valencia A (2005) Overview of biocreative: critical assessment of information extraction for biology. BMC Bioinformatics 6(Suppl 1):S1
Hunter L, Bretonnel Cohen K (2006) Biomedical language processing: what’s beyond PubMed? Mol Cell 21:589–594
Kim J-D, Ohta T, Pyysalo S, Kano Y, Tsujii J (2009) Overview of BioNLP’09 shared task on event extraction. In: Proceedings of the Workshop on BioNLP: Shared Task, Association for Computational Linguistics, Boulder, Colorado, pp 1–9
Krallinger M, Morgan A, Smith L, Leitner F, Tanabe L, Wilbur J, Hirschman L, Valencia A (2008) Evaluation of text-mining systems for biology: overview of the second BioCreative community challenge. Genome Biol 9(Suppl 2):S1
Leitner F, Chatr-aryamontri A, Mardis SA, Ceol A, Krallinger M, Licata L, Hirschman L, Cesareni G, Valencia A (2010) The FEBS letters/BioCreative II.5 experiment: making biological information accessible. Nat Biotechnol 28:897–899
Nakov P, Schwartz A, Wolf B, Hearst M (2005) Supporting annotation layers for natural language processing. In: ACL 2005 Poster/Demo Track, Ann Arbor
Porter MF (1980) An algorithm for suffix stripping. Program 14(3):130–137
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media, LLC
About this entry
Cite this entry
Verspoor, K., Cohen, K.B. (2013). Natural Language Processing. In: Dubitzky, W., Wolkenhauer, O., Cho, KH., Yokota, H. (eds) Encyclopedia of Systems Biology. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-9863-7_158
Download citation
DOI: https://doi.org/10.1007/978-1-4419-9863-7_158
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-9862-0
Online ISBN: 978-1-4419-9863-7
eBook Packages: Biomedical and Life SciencesReference Module Biomedical and Life Sciences