Abstract
As proteomics data have become easier to produce, the bottleneck has shifted to the functional analysis of large lists of proteins, which is needed to translate this primary level of information into meaningful biological knowledge. Tools implementing such an approach are a powerful way for biologists and clinicians to gain biological insights into their samples, provided that they have access to computational solutions even when they have little programming experience or bioinformatics support. To achieve this goal, we designed ProteoRE (Proteomics Research Environment), a unified online research service that provides end-users with a set of tools to interpret their proteomics data in a collaborative and reproducible manner. ProteoRE is built upon the Galaxy framework, a workflow system that allows for data and analysis persistence and provides user interfaces to facilitate interaction with tools dedicated to the functional and visual analysis of proteomics datasets. A set of tools relying on computational methods selected for their complementarity in terms of functional analysis was developed and made accessible via the ProteoRE web portal. In this chapter, we describe a step-by-step protocol that links these tools to perform functional annotation and GO-based enrichment analyses on a set of differentially expressed proteins as a use case. Analytical practices, guidelines, and tips related to this strategy are also provided. Tools, datasets, and results are freely available at http://www.proteore.org, allowing researchers to reuse them.
Acknowledgements
This work was partly supported by the “Investissement d’Avenir Infrastructures Nationales en Biologie et Santé” grants ANR-10-INBS-08 (Proteomics French Infrastructure, ProFI) and ANR-11-INBS-0013 (French Institute of Bioinformatics, IFB). We would like to thank the Galaxy community for their support and the following people for their contributions to the design, development, and beta-testing of these tools: Virginie Brun, David Christiany, Benoit Gilquin, Lien Nguyen, Lisa Perus.
Copyright information
© 2021 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Combes, F., Loux, V., Vandenbrouck, Y. (2021). GO Enrichment Analysis for Differential Proteomics Using ProteoRE. In: Cecconi, D. (eds) Proteomics Data Analysis. Methods in Molecular Biology, vol 2361. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1641-3_11
DOI: https://doi.org/10.1007/978-1-0716-1641-3_11
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1640-6
Online ISBN: 978-1-0716-1641-3
eBook Packages: Springer Protocols