Abstract
Identifying the multiple dysregulated oncoproteins that contribute to tumorigenesis in a given patient is crucial for developing personalized treatment plans. However, accurate inference of aberrant protein activity in biological samples is still challenging as genetic alterations are only partially predictive and direct measurements of protein activity are generally not feasible. To address this problem we introduce and experimentally validate a new algorithm, virtual inference of protein activity by enriched regulon analysis (VIPER), for accurate assessment of protein activity from gene expression data. We used VIPER to evaluate the functional relevance of genetic alterations in regulatory proteins across all samples in The Cancer Genome Atlas (TCGA). In addition to accurately infer aberrant protein activity induced by established mutations, we also identified a fraction of tumors with aberrant activity of druggable oncoproteins despite a lack of mutations, and vice versa. In vitro assays confirmed that VIPER-inferred protein activity outperformed mutational analysis in predicting sensitivity to targeted inhibitors.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Hanahan, D. & Weinberg, R.A. Hallmarks of cancer: the next generation. Cell 144, 646–674 (2011).
Weinstein, I.B. Cancer. addiction to oncogenes—the Achilles heal of cancer. Science 297, 63–64 (2002).
Wang, X., Haswell, J.R. & Roberts, C.W. Molecular pathways: SWI/SNF (BAF) complexes are frequently mutated in cancer—mechanisms and potential therapeutic insights. Clin. Cancer Res. 20, 21–27 (2014).
Sumazin, P. et al. An extensive microRNA-mediated network of RNA–RNA interactions regulates established oncogenic pathways in glioblastoma. Cell 147, 370–381 (2011).
Chen, J.C. et al. Identification of causal genetic drivers of human disease through systems-level analysis of regulatory networks. Cell 159, 402–414 (2014).
Basu, A. et al. An interactive resource to identify cancer genetic and lineage dependencies targeted by small molecules. Cell 154, 1151–1161 (2013).
Barretina, J. et al. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607 (2012).
MacConaill, L.E. et al. Prospective enterprise-level molecular genotyping of a cohort of cancer patients. J. Mol. Diagn. 16, 660–672 (2014).
Klein, U. et al. Transcriptional analysis of the B cell germinal center reaction. Proc. Natl. Acad. Sci. USA 100, 2639–2644 (2003).
Alizadeh, A.A. et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503–511 (2000).
Tothill, R.W. et al. Novel molecular subtypes of serous and endometrioid ovarian cancer linked to clinical outcome. Clin. Cancer Res. 14, 5198–5208 (2008).
Creighton, C.J. et al. Residual breast cancers after conventional therapy display mesenchymal as well as tumor-initiating features. Proc. Natl. Acad. Sci. USA 106, 13820–13825 (2009).
Wolf-Yadlin, A., Sevecka, M. & MacBeath, G. Dissecting protein function and signaling using protein microarrays. Curr. Opin. Chem. Biol. 13, 398–405 (2009).
Božović, A. & Kulasingam, V. Quantitative mass spectrometry–based assay development and validation: from small molecules to proteins. Clin. Biochem. 46, 444–455 (2013).
Rodríguez, J.A. Interplay between nuclear transport and ubiquitin/SUMO modifications in the regulation of cancer-related proteins. Semin. Cancer Biol. 27, 11–19 (2014).
Rhodes, D.R. et al. Mining for regulatory programs in the cancer transcriptome. Nat. Genet. 37, 579–583 (2005).
Lefebvre, C. et al. A human B-cell interactome identifies MYB and FOXM1 as master regulators of proliferation in germinal centers. Mol. Syst. Biol. 6, 377 (2010).
Carro, M.S. et al. The transcriptional network for mesenchymal transformation of brain tumours. Nature 463, 318–325 (2010).
Chudnovsky, Y. et al. ZFHX4 interacts with the NuRD core member CHD4 and regulates the glioblastoma tumor-initiating cell state. Cell Rep. 6, 313–324 (2014).
Aytes, A. et al. Cross-species regulatory network analysis identifies a synergistic interaction between FOXM1 and CENPF that drives prostate cancer malignancy. Cancer Cell 25, 638–651 (2014).
Piovan, E. et al. Direct reversal of glucocorticoid resistance by AKT inhibition in acute lymphoblastic leukemia. Cancer Cell 24, 766–776 (2013).
Forbes, S.A. et al. COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic Acids Res. 39, D945–D950 (2011).
Basso, K. et al. Reverse engineering of regulatory networks in human B cells. Nat. Genet. 37, 382–390 (2005).
Lachmann, A. et al. ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments. Bioinformatics 26, 2438–2444 (2010).
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Krämer, A., Green, J., Pollard, J. Jr. & Tugendreich, S. Causal analysis approaches in Ingenuity Pathway Analysis. Bioinformatics 30, 523–530 (2014).
Abatangelo, L. et al. Comparative study of gene set enrichment methods. BMC Bioinformatics 10, 275 (2009).
Boorsma, A., Foat, B.C., Vis, D., Klis, F. & Bussemaker, H.J. T-profiler: scoring the activity of predefined groups of genes using gene expression data. Nucleic Acids Res. 33, W592–W595 (2005).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
Jiang, Z. & Gentleman, R. Extensions to gene set enrichment. Bioinformatics 23, 306–313 (2007).
Dinu, I. et al. Improving gene set analysis of microarray data by SAM-GS. BMC Bioinformatics 8, 242 (2007).
Wang, K. et al. Genome-wide identification of post-translational modulators of transcription factor activity in human B cells. Nat. Biotechnol. 27, 829–839 (2009).
Lamb, J. et al. The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313, 1929–1935 (2006).
Basso, K. et al. Integrated biochemical and computational approach identifies BCL6 direct target genes controlling multiple pathways in normal germinal center B cells. Blood 115, 975–984 (2010).
Kushwaha, R. et al. Interrogation of a context-specific transcription factor network identifies novel regulators of pluripotency. Stem Cells 33, 367–377 (2015).
Adjibade, P. & Mazroui, R. Control of mRNA turnover: implication of cytoplasmic RNA granules. Semin. Cell Dev. Biol. 34, 15–23 (2014).
Harris, S.L. & Levine, A.J. The p53 pathway: positive and negative feedback loops. Oncogene 24, 2899–2908 (2005).
Luo, J., Solimini, N.L. & Elledge, S.J. Principles of cancer therapy: oncogene and non-oncogene addiction. Cell 136, 823–837 (2009).
Compagno, M. et al. Mutations of multiple genes cause deregulation of NF-κB in diffuse large B-cell lymphoma. Nature 459, 717–721 (2009).
Lee, E., Chuang, H.Y., Kim, J.W., Ideker, T. & Lee, D. Inferring pathway activity toward precise disease classification. PLoS Comput. Biol. 4, e1000217 (2008).
Vaske, C.J. et al. Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics 26, i237–i245 (2010).
Segal, E. et al. Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat. Genet. 34, 166–176 (2003).
Yörük, E., Ochs, M.F., Geman, D. & Younes, L. A comprehensive statistical model for cell signaling. IEEE/ACM Trans. Comput. Biol. Bioinform. 8, 592–606 (2011).
Boorsma, A., Lu, X.J., Zakrzewska, A., Klis, F.M. & Bussemaker, H.J. Inferring condition-specific modulation of transcription factor activity in yeast through regulon-based analysis of genomewide expression. PLoS One 3, e3112 (2008).
Foat, B.C., Morozov, A.V. & Bussemaker, H.J. Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE. Bioinformatics 22, e141–e149 (2006).
Kundaje, A. et al. Learning regulatory programs that accurately predict differential expression with MEDUSA. Ann. NY Acad. Sci. 1115, 178–202 (2007).
di Bernardo, D. et al. Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks. Nat. Biotechnol. 23, 377–383 (2005).
Phillips, H.S. et al. Molecular subclasses of high-grade glioma predict prognosis, delineate a pattern of disease progression, and resemble stages in neurogenesis. Cancer Cell 9, 157–173 (2006).
Margolin, A.A. et al. ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformatics 7 (suppl. 1), S7 (2006).
R Development Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2012).
Gautier, L., Cope, L., Bolstad, B.M. & Irizarry, R.A. affy—analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20, 307–315 (2004).
Alvarez, M.J., Sumazin, P., Rajbhandari, P. & Califano, A. Correlating measurements across samples improves accuracy of large-scale expression profile experiments. Genome Biol. 10, R143 (2009).
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
Dudoit, S., Gentleman, R.C. & Quackenbush, J. Open source software for the analysis of microarray data. Biotechniques (suppl.), 45–51 (2003).
Ashburner, M. et al. Gene ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Ding, B.B. et al. Constitutively activated STAT3 promotes cell proliferation and survival in the activated B-cell subtype of diffuse large B-cell lymphomas. Blood 111, 1515–1523 (2008).
Kim, S.Y. & Volsky, D.J. PAGE: parametric analysis of gene set enrichment. BMC Bioinformatics 6, 144 (2005).
Tian, L. et al. Discovering statistically significant pathways in expression profiling studies. Proc. Natl. Acad. Sci. USA 102, 13544–13549 (2005).
Park, S.Y., Gönen, M., Kim, H.J., Michor, F. & Polyak, K. Cellular and genetic diversity in the progression of in situ human breast carcinomas to an invasive phenotype. J. Clin. Invest. 120, 636–644 (2010).
Park, C.C. et al. β1 integrin inhibitory antibody induces apoptosis of breast cancer cells, inhibits growth, and distinguishes malignant from normal phenotype in three dimensional cultures and in vivo. Cancer Res. 66, 1526–1535 (2006).
Wang, K. et al. Dissecting the interface between signaling and transcriptional regulation in human B cells. Pac. Symp. Biocomput. 2009, 264–275 (2009).
Benaglia, T., Chauveau, D., Hunter, D.R. & Young, D.S. mixtools: an R package for analyzing finite mixture models. J. Stat. Softw. 32, 1–29 (2009).
Acknowledgements
We thank G. Riekhof for critical insight and help with drafting the manuscript. This work was supported by the National Institutes of Health (NIH) Roadmap National Centers for Biomedical Computing (5U54CA121852), the NIH Library of Integrated Network-based Cellular Signatures program (1U01CA164184), the National Cancer Institute (NCI) Cancer Target Discovery and Development program (1U01CA168426), and the NIH instrumentation grants (S10OD012351 and S10OD021764). Additional support was from NIH (R01CA85573) to B.H.Y. and a fellowship grant from the Lauri Strauss Leukemia Foundation to B.B.D. The results published here are in whole or part based upon data generated by The Cancer Genome Atlas pilot project established by the NCI and NHGRI as of January 2011.
Author information
Authors and Affiliations
Contributions
M.J.A. conceptualized and developed the algorithms, designed the experiments, analyzed the data and wrote the manuscript. Y.S., F.M.G. and A.L. analyzed the data. B.B.D. generated the BCL6 knockdown experiment and expression profile. B.H.Y. designed the experiments used for benchmarking the algorithms with transcription factor knockdown assays. A.C. conceptualized the algorithm, directed the project, designed the experiments and wrote the manuscript.
Corresponding authors
Ethics declarations
Competing interests
M.J.A. is chief scientific officer of DarwinHealth Inc. A.C. is a founder of DarwinHealth Inc.
Supplementary information
Supplementary Text and Figures
Supplementary Note, Supplementary Tables 1–6 and 9, and Supplementary Figures 1–19. (PDF 8818 kb)
Supplementary Table 7
VIPER-inferred protein activity for 2,956 regulatory proteins (rows) after perturbation of MCF7 cells with 156 small-molecule compounds (columns). Protein activity predictions were based on the Connectivity Map dataset9 and a breast carcinoma regulatory network inferred from 1,037 RNA-seq profiles from TCGA. The table lists the VIPER-inferred relative protein activity as normalized enrichment score values. The first two columns indicate the gene identifier and gene symbol corresponding to the evaluated regulatory protein. The first row lists the small molecule compound, molar concentration and time of exposure. (XLSX 9134 kb)
Supplementary Table 8
Number of samples harboring nonsilent somatic mutations in COSMIC genes. (XLSX 67 kb)
Rights and permissions
About this article
Cite this article
Alvarez, M., Shen, Y., Giorgi, F. et al. Functional characterization of somatic mutations in cancer using network-based inference of protein activity. Nat Genet 48, 838–847 (2016). https://doi.org/10.1038/ng.3593
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/ng.3593
This article is cited by
-
Transcriptional and metabolic effects of aspartate-glutamate carrier isoform 1 (AGC1) downregulation in mouse oligodendrocyte precursor cells (OPCs)
Cellular & Molecular Biology Letters (2024)
-
KLF5 regulates actin remodeling to enhance the metastasis of nasopharyngeal carcinoma
Oncogene (2024)
-
Mechanism-centric regulatory network identifies NME2 and MYC programs as markers of Enzalutamide resistance in CRPC
Nature Communications (2024)
-
Network integration of thermal proteome profiling with multi-omics data decodes PARP inhibition
Molecular Systems Biology (2024)
-
Immunomodulatory response to neoadjuvant nivolumab in non-metastatic clear cell renal cell carcinoma
Scientific Reports (2024)