Chemical composition and the potential for proteomic transformation in cancer, hypoxia, and hyperosmotic stress

Jeffrey M. Dick

doi:10.7717/peerj.3421

Chemical composition and the potential for proteomic transformation in cancer, hypoxia, and hyperosmotic stress

Jeffrey M. Dick

Wattanothaipayap School, Chiang Mai, Thailand

DOI: 10.7717/peerj.3421

Published: 2017-06-06
Accepted: 2017-05-16
Received: 2017-03-21

Academic Editor: Maria Cristina Albertini

Subject Areas: Biochemistry, Mathematical Biology, Oncology
Keywords: Compositional biology, Thermodynamic potential, Redox balance

Copyright: © 2017 Dick
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Dick JM. 2017. Chemical composition and the potential for proteomic transformation in cancer, hypoxia, and hyperosmotic stress. PeerJ 5:e3421 https://doi.org/10.7717/peerj.3421

The author has chosen to make the review history of this article public.

Abstract

The changes of protein expression that are monitored in proteomic experiments are a type of biological transformation that also involves changes in chemical composition. Accompanying the myriad molecular-level interactions that underlie any proteomic transformation, there is an overall thermodynamic potential that is sensitive to microenvironmental conditions, including local oxidation and hydration potential. Here, up- and down-expressed proteins identified in 71 comparative proteomics studies were analyzed using the average oxidation state of carbon (Z_C) and water demand per residue ( ${\bar{n}}_{H_{2} O}$ ), calculated using elemental abundances and stoichiometric reactions to form proteins from basis species. Experimental lowering of oxygen availability (hypoxia) or water activity (hyperosmotic stress) generally results in decreased Z_C or ${\bar{n}}_{H_{2} O}$ of up-expressed compared to down-expressed proteins. This correspondence of chemical composition with experimental conditions provides evidence for attraction of the proteomes to a low-energy state. An opposite compositional change, toward higher average oxidation or hydration state, is found for proteomic transformations in colorectal and pancreatic cancer, and in two experiments for adipose-derived stem cells. Calculations of chemical affinity were used to estimate the thermodynamic potentials for proteomic transformations as a function of fugacity of O₂ and activity of H₂O, which serve as scales of oxidation and hydration potential. Diagrams summarizing the relative potential for formation of up- and down-expressed proteins have predicted equipotential lines that cluster around particular values of oxygen fugacity and water activity for similar datasets. The changes in chemical composition of proteomes are likely linked with reactions among other cellular molecules. A redox balance calculation indicates that an increase in the lipid to protein ratio in cancer cells by 20% over hypoxic cells would generate a large enough electron sink for oxidation of the cancer proteomes. The datasets and computer code used here are made available in a new R package, canprot.

Introduction

The relationship between cells and tissue microenvironments is a topic of vital importance for cancer biology. Because of rapid cellular proliferation and irregular vascularization, tumors often develop regions of hypoxia (Höckel & Vaupel, 2001). Tumor microenvironments also exhibit abnormal ranges of other physical-chemical variables, including hydration state (McIntyre, 2006; Abramczyk et al., 2014).

Some aspects of the complex metazoan response to hypoxia are mediated by hypoxia-inducible factor 1 (HIF-1). HIF-1 is a transcription factor that is tagged for degradation in normoxic conditions. Under hypoxia, the degradation of HIF-1 is suppressed; HIF-1 can then enter the nucleus and activate the transcription of downstream targets (Semenza, 2003). Indeed, transcriptional targets of HIF-1 are found to be differentially expressed in proteomic datasets for laboratory hypoxia (Cifani et al., 2011; McMahon et al., 2012). However, proteomic studies of cells in hypoxic conditions provide many examples of proteins that are not directly regulated by HIF-1 (McMahon et al., 2012; Fuhrmann et al., 2013), and cancer proteomic datasets also include many proteins that are not known to be regulated by HIF-1.

The complexity of the underlying regulatory mechanisms (McMahon et al., 2012) and the large differences between levels of gene expression and protein abundance (van den Beucken et al., 2011; Cifani et al., 2011; Ho et al., 2016) present many difficulties for a bottom-up understanding of global proteomic trends. As a counterpart to molecular explanations, a systems perspective can incorporate higher-level constraints (Drack & Wolkenhauer, 2011). A commonly used metaphor in systems biology is attractor landscapes. The basins of attraction are defined by dynamical systems behavior, but in many cases are analogous to minimum-energy states in thermodynamics (Emmeche, Koppe & Stjernfelt, 2000; Enver et al., 2009). Nevertheless, little attention has been given to the thermodynamic potential that is inherent to the compositional difference between the up-expressed and down-expressed proteins in proteomic experiments. Such a high-level perspective may require concepts and language that differ from those applicable to molecular interactions (Ellis, 2015).

To better understand the microenvironmental context for compositional changes, this study uses proteomic data as input into a descriptive thermodynamic model. First, a compositional analysis of differentially (up- and down-) expressed proteins identifies consistent trends in the oxidation and hydration states of proteomes of colorectal cancer (CRC), pancreatic cancer, and cells exposed to hypoxia or hyperosmotic stress. These results lay the groundwork for using a thermodynamic model to quantify environmental constraints on the potential for proteomic transformation. Finally, the Discussion section explores some implications of the hypothesis that elevated synthesis of lipids provides an electron sink for the oxidation of proteomes. In this situation, some cancer systems may develop an abnormally large redox disproportionation between pools of cellular biomacromolecules.

Methods

Data sources

Tables 1–4 present the sources of data. Protein IDs and expression (up/down or abundance ratios) were found in the literature, often being reported in the supporting information (SI) or supplementary (suppl.) tables. In some cases, source tables were further processed, using fold-change and significance cutoffs that, where possible, are based on statements made in the primary publication. The data are stored as *.csv files in the R package canprot, which was developed during this study (see http://github.com/jedick/canprot) and is provided as Dataset S1.

Table 1:

Selected proteomic datasets for colorectal cancer.^*

Here and in Tables 2–4, n₁ and n₂ stand for the numbers of down- and up-expressed proteins, respectively, in each dataset.

Set	n₁	n₂	Description	Set	n₁	n₂	Description
ΩaAⒶ	57	70	T/N	ΩsAⒶ	73	175	MSS-type T/N^a
ΩbAⒶ	101	28	CRC C/A^a	ΩtAⒶ	79	677	T/N
ΩcAⒶ	87	81	CIN C/A^a	ΩuAⒶ	55	68	CM T/N^b
ΩdAⒶ	157	76	MIN C/A^a	ΩvAⒶ	33	37	stromal T/N^a
ΩeAⒶ	43	56	biomarkers up/down	ΩwAⒶ	51	55	chromatin-binding C/A
ΩfAⒶ	48	166	stage I/normal^b	ΩxAⒶ	58	65	epithelial A/N
ΩgAⒶ	77	321	stage II/normal^b	ΩyAⒶ	44	210	tissue secretome T/N^a
ΩhAⒶ	61	57	microdissected T/N^b	ΩzAⒶ	113	66	membrane enriched T/N
ΩiAⒶ	71	92	adenoma/normal^a	ΩAAⒶ	1061	1254	A/N
ΩjAⒶ	109	72	stage I/normal^a	ΩBAⒶ	772	1007	C/A
ΩkAⒶ	164	140	stage II/normal^a	ΩCAⒶ	879	1281	C/N
ΩlAⒶ	63	131	stage III/normal^a	ΩDAⒶ	123	75	stromal AD/NC^a
ΩmAⒶ	42	26	stage IV/normal^a	ΩEAⒶ	125	60	stromal CIS/NC^a
ΩnAⒶ	72	45	T/N	ΩFAⒶ	99	75	stromal ICC/NC^a
ΩoAⒶ	335	288	A/N	ΩGAⒶ	191	178	biopsy T/N^b
ΩpAⒶ	373	257	C/A	ΩHAⒶ	113	86	AD/NC^a
ΩqAⒶ	351	232	C/N	ΩIAⒶ	169	138	CIS/NC^a
ΩrAⒶ	75	61	poor/good prognosis^b	ΩJAⒶ	129	100	ICC/NC^a

DOI: 10.7717/peerj.3421/table-1

Notes:

T: tumor
N: normal
C: carcinoma or adenocarcinoma
A: adenoma
CM: conditioned media
AD: adenomatous colon polyps
CIS: carcinoma in situ
ICC: invasive colonic carcinoma
NC: non-neoplastic colonic mucosa

*ΩaAⒶ Source: Table 1 and Suppl. Data 1 of Watanabe et al. (2008). ΩbAⒶΩcAⒶΩdAⒶ Nuclear matrix proteome; chromosomal instability (CIN), microsatellite instability (MIN), or both types (CRC). Source: Suppl. Tables 5–7 of Albrethsen et al. (2010). ΩeAⒶ Candidate serum biomarkers. Source: Table 4 of Jimenez et al. (2010). ΩfAⒶ ΩgAⒶ Source: Suppl. Table 4 of Xie et al. (2010). ΩhAⒶ Source: Suppl. Table 4 of Zhang et al. (2010). ΩiAⒶΩjAⒶΩkAⒶΩlAⒶΩmAⒶ Source: Suppl. Table 9 of Besson et al. (2011). ΩnAⒶ Source: Suppl. Table 2 of Jankova et al. (2011). ΩoAⒶ ΩpAⒶ ΩqAⒶ Source: Table S8 of Mikula et al. (2011). ΩrAⒶ Source: extracted from Suppl. Table 5 of Kim et al. (2012), including proteins with abundance ratio >2 or <0.5. ΩsAⒶ Microsatellite stable (MSS) type CRC tissue. Source: Suppl. Table 4 of Kang et al. (2012). ΩtAⒶ Source: Suppl. Table 4 of Wiśniewski et al. (2012). ΩuAⒶ Source: Suppl. Table 2 of Yao et al. (2012). ΩvAⒶ Source: Table 1 of Mu et al. (2013). ΩwAⒶ Source: Table 2 of Knol et al. (2014). ΩxAⒶ Source: Table III of Uzozie et al. (2014). ΩyAⒶ Source: Suppl. Table 1 of de Wit et al. (2014). ΩzAⒶ Source: Supporting Table 2 of Sethi et al. (2015). ΩAAⒶΩBAⒶΩCAⒶ Source: SI Table 3 of Wiśniewski et al. (2015). ΩDAⒶ ΩEAⒶ ΩFAⒶ Source: Suppl. Table S3 of Li et al. (2016). ΩGAⒶ Source: extracted from SI Table S3 of Liu et al. (2016), including proteins with p-value < 0.05. ΩHAⒶΩIAⒶΩJAⒶ Source: Suppl. Table 4 of Peng et al. (2016).

aGene names or GI numbers were converted to UniProt IDs using the UniProt mapping tool.

bIPI numbers were converted to UniProt IDs using the DAVID conversion tool.

Table 2:

Selected proteomic datasets for pancreatic cancer.^*

Set	n₁	n₂	Description	Set	n₁	n₂	Description
ΩaAⒶ	41	69	T/N	ΩlAⒶ	29	73	FFPE PC/AIP^c
ΩbAⒶ	60	88	T/N^a	ΩmAⒶ	53	73	FFPE PC/CP^c
ΩcAⒶ	48	54	T/N^a	ΩnAⒶ	83	32	low-grade T/N^a
ΩdAⒶ	19	95	CP/N^a	ΩoAⒶ	224	176	high-grade T/N^a
ΩeAⒶ	28	29	T/N	ΩpAⒶ	208	219	T/N (no DM)^a
ΩfAⒶ	38	45	T/N^b	ΩqAⒶ	56	167	T/N (DM)^a
ΩgAⒶ	207	152	FFPE T/N^a	ΩrAⒶ	227	148	LCM PDAC/ANT^c
ΩhAⒶ	108	86	accessible T/N^c	ΩsAⒶ	65	34	T/N
ΩiAⒶ	38	47	FFPE T/N^c	ΩtAⒶ	35	51	mouse 2.5 w T/N^a
ΩjAⒶ	78	57	T/N^a	ΩuAⒶ	40	73	mouse 3.5 w T/N^a
ΩkAⒶ	257	456	T/N^a	ΩvAⒶ	49	84	mouse 5 w T/N^a
				ΩwAⒶ	37	108	mouse 10 w T/N^a

DOI: 10.7717/peerj.3421/table-2

Notes:

T: tumor
N: normal
CP: chronic pancreatitis
AIP: autoimmune pancreatitis
PC: pancreatic cancer
DM: diabetes mellitus
PDAC: pancreatic ductal adenocarcinoma
ANT: adjacent normal tissue
FFPE: formalin-fixed paraffin-embedded
LCM: laser-capture microdissection
NP: normal pancreas

*ΩaAⒶ Pooled tissue samples of PC and matched normal tissue from 12 patients. Source: Tables 2 and 3 of Lu et al. (2004). ΩbAⒶ Two PC and two NP samples. Source: Tables 1 and 2 of Chen et al. (2005). ΩcAⒶ Large-scale immunoblotting (PowerBlot) of 8 tissue specimens of pancreatic intraepithelial neoplasia compared to NP and CP. Source: Table 2 of Crnogorac-Jurcevic et al. (2005). ΩdAⒶ Tissue specimens from patients with CP and 10 control specimens from patients with NP. Source: Table 1 of Chen et al. (2007). ΩeAⒶ 12 carcinoma samples (PDAC), 12 benign pancreatic cystadenomas and 10 normal tissues adjacent to the PDAC primary mass. Source: Table 1 of Cui et al. (2009). ΩfAⒶ Source: extracted from Table S2 of McKinney et al. (2011). ΩgAⒶ PDAC compared to NP. Source: Suppl. Table 3 of Pan et al. (2011). ΩhAⒶ Potentially accessible proteins in fresh samples of PC tumors (three patients) vs normal tissue (two patients with NP and one with CP). Source: extracted from the SI Table of Turtoi et al. (2011). ΩiAⒶ 11 tissue specimens containing >50% cancer and 8 unmatched, uninvolved tissues adjacent to pancreatitis. Source: Suppl. Tables 2 and 3 of Kojima et al. (2012). ΩjAⒶ Fresh-frozen PDAC tissue specimens from seven patients vs a pooled mixture of three normal main pancreatic duct tissue samples. Source: extracted from SI Table S3 of Kawahara et al. (2013), including proteins with an expression ratio >2 [or <0.5] in at least five of the seven experiments and ratio >1 [or <1] in all experiments. ΩkAⒶ Frozen samples of PDAC tumors vs adjacent benign tissue from four patients. Source: Suppl. Table 2 of Kosanam et al. (2013). ΩlAⒶΩmAⒶ Tissue samples from three patients with PC vs 3 patients with AIP or three patients with CP. Source: extracted from Tables 2, 3, and 4 of Paulo et al. (2013). ΩnAⒶ ΩoAⒶ 12 samples each (pooled) of low-grade tumor or high-grade tumor vs non-tumor. Source: extracted from Suppl. Tables S4 and S5 of Wang et al. (2013b), including proteins with ratios ≥3/2 or ≤2/3 for at least two of the four groups, and with expression differences for all four groups in the same direction. ΩpAⒶΩqAⒶ Source: extracted from Suppl. Tables S3 and S4 of Wang et al. (2013a), including proteins with >3/2 or <2/3 fold change in at least 3 of 4 iTRAQ experiments for different pooled samples. ΩrAⒶ LCM of CD24⁺ cells from PDAC vs CD24⁻ cells from adjacent normal tissue (ANT). Source: SI Table S5 of Zhu et al. (2013). ΩsAⒶ Matched PDAC and normal tissue from nine patients. Source: extracted from SI Table S5 of Iuga et al. (2014), excluding “not passed” proteins (those with inconsistent regulation). ΩtAⒶΩuAⒶΩvAⒶΩwAⒶ PDAC tumors in transgenic mice vs pancreas in normal mice, at time points of 2.5, 3.5, 5 and 10 weeks. Source: Suppl. Table of Kuo et al. (2016).

aGene names, IPI numbers or UniProt names were converted to UniProt IDs using the UniProt mapping tool.

bIPI numbers were converted to UniProt IDs using the DAVID conversion tool.

cIncludes differentially expressed proteins shared between groups and proteins identified in only one group.

Table 3:

Selected proteomic datasets for hypoxia and reoxygenation experiments or growth in 3D culture.^*

Set	n₁	n₂	Description	Set	n₁	n₂	Description	Set	n₁	n₂	Description
ΩaAⒶ	37	24	U937^a	ΩkAⒶ	56	40	THP-1	ΩvAⒶ	113	154	CRC-derived SPH
ΩbAⒶ	41	22	placental secretome	ΩlAⒶ	178	77	A431 Hx48	ΩwAⒶ	127	292	HepG2/C3A SPH
ΩcAⒶ	71	19	B104	ΩmAⒶ	69	54	A431 Hx72	ΩxAⒶ	53	72	HeLa
ΩdAⒶ	87	28	DU145^a	ΩnAⒶ	48	36	A431 ReOx	ΩyAⒶ	137	64	U87MG and 786-O
ΩeAⒶ	29	21	SK-N-BE(2)c; IMR-32	ΩoAⒶ	141	64	SH-SY5Y	ΩzAⒶ	129	141	HCT116 transcription^a
ΩfAⒶ	53	65	H9C2^b	ΩpAⒶ	65	34	A431 Hx48-S	ΩAAⒶ	469	1024	HCT116 translation^a
ΩgAⒶ	409	337	MCF-7 SPH P5	ΩqAⒶ	137	61	A431 Hx72-S	ΩBAⒶ	66	50	adipose-derived SC^a
ΩhAⒶ	248	214	MCF-7 SPH P2	ΩrAⒶ	56	49	A431 ReOx-S	ΩCAⒶ	65	27	cardiomyocytes CoCl₂^a
ΩiAⒶ	48	52	SPH perinecrotic^a	ΩsAⒶ	74	44	A431 Hx48-P	ΩDAⒶ	35	69	cardiomyocytes SAL^a
ΩjAⒶ	101	186	SPH necrotic^a	ΩtAⒶ	67	53	A431 Hx72-P	ΩEAⒶ	116	225	HT29 SPH
				ΩuAⒶ	41	31	A431 ReOx-P

DOI: 10.7717/peerj.3421/table-3

Notes:

U937: acute promonocytic leukemic cells
B104: rat neuroblastoma cells
DU145: prostate carcinoma cells
SK-N-BE(2)c; IMR-32; SH-SY5Y: neuroblastoma cells
H9C2: rat heart myoblast
MCF-7: breast cancer cells
THP-1: macrophages
A431: epithelial carcinoma cells
Hx48: hypoxia 48 h
Hx72: hypoxia 72 h
ReOx: hypoxia 48 h followed by reoxygenation for 24 h
-S: supernatant fraction
-P: pellet fraction
SPH: spheroids
HepG2/C3A: hepatocellular carcinoma cells
U87MG: glioblastoma
786-O: renal clear cell carcinoma cells
HCT116; HT29: colon cancer cells
SC: stem cells
SAL: salidroside

*ΩaAⒶ 2% O₂ vs normoxic conditions. Source: Table 1 of Han et al. (2006). ΩbAⒶ 1% vs 6% O₂. Source: Tables 2 and 3 of Blankley et al. (2010). ΩcAⒶ Expression ratios HYP/LSC (oxygen deprivation/low serum control) >1.2 or <0.83. Source: calculated using data from Suppl. Table 2 of Datta et al. (2010), including proteins with p-value < 0.05 and EF < 1.4. ΩdAⒶ Translationally regulated genes. Source: Suppl. Tables 1–4 of van den Beucken et al. (2011). ΩeAⒶ 1% O₂ for 72 h vs standard conditions. Source: Suppl. Table 1(a) of Cifani et al. (2011). ΩfAⒶ Hypoxic vs control conditions for 16 h. Source: Suppl. Table S5 of Li et al. (2012). ΩgAⒶ ΩhAⒶ Tumorspheres (50 to 200 μm diameter) at passage 5 (P5) or 2 (P2) compared to adherent cells. Source: Sheets 2 and 3 in Table S1 of Morrison et al. (2012). ΩiAⒶ ΩjAⒶ Perinecrotic and necrotic regions compared to surface of multicell spheroids (∼600 μm diameter) (expression ratios <0.77 or >1.3). Source: Suppl. Table 1C of McMahon et al. (2012). ΩkAⒶ Incubation for several days under hypoxia (1% O₂). Source: Suppl. Table 2A of Fuhrmann et al. (2013) (control virus cells). ΩlAⒶΩmAⒶΩnAⒶ Source: extracted from Suppl. Table 1 of Ren et al. (2013), including proteins with iTRAQ ratios <0.83 or >1.2 and p-value < 0.05. ΩoAⒶ 5% O₂ vs atmospheric levels of O₂ (normalized expression ratio >1.2 or <0.83). Source: SI table of Villeneuve et al. (2013). ΩpAⒶΩqAⒶΩrAⒶΩsAⒶΩtAⒶΩuAⒶ The comparisons here include proteins with p < 0.05. Source: Suppl. Table S1 of Dutta et al. (2014). ΩvAⒶ Organotypic spheroids (∼250 μm diameter) vs lysed CRC tissue. Source: extracted from Table S2 of Rajcevic et al. (2014), filtered as follows: at least two of three experiments have differences in spectral counts, absolute overall fold change is at least 1.5, and p-value is less than 0.05. ΩwAⒶ SPH vs classical cell culture (2D growth) (log₂ fold change at least ±1). Source: P1_Data sheet in the SI of Wrzesinski et al. (2014). ΩxAⒶ 1% vs 19% O₂. Source: Table S1 of Bousquet et al. (2015). ΩyAⒶ 1% O₂ for 24 h (fold change <0.5 or >1 for proteins detected in only hypoxic or only normoxic conditions). Source: Table S1 of Ho et al. (2016). ΩzAⒶΩAAⒶ Microarray analysis of differential gene expression in the transcriptome (total rRNA) and translatome (polysomal/total RNA ratio) of cells grown in normal and hypoxic (1% O₂) conditions. Source: data file supplied by Ming-Chih Lai (Lai, Chang & Sun, 2016). ΩBAⒶ ASC from three donors cultured for 24 h in hypoxic (1% O₂) vs normoxic (20% O₂) conditions. Source: Tables 1 and 2 of Riis et al. (2016). ΩCAⒶ ΩDAⒶ Rat cardiomyocytes treated with CoCl₂ (hypoxia mimetic) vs control or with SAL (anti-hypoxic) vs CoCl₂. Source: SI Tables 1S and 2S of Xu et al. (2016). ΩEAⒶ 800 μm spheroids vs 2D monolayers. Source: Tables S1a–b of Yue et al. (2016).

aGene names, GI numbers, or other IDs were converted to UniProt IDs using the UniProt mapping tool.

bIPI numbers were converted to UniProt IDs using the DAVID conversion tool.

Table 4:

Selected proteomic datasets for hyperosmotic stress experiments.^*

Set	n₁	n₂	Description	Set	n₁	n₂	Description
ΩaAⒶ	38	44	S. cerevisiae VHG 2 h^a	ΩnAⒶ	49	28	eel gill^a
ΩbAⒶ	33	62	S. cerevisiae VHG 10 h^a	ΩoAⒶ	78	77	S. cerevisiae t30a^b
ΩcAⒶ	18	65	S. cerevisiae VHG 12 h^a	ΩpAⒶ	67	67	S. cerevisiae t30b^b
ΩdAⒶ	63	94	mouse pancreatic islets	ΩqAⒶ	87	87	S. cerevisiae t30c^b
ΩeAⒶ	148	44	adipose-derived stem cells	ΩrAⒶ	25	38	IOBA-NHC
ΩfAⒶ	17	11	ARPE-19 25 mM	ΩsAⒶ	105	96	CAUCR succinate tr.^a
ΩgAⒶ	21	24	ARPE-19 100 mM	ΩtAⒶ	209	142	CAUCR NaCl tr.^a
ΩhAⒶ	114	61	ECO57 25 °C, a_w 0.985^a	ΩuAⒶ	33	33	CAUCR succinate pr.^a
ΩiAⒶ	238	61	ECO57 14 °C, a_w 0.985^a	ΩvAⒶ	33	27	CAUCR NaCl pr.^a
ΩjAⒶ	263	56	ECO57 25 °C, a_w 0.967^a	ΩwAⒶ	294	205	CHO all^a
ΩkAⒶ	372	73	ECO57 14 °C, a_w 0.967^a	ΩxAⒶ	66	75	CHO high^a
ΩlAⒶ	32	39	Chang liver cells 25 mM	ΩyAⒶ	14	28	Yarrowia lipolytica^b
ΩmAⒶ	19	50	Chang liver cells 100 mM	ΩzAⒶ	160	141	Paracoccidioides lutzii^a

DOI: 10.7717/peerj.3421/table-4

Notes:

VHG: very high glucose
ARPE-19: human retinal pigmented epithelium cells
ECO57: Escherichia coli O157:H7 Sakai
IOBA-NHC: human conjunctival epithelial cells
CAUCR: Caulobacter crescentus
tr: transcriptome
pr: proteome
CHO: Chinese hamster ovary cells

*ΩaAⒶΩbAⒶΩcAⒶ VHG (300 g/L) vs control (20 g/L). The comparisons here use proteins with expression ratios <0.9 or >1.1 and with p-values < 0.05. Source: SI Table of Pham & Wright (2008). ΩdAⒶ 24 h at 16.7 mM vs 5.6 mM glucose. Source: extracted from Suppl. Table ST4 of Waanders et al. (2009); including the red- and blue-highlighted rows in the source table (those with ANOVA p-value < 0.01), and applying the authors’ criterion that proteins be identified by 2 or more unique peptides in at least 4 of the 8 most intense LC-MS/MS runs. ΩeAⒶ 300 mOsm (control) or 400 mOsm (NaCl treatment). Source: Suppl. Table 1 of Oswald et al. (2011). ΩfAⒶ ΩgAⒶ Mannitol-balanced 5.5 (control), 25 or 100 mM d-glucose media. Source: Table 1 of Chen et al. (2012). ΩhAⒶ ΩiAⒶ ΩjAⒶ ΩkAⒶ Temperature and NaCl treatment (control: 35 °C, a_w 0.993). Source: Suppl. Tables S13–S16 of Kocharunchitt et al. (2012). ΩlAⒶ ΩmAⒶ 5.5 (control), 25 or 100 mM d-glucose. Source: Table 1 of Chen et al. (2013). ΩnAⒶ Gill proteome of Japanese eel (Anguilla japonica) adapted to seawater or freshwater. Source: protein IDs from Suppl. Table 3 and gene names of human orthologs from Suppl. File 4 of Tse et al. (2013). ΩoAⒶ ΩpAⒶΩqAⒶ Multiple experiments for 30 min after transfer from YPKG (0.5% glucose) to YNB (2% glucose) media. Source: extracted from Suppl. Files 3 and 5 of Giardina, Stanley & Chiang (2014), using the authors’ criterion of p-value < 0.05. ΩrAⒶ 280 (control), 380, or 480 mOsm (NaCl treatment) for 24 h. Source: Table 2 of Chen et al. (2015). ΩsAⒶΩtAⒶΩuAⒶΩvAⒶ Overnight treatment with a final concentration of 40/50 mM NaCl or 200 mM sucrose vs M2 minimal salts medium plus glucose (control). Source: Table S2 of Kohler et al. (2015). ΩwAⒶ ΩxAⒶ 15 g/L vs 5 g/L (control) glucose at days 0, 3, 6, and 9. The comparisons here use all proteins reported to have expression patterns in Cluster 1 (up) or Cluster 5 (down), or only the proteins with high expression differences (ratio ≤ − 0.2 or ≥0.2) at all time points. Source: SI Table S4 of Liu et al. (2015). ΩyAⒶ 4.21 osmol/kg vs 3.17 osmol/kg osmotic pressure (NaCl treatment). Source: Table 1 of Yang et al. (2015). ΩzAⒶ 0.1 M KCl (treatment) vs medium with no added KCl (control). Source: Suppl. Tables 2 and 3 of da Silva Rodrigues et al. (2016).

aGene names, GI numbers, or NCBI RefSeq accessions were converted to UniProt IDs using the UniProt mapping tool.

bAmino acid sequences were obtained for the listed GI numbers using Batch Entrez (https://www.ncbi.nlm.nih.gov/sites/batchentrez).

Sequence IDs were converted to UniProt IDs using the UniProt mapping tool (http://www.uniprot.org/mapping/) or the gene ID conversion tool of DAVID 6.7 (https://david.ncifcrf.gov/conversion.jsp). For proteins where the automatic conversions produced no matches, manual searches in UniProt were performed using the gene names or protein descriptions. If specified (i.e., as UniProt IDs with suffixes), particular isoforms of the proteins were used. Obsolete or secondary IDs reported for some proteins were updated to reflect current, primary IDs (uniprot_updates.csv in Dataset S1). Any duplicated IDs listed as having opposite expression ratios were excluded from the comparisons here.

Amino acid sequences of human proteins were taken from the UniProt human reference proteome. Sequences of proteins in other organisms and of human proteins not contained in the reference proteome were downloaded from UniProt or the NCBI website (for one study reporting GI numbers; see Table 4). Amino acid compositions were computed using functions in the CHNOSZ package (Dick, 2008) or the ProtParam tool on the UniProt website. The amino acid compositions are stored in *.Rdata files in Dataset S1.

R (R Core Team, 2016) and R packages canprot (this study) and CHNOSZ (Dick, 2008) were used to process the data and generate the figures with code specifically written for this study, which is provided in Dataset S2.

Measures of compositional oxidation and hydration state

Two compositional metrics that afford a quantitative description of proteomic data, the average oxidation state of carbon (Z_C) and the water demand per residue ( ${\bar{n}}_{H_{2} O}$ ), are briefly described here.

The oxidation state of atoms in molecules quantifies the degree of electron redistribution due to bonding; a higher oxidation state signifies a lower degree of reduction. Although calculations of oxidation state from molecular formulas necessarily make simplifying assumptions regarding the internal electronic structure of molecules, such calculations may be used to quantify the flow of electrons in chemical reactions, and the oxidation state concept is useful for studying the transformations of complex mixtures of organic molecules. For example, calculations of the average oxidation state of carbon provide insight on the processes affecting the decomposition of carbohydrate, protein and lipid fractions of natural organic matter (Baldock et al., 2004). Moreover, oxidation state can be regarded as an ensemble property of organic systems (Kroll et al., 2015). See Dick (2016) for additional references where organic and biochemical reactions have been characterized using the average oxidation state of carbon.

Despite the large size of proteins, their relatively simple primary structure means that Z_C can be computed using the elemental abundances in any particular amino acid sequence (Dick, 2014): (1) $Z_{C} = \frac{- h + 3 n + 2 o + 2 s + z}{c} .$ In this equation, c, h, n, o, and s are the elemental abundances in the chemical formula $C_{c} H_{h} N_{n} O_{o} S_{s}^{z}$ for a specific protein with total charge z. Note, however, that ionization by gain or loss of protons alters charge and the number of H equally, so has no effect on the value of Z_C; for ease of computation, Z_C is calculated here for proteins in their completely non-ionized forms.

In contrast to the elemental stoichiometry in Eq. (1), a calculation of the hydration state must account for the gain or loss of H₂O. In the biochemical literature, “protein hydration” or water of hydration refers to the effective (time-averaged) number of water molecules that interact with a protein (Timasheff, 2002). These dynamically interacting molecules form a hydration shell that has important implications for crystallography and enzymatic function, but hydration numbers have been measured for few proteins and are difficult to compute, especially for the many proteins with unknown tertiary structure. Thus, the structural hydration of proteins identified in proteomic datasets generally remains unquantified.

A different concept of hydration state arises by considering the chemical components that make up proteins. A componential analysis is a method of projecting the composition of a molecule using specified chemical formula units as the components, or basis species. The notion of components is central to chemical thermodynamics (Gibbs, 1875); the choice of components determines the thermodynamic variables (chemical potentials), and a careful choice leads to more convenient representations of the compositional and energetic constraints on reactions (e.g. Zhu & Anderson, 2002).

The components, or basis species, consist of a minimum number of species whose compositions can be linearly combined to represent the composition of any protein. The 20 proteinogenic amino acids are together composed of five elements (C, H, N, O, S), so five basis species are needed to represent the primary sequences of proteins. As noted previously (see references in Dick, 2016), all possible combinations of basis species lead to thermodynamically consistent models, but are differently suited to making interpretations. Dick (2016) proposed using C₅H₁₀N₂O₃, C₅H₉NO₄, C₃H₇NO₂S, O₂, and H₂O as a basis for assessing compositional differences in proteomes. The first three formulas correspond to glutamine (Q), glutamic acid (E), and cysteine (C).

To account for protein ionization, a proton can be included in the basis, which is now referred to as “QEC+”. Using the QEC+ basis, the stoichiometric projection of a protein with formula $C_{c} H_{h + z} N_{n} O_{o} S_{s}^{z}$ , where z is the charge of the protein and h is the number of H in the fully nonionized protein, is represented by (R1) $n_{Cys} C_{3} H_{7} {NO}_{2} S + n_{Glu} C_{5} H_{9} {NO}_{4} + n_{Gln} C_{5} H_{10} N_{2} O_{3} + n_{H_{2} O} H_{2} O + n_{O_{2}} O_{2} + z H^{+} \to C_{c} H_{h + z} N_{n} O_{o} S_{s}^{z} .$ To compare the compositions of different-sized proteins, the stoichiometric coefficients in Reaction (R1) can be divided by the sequence length (number of amino acids) of the protein. The length-normalized coefficients, written with an overbar, include the per-residue water demand for formation of a protein ( ${\bar{n}}_{H_{2} O}$ ). This componential “hydration state” is used in this study, and should not be confused with the structural biochemical “protein hydration” mentioned above.

The primary reason for choosing the QEC+ basis instead of others lies in the relation of the compositional variables representing oxidation and hydration state ( ${\bar{n}}_{O_{2}}$ and ${\bar{n}}_{H_{2} O}$ ) with each other and with Z_C. It is important to note that Z_C is a measure of oxidation state that is independent of the choice of basis species. Smoothed scatter plots of ${\bar{n}}_{H_{2} O}$ vs Z_C and ${\bar{n}}_{O_{2}}$ vs Z_C are shown in Fig. S1 for the 21,006 human proteins in the UniProt reference proteome. The plots in the top row of this figure are made using the QEC basis (which is equivalent to the QEC+ basis for the plotted variables) while those in the bottom row are made using the basis species CO₂, NH₃, H₂S, H₂O, and O₂; these inorganic species are often used to balance reactions in geochemical models. It is apparent from Fig. S1 that, using the QEC basis, ${\bar{n}}_{O_{2}}$ is highly positively correlated with Z_C, and ${\bar{n}}_{H_{2} O}$ shows a slight negative correlation with Z_C. Accordingly, in the QEC basis, ${\bar{n}}_{O_{2}}$ is a strong indicator of oxidation state, while ${\bar{n}}_{H_{2} O}$ represents a distinct compositional variable. In contrast, the plots in the bottom row of Fig. S1 show a moderate positive correlation between ${\bar{n}}_{O_{2}}$ and Z_C and a stronger negative correlation between ${\bar{n}}_{H_{2} O}$ and Z_C. Using that basis would therefore weaken the interpretation of ${\bar{n}}_{O_{2}}$ as an indicator of oxidation state and of ${\bar{n}}_{H_{2} O}$ as a distinct compositional variable. The relations among ${\bar{n}}_{H_{2} O}$ , ${\bar{n}}_{O_{2}}$ , and Z_C also vary between basis species consisting of different combinations of amino acids; those differences together with biological considerations support the choice of QEC instead of other amino acids (Dick, 2016).

In summary, Reaction (R1) is not a mechanism for protein synthesis, but is a projection of any protein’s elemental composition into chemical components, i.e., the basis. Compared to a basis composed of simpler inorganic species, the QEC+ basis reduces the projected codependence of oxidation and hydration state in proteins, unfolding a compositional dimension that can enrich a thermodynamic model.

Results

Colorectal cancer

The progression of colorectal cancer (CRC) begins with the formation of numerous non-cancerous lesions (adenoma), which may remain undetectable. Over time, a small fraction of adenomas develop into malignant tumors (carcinoma) (Jimenez et al., 2010; Wiśniewski et al., 2015). Publicly available datasets reporting a minimum of ca. 30 up- and 30 down-expressed proteins for tissue samples of CRC, and one meta-analysis of serum biomarkers, were compiled recently (Dick, 2016). These same datasets are listed in Table 1, with one newer addition (dataset ΩGAⒶ; Liu et al., 2016).

Many aspects of the experimental methods, statistical tests, and bioinformatics analyses used to identify significantly up-expressed and down-expressed proteins vary considerably among studies. The comparisons here are made without any control of this variability. Although particular comparisons may reflect study-specific conditions and methods, visualization of the chemical compositions of proteins for many datasets can reveal general features of the cancer phenotype.

For each dataset, Table 1 lists the numbers of down-expressed (n₁) and up-expressed (n₂) proteins in cancer relative to normal tissue. For datasets comparing different stages of cancer progression, groups n₁ and n₂ correspond to the down- and up-expressed proteins in the more advanced stage (e.g., carcinoma) compared to the less advanced stage (e.g., adenoma). Mean values of average oxidation state of carbon (Z_C; Eq. (1)) and water demand per residue ( ${\bar{n}}_{H_{2} O}$ ; Reaction (R1)) were calculated for the up- and down-expressed groups of proteins, together with the corresponding mean differences (ΔZ_C and $Δ {\bar{n}}_{H_{2} O}$ for the means of up- minus down-expressed groups), p-values, and effect sizes. These values are listed in Table S1. Figure S2 shows the mean values of Z_C and ${\bar{n}}_{H_{2} O}$ for the up- and down-expressed proteins together in a single plot (lettered point symbols for down-expressed and arrowheads for up-expressed proteins). Because of the high variability of mean values among datasets, compositional trends between up- and down-expressed proteins are difficult to interpret using Fig. S2. Therefore, the differences in mean values between up- and down-expressed proteins (ΔZ_C and $Δ {\bar{n}}_{H_{2} O}$ ) are plotted in this paper.

Figure 1A shows $Δ {\bar{n}}_{H_{2} O}$ vs ΔZ_C for the CRC datasets. The gray boxes cover the range from −0.01 to 0.01 for each of the variables. To draw attention to the largest and most significant changes, filled points and dashed lines indicate mean differences with a p-value (Wilcoxon test) less than 0.05; solid lines indicate mean differences with a common language effect size (CLES) ≥60% or ≤40%. The common language statistic “is the probability that a score sampled at random from one distribution will be greater than a score sampled from some other distribution” (McGraw & Wong, 1992). Here, CLES is calculated as the percentage of pairings of individual proteins with a positive difference in Z_C or ${\bar{n}}_{H_{2} O}$ between the up- and down-expressed groups from all possible pairings between the groups. Point symbols are squares if the p-values for both Z_C and ${\bar{n}}_{H_{2} O}$ are less than 0.05, or circles otherwise.

The plot illustrates that proteins up-expressed in carcinoma relative to normal tissue most often have significantly higher Z_C [ΩgAⒶ ΩkAⒶ ΩlAⒶ ΩnAⒶ ΩpAⒶ ΩrAⒶ ΩsAⒶ ΩuAⒶ ΩvAⒶ ΩlAⒶ], ${\bar{n}}_{H_{2} O}$ [ΩeAⒶ ΩoAⒶ ΩtAⒶ ΩxAⒶ ΩyAⒶ ΩDAⒶ ΩGAⒶ ΩHAⒶ], or both [ΩqAⒶ ΩAAⒶ ΩCAⒶ] (see also Dick, 2016). The red points in the plot highlight the datasets for adenoma/normal comparisons [ΩiAⒶ ΩoAⒶ ΩxAⒶ ΩAAⒶ ΩDAⒶ ΩHAⒶ]. Most of these exhibit a significant positive $Δ {\bar{n}}_{H_{2} O}$ but not the large increase in Z_C found for many of the carcinoma/normal comparisons.

Pancreatic cancer

Many proteomic studies have been performed to investigate the differences between normal pancreas (NP) and pancreatic adenocarcinoma (PDAC). Proteomic studies also address the inflammatory conditions of autoimmune pancreatitis, which is sometimes misidentified as carcinoma (Paulo et al., 2013), and chronic pancreatitis, which is associated with increased cancer risk (Chen et al., 2007). Searches for proteomic data were aided by the reviews of Pan et al. (2013) and Ansari et al. (2014). Table 2 lists selected datasets reporting at least ca. 25 up-expressed and 25 down-expressed proteins.

The compositional comparisons in Fig. 1B show that up-expressed proteins in pancreatic cancer often have significantly higher Z_C [ΩbAⒶ ΩeAⒶ ΩgAⒶ ΩiAⒶ ΩoAⒶ ΩpAⒶ ΩqAⒶ ΩrAⒶ]. A dataset obtained for pancreatic cancer associated with diabetes mellitus (Wang et al., 2013a) [ΩqAⒶ] has both significantly higher Z_C and ${\bar{n}}_{H_{2} O}$ . Only one dataset, from a study that targeted accessible proteins (Turtoi et al., 2011) [ΩhAⒶ], is characterized by a large negative mean difference of ΔZ_C. Some other datasets that do not have significantly different Z_C exhibit higher ${\bar{n}}_{H_{2} O}$ in cancer compared to non-cancerous (normal or pancreatitis) tissue [ΩaAⒶ ΩjAⒶ ΩkAⒶ ΩmAⒶ ΩuAⒶ]. Two of the four datasets with negative $Δ {\bar{n}}_{H_{2} O}$ [ΩdAⒶ ΩhAⒶ ΩnAⒶ ΩsAⒶ] were obtained from studies of chronic pancreatitis (Chen et al., 2007) or low-grade tumors (Wang et al., 2013b) (red points in Fig. 1B); another used a procedure to isolate accessible proteins (Turtoi et al., 2011) [ΩhAⒶ], while the remaining low- $Δ {\bar{n}}_{H_{2} O}$ dataset [ΩsAⒶ] may be an outlier in terms of mean chemical composition (Fig. S2). Therefore, the datasets with positive $Δ {\bar{n}}_{H_{2} O}$ and/or ΔZ_C likely reflect a general characteristic of pancreatic cancer.

Hypoxia and 3D culture

Hypoxia refers to oxygen concentrations that are lower than normal physiological levels. Hypoxia is a factor in many pathological conditions, including altitude sickness, stroke, and cardiac ischemia (e.g., Datta et al., 2010; Li et al., 2012; Fuhrmann et al., 2013). In tumors, irregular vascularization and abnormal perfusion contribute to the formation of hypoxic regions (Höckel & Vaupel, 2001). A related situation is the growth in the laboratory of 3D cell cultures (e.g., tumor spheroids), instead of two-dimensional growth on a surface. In 2D monolayers, all cells are exposed to the gas phase, but interior regions of 3D cultures are often diffusion-limited, leading to oxygen deprivation and necrosis (McMahon et al., 2012). There are some overlaps, but also many differences, between gene expression in 3D culture and hypoxic conditions (DelNero et al., 2015). These studies emphasize that growth in 3D culture is associated with heterogeneous oxygen concentrations and have found an interdependence between the effects of hypoxia and 3D growth on gene expression. The proteomic changes likely reflect not only oxygen limitation but also other processes connected with 3D growth (e.g., nutrient deprivation, extracellular architecture, and even light penetration). Although the comparisons made here do not address these individual factors, they do provide information on whether hypoxia and 3D culture lead to similar changes in the overall chemical composition of proteomes.

Table 3 lists selected proteomic datasets with a minimum of ca. 20 up- and 20 down-expressed proteins in hypoxia or 3D growth. The differences in chemical composition of the differentially expressed proteins are plotted in Fig. 2A. In many experiments, hypoxia or 3D growth induces a proteomic transformation with a significant and/or large decrease of Z_C [ΩaAⒶ ΩbAⒶ ΩcAⒶ ΩgAⒶ ΩhAⒶ ΩjAⒶ ΩmAⒶ ΩoAⒶ ΩwAⒶ ΩAAⒶ ΩEAⒶ]. These datasets cluster around a narrow range of ΔZ_C (−0.032 to −0.021), except for dataset ΩEAⒶ (3D growth of colon cancer cells) with much lower ΔZ_C. As extracellular proteins have relatively high Z_C (Dick, 2014), the observation in some experiments that hypoxia decreases the abundance of proteins associated with the extracellular matrix (ECM) (Blankley et al., 2010) is compatible with the overall expression of more reduced (low- Z_C) proteins. Conversely, reoxygenation leads to the formation of more oxidized proteins in the supernatant (-S) and pellet (-P) fractions of isolated chromatin [ΩrAⒶ ΩuAⒶ].

Figure 2: Compositional analysis of differential protein expression in (A) hypoxia or 3D culture and (B) hyperosmotic stress.
The plots show differences (Δ) between the mean for up-expressed and the mean for down-expressed proteins of average oxidation state of carbon (Z_C) and water demand per residue ( ${\bar{n}}_{H_{2} O}$ ) for each dataset from Tables 3 and 4. Red, blue, and orange symbols are used to highlight datasets for tumorspheres, reoxygenation or anti-hypoxic treatment, and adipose-derived stem cells, respectively.

Download full-size image

DOI: 10.7717/peerj.3421/fig-2

While most studies controlled gas composition to generate hypoxia, two datasets [ΩCAⒶ ΩDAⒶ] are from a study that used cobalt chloride (CoCl₂) to induce hypoxia in rat cardiomyocytes; treatment with salidroside (SAL) had anti-hypoxic effects (Xu et al., 2016). The CoCl₂ and SAL treatments result in the expression of somewhat more reduced and more oxidized proteins, respectively, in agreement with the general trends for hypoxia and reoxygenation experiments.

Two datasets oppose the general trends, showing large and significantly higher Z_C under hypoxia. These datasets were obtained using particular analytical methods or cell types. One of the nonconforming datasets is for the supernatant in a chromatin isolation procedure [ΩpAⒶ], and the other is for adipose-derived stem cells [ΩBAⒶ] (see below).

Hyperosmotic stress

By hyperosmotic stress is meant a condition that increases the extracellular hypertonicity, or osmolality. The addition of osmolytes (or “cosolvents”) lowers the water activity in the medium (Timasheff, 2002). Equilibration with hypertonic solutions drives water out of cells, causing cell shrinkage. The selected datasets listed in Table 4 include at least ca. 20 up-expressed and 20 down-expressed proteins in response to high concentrations of NaCl (five studies), glucose (six studies), succinate (one study), KCl (one study), or adaptation to seawater (one study). The proteomic analyses used bacterial, yeast, or mammalian cells, or fish (eel) gills (Tse et al., 2013). One study varied temperature along with NaCl concentration (Kocharunchitt et al., 2012), and one study reported both transcriptomic and proteomic ratios (Kohler et al., 2015).

In the study of Giardina, Stanley & Chiang (2014) [ΩoAⒶ ΩpAⒶ ΩqAⒶ], the reported expression ratios for extracellular proteins after transfer from low glucose to high glucose media are nearly all less than 1. Therefore, the “up-expressed” proteins in the comparisons here are taken to be those that have a higher expression ratio than the median in a given experiment. To achieve a sufficient sample size using data from Chen et al. (2015) [ΩrAⒶ], the comparisons here use a combined set of proteins, i.e., those identified to have the same direction of change in the two treatment conditions (380 and 480 mOsm NaCl) and a significant change in at least one of the conditions.

Figure 2B shows that hyperosmotic stress strongly (CLES ≤40%) and/or significantly (p-value < 0.05) induces the formation of proteins with relatively low water demand per residue in 11 datasets [ΩaAⒶΩbAⒶ ΩdAⒶΩfAⒶΩiAⒶ ΩmAⒶΩsAⒶΩtAⒶΩuAⒶΩvAⒶΩzAⒶ]. Five of these datasets, including four for bacteria [ΩsAⒶΩtAⒶΩuAⒶΩvAⒶ] and one for human cells [ΩmAⒶ], also show an increase in Z_C. These trends are found in both the transcriptomic [ΩsAⒶΩtAⒶ] and proteomic [ΩuAⒶ ΩvAⒶ] data from the study of Kocharunchitt et al. (2012).

Four datasets obtained for mammalian cells have low ΔZ_C with no significant [ΩrAⒶΩwAⒶΩxAⒶ] or a significantly negative mean difference of ${\bar{n}}_{H_{2} O}$ [ΩfAⒶ]. Six datasets [ΩhAⒶΩkAⒶΩnAⒶΩoAⒶΩpAⒶΩqAⒶ] from one study each of yeast and E. coli, and of Japanese eels adapted to seawater, have very small mean differences in Z_C and a negative $Δ {\bar{n}}_{H_{2} O}$ that follows the trends of most of the other datasets, but with lower significance (p-value > 0.05).

The comparisons here show that hyperosmotic stress consistently induces the formation of proteins with lower water demand per residue. In some, but not all, cases, this coincides with an increase in average oxidation state of carbon. Less often, and perhaps specific to mammalian cells, the proteomic composition is shifted toward lower oxidation state of carbon. There are only a couple of datasets, using NaCl treatment [ΩeAⒶΩjAⒶ], that show an increase in water demand per residue.

Notably, two datasets for adipose-derived stem cells oppose the general trends for hypoxic and hyperosmotic conditions (see Fig. 2A [ΩBAⒶ] and Fig. 2B [ΩeAⒶ]). This intriguing result shows that these stem cells respond to external stresses with proteomic transformations that are chemically similar to those in cancer (Fig. 1).

Potential diagrams

The correlations of compositional differences (negative ΔZ_C and $Δ {\bar{n}}_{H_{2} O}$ ) with hypoxia and hyperosmotic stress can be proposed as resulting from attraction of the proteomes to a context-specific low-energy state. Thermodynamic models can help to illuminate the possible microenvironmental constraints on the observed proteomic transformations. Here, the chemical affinities of stoichiometric formation reactions of proteins were calculated, grouped, and compared in order to estimate the thermodynamic potential for the overall process of proteomic transformation.

The chemical affinity quantifies the potential, or propensity, for a reaction to proceed. It is the infinitesimal change with respect to reaction progress of the negative of the Gibbs energy of the system. The chemical affinity is numerically equal to the “non-standard” or actual (Warn & Peters, 1996), “real” (Zhu & Anderson, 2002), or “overall” (Shock, 2009) negative Gibbs energy of reaction. These energies are not constant, but vary with the chemical potentials, or chemical activities, of species in the reaction. Chemical activity (a) and potential (μ) are related through μ = μ^∘ + RTlna, where the standard chemical potentials of particular species (μ^∘ = G^∘, i.e., standard Gibbs energies) depend only on temperature and pressure.

The equilibrium constant (K) for a reaction is given by ΔG^∘ = − 2.303RTlogK, where ΔG^∘ is the standard Gibbs energy of the reaction, 2.303 stands for the natural logarithm of 10, R is the gas constant, T is temperature in Kelvin, and log denotes the decadic logarithm. The equation used for affinity (A) is A = 2.303RTlog(K∕Q), where Q is the activity quotient of the reaction (e.g., Helgeson, 1979, Eq. 11.27; Warn & Peters, 1996, Eq. 7.14; Shock, 2009). Accordingly, the per-residue affinity of Reaction (R1) can be written as (2) $A = 2.303 R T (log K + {\bar{n}}_{Cys} log a_{Cys} + {\bar{n}}_{Glu} log a_{Glu} + {\bar{n}}_{Gln} log a_{Gln} + {\bar{n}}_{H_{2} O} log a_{H_{2} O} + {\bar{n}}_{O_{2}} log f_{O_{2}} - {\bar{z}}_{H^{+}} pH - log a_{residue})$ where the abbreviations of the amino acids have been substituted for their formulas. Here, a and f stand for chemical activity and fugacity (e.g., a_H₂O is water activity, and f_O₂ is oxygen fugacity). The fugacity, rather than activity, of O₂ is used because gaseous oxygen is the reference state most commonly used in previous thermodynamic models. If a_O₂ were used instead, its values would differ from f_O₂ according to the solubility of oxygen in water at the given temperature but otherwise the two models would be thermodynamically equivalent. The overbar notation ( $\bar{n}$ and $\bar{z}$ ) signifies that the coefficients in Reaction (R1) are each divided by the length (number of amino acids) of the protein sequence. Likewise, the elemental composition and standard Gibbs energy per residue are those of the ionized protein (with formula $C_{c} H_{h + z} N_{n} O_{o} S_{s}^{z}$ ) divided by the length of the protein.

The standard Gibbs energies of species at 37 °C and 1 bar were calculated with CHNOSZ (Dick, 2008) using equations and data taken from Wagman et al. (1982) and Kelley (1960) ( ${O_{2}}_{(g)}$ ), Johnson, Oelkers & Helgeson (1992) and references therein (H₂O), and using the Helgeson–Kirkham–Flowers equations of state (Helgeson, Kirkham & Flowers, 1981) with data taken from Amend & Helgeson (1997) and Dick, LaRowe & Helgeson (2006) (amino acids), and from Dick, LaRowe & Helgeson (2006) and LaRowe & Dick (2012) (amino acid group additivity for proteins).

In previous calculations, activities of the amino acid basis species and protein residues were set to 10⁻⁴ and 10⁰, respectively (Dick, 2016). As long as constant total activity of residues is assumed, the specific value does not greatly affect the outcome of the calculations; here it is kept at 10⁰. Revised activities of the amino acid basis species, corresponding to mean concentrations in human plasma (Tcherkas & Denisenko, 2001), are used here: 10^−3.6 (cysteine), 10^−4.5 (glutamic acid) and 10^−3.2 (glutamine). Adopting these activities of basis species, instead of 10⁻⁴, lowers the calculated equipotential lines for proteomic transformations by about 0.5 to 1 loga_H₂O (see below). Accounting for protein ionization, with pH set to 7, also lowers the equipotential lines, by about 1 loga_H₂O compared to calculations for nonionized proteins.

It follows from Eq. (2) that varying the fugacity of O₂ and activity of H₂O alters the chemical affinity for formation of proteins by a specific amount depending on their chemical composition. For example, Figure 5A of Dick (2016) shows that decreasing logf_O₂ is relatively more favorable for the formation of up-expressed than down-expressed proteins in a particular cancer dataset (Knol et al., 2014; ΩwAⒶ in Table 1). This tendency is consistent with the lower Z_C of these up-expressed proteins, which is unlike most other datasets for CRC (Fig. 1A).

How can the affinities of groups, rather than individual proteins, be compared? One method is based on differences in the ranks of chemical affinities of proteins between groups (Dick, 2016). Using this method, the affinities of all of the proteins in a dataset are ranked; the ranks are then summed for proteins in the up- and down-expressed groups (r_up and r_down). Before taking the difference, the ranks are multiplied by a weighting factor to account for the different numbers of proteins in the groups (n = n_up + n_down). This weighted rank difference (WRD) of affinity summarizes the estimates of the differential potential for formation: (3) $WRD = 2 (\frac{n_{down}}{n} \sum r_{up} - \frac{n_{up}}{n} \sum r_{down}) .$

On a contour diagram of the WRD of affinity (referred to here as a “potential diagram”), the line of zero WRD represents a rank-wise equal affinity (or “equipotential line”) for formation of proteins in the two groups.

To characterize the general trends, diagrams were made for groups of proteomic datasets with similar compositional features. For pancreatic cancer, there are 11 datasets with ΔZ_C > 0.01 (i.e., to the right of the gray box in Fig. 1B) and for which the mean difference of ${\bar{n}}_{H_{2} O}$ is neither significant (low p-value) nor large (high CLES). Conversely, there are 8 datasets for pancreatic cancer with $Δ {\bar{n}}_{H_{2} O} > 0.01$ and for which the mean difference of Z_C is neither large nor significant. Similarly, weighted rank-difference diagrams were constructed for 13 (ΔZ_C > 0.01) and 10 ( $Δ {\bar{n}}_{H_{2} O} > 0.01$ ) datasets for CRC, 8 datasets for hypoxia (ΔZ_C < − 0.01), and 12 datasets for hyperosmotic stress ( $Δ {\bar{n}}_{H_{2} O} < - 0.01$ ). The individual diagrams for each of these groups are presented in Fig. S3.

In order to observe the central tendencies among the various datasets, the potential diagrams for each group in Fig. S3 were combined by taking the arithmetic mean of the WRD at all grid points in logf_O₂–loga_H₂O space. The resulting diagrams (Fig. 3) have equipotential lines, shown in white, and zones of positive and negative WRD of affinity, i.e., greater relative potential for formation of up- and down-expressed groups of proteins, colored red and blue, respectively.

Figure 3: Merged potential diagrams for proteomic transformations.
Plots are shown for (A) 13 datasets for colorectal cancer and (B) 11 datasets for pancreatic cancer with ΔZ_C > 0.01, (C) eight datasets for hypoxia or 3D culture with ΔZ_C < − 0.01, (D) 10 datasets for colorectal cancer and (E) eight datasets for pancreatic cancer with $Δ {\bar{n}}_{H_{2} O} > 0.01$ , and (F) 12 datasets for hyperosmotic stress with $Δ {\bar{n}}_{H_{2} O} < - 0.01$ . Red and blue colors denote higher relative potential for formation of up- and down-expressed proteins, respectively. White lines are equipotential lines, where the mean weighted rank difference of affinity (WRD; Eq. (3)) of the included datasets is 0; black lines show the median and interquartile range of the WRD = 0 lines for individual datasets (Fig. S3). See text for details.

Download full-size image

DOI: 10.7717/peerj.3421/fig-3

The solid black lines in Fig. 3 show the median position along the x- or y-axis for the equipotential lines in each group (Fig. S3), and the dashed black lines are positioned at the 1st and 3rd quartiles. The interquartile ranges for the cancer groups are smaller than those for hypoxia, but less so for hyperosmotic stress. The smaller range would be expected if the cancer datasets reflected a somewhat narrower set of conditions than the datasets for experiments with hypoxia; the latter represent a wide variety of organisms, cell types, and laboratory conditions (Table 3).

Discussion

Calculations of the average oxidation state of carbon and water demand per residue, derived from elemental stoichiometry, provide information on the microenvironmental factors affecting differential protein expression in cancer and laboratory experiments. Hypoxia or hyperosmotic stress generally induces the expression of proteins with lower overall oxidation state of carbon or lower water demand per residue, respectively, compared to down-expressed proteins. In contrast, proteomes of CRC and pancreatic cancer are often characterized by greater water demand per residue or oxidation state of carbon. The formation of more highly oxidized proteins despite the hypoxic conditions of many tumors hints at a complex set of microenvironmental–cellular interactions in cancer.

Plots of data from experiments with hypoxia and hyperosmotic stress illuminate two dimensions of possible compositional attraction to a low-energy state (Fig. 2). A thermodynamic model quantifies the altered potential for proteomic transformation in response to changing oxygen fugacity and water activity. The equipotential lines for cancer proteomes with high differential water demand lie between loga_H₂O = − 1 to −3, while the potential threshold for transformation of proteomes in hyperosmotic stress is closer to unit activity of water (loga_H₂O = − 0 to −2) (Figs. 3D–3F). Although there is considerable variability among the individual datasets (Fig. S3), the merged diagrams demonstrate a physiologically realistic range for the activity of water. Water activity in cells is close to one, but restricted diffusion of H₂O in “osmotically inactive” regions of cells (Model, 2014) could result in locally lower water activities. The present findings provide evidence that the molecular processes regulating proteomic transformations operate within the chemical constraints of subcellular regions of depleted water activity.

The finding of a frequently positive water demand for the transformation between normal and cancer proteomes offers a new perspective on the biochemistry of hydration in cancer. The thermodynamic calculations predict that, in contrast to hyperosmotic stress, proteomes of cancer tissues are stabilized by increasing water activity. A higher than normal water activity would be consistent with the greater hydration of tissue that is apparent in spectroscopic analysis of breast cancer tissue (e.g., Abramczyk et al., 2014). Speculatively, the relatively high water content needed for embryonic development (Moulton, 1923) could be recreated in cancer cells if they revert to an embryonic mode of growth (McIntyre, 2006).

The equipotentials for transformation of proteomes in cancer cluster near an oxygen fugacity of ca. 10⁻⁶⁸ to 10⁻⁶⁶. The oxygen fugacity should be interpreted not as actual oxygen concentration, rather as a internal scale of oxidation potential. Oxygen fugacity and water activity can be converted to the Eh scale for redox potential, giving values that are comparable to other biochemical measurements (Dick, 2016).

Although cancer proteomes are obtained from tissues that are likely derived from hypoxic tumor environments, their differential expression is most often in favor of oxidized proteins (Figs. 1A and 1B). What are some explanations for this finding? Perhaps the relatively high logf_O₂ threshold for chemical transformation of hypoxia-responsive proteins could support a buffering action that potentiates the formation of relatively oxidized proteins in cancer (compare the median and quartiles in Fig. 3C with those in Figs. 3A and 3B). This speculative hypothesis requires a division of the cellular proteome into localized, chemically interacting subsystems. Alternatively, the development of a high oxidation potential in cancer cells may be associated with a higher concentration of mitochondrially produced reactive oxygen species (ROS). Neither of these possibilities addresses the magnitude of the chemical differences in the proteomes, and the question remains: where do the electrons go?

A plausible hypothesis comes from considering the different oxidation states of biomolecules. Fatty acids are reduced compared to amino acids, nucleotides, and saccharides (Amend et al., 2013). In parallel with the formation of more reduced proteins, hypoxia induces the accumulation of lipids in cell culture (Gordon, Barcza & Bush, 1977). Cancer cells are also known for increased lipid synthesis. Lipid droplets, which are derived from the endoplasmic reticulum (ER), form in great quantities in cancer cells (Koizume & Miyagi, 2016). Assuming that lipids are synthesized from relatively oxidized metabolic precursors, their formation requires a source of electrons. These considerations lead to the hypothesis that increased lipid synthesis is coupled to the oxidation of the proteome.

Calculations that combine proteomic and cellular data can be used to quantify a hypothetical redox balance between cellular lipids and proteins. The major assumptions in the calculations here are that the overall cellular oxidation state of carbon is the same in cancer and hypoxia, and that changes in this cellular oxidation state are brought about by altering only the numbers of lipid and protein molecules. The overall chemical composition of the lipids is assumed to be constant, but the proteins are assigned different values of Z_C. These simplifying assumptions are meant to pose quantifiable “what if” questions, to serve as points of reference about the range of molecular composition of cells (Milo & Phillips, 2015).

The worked-out calculation is shown in Fig. 4. The lipid:protein ratio in hypoxia is taken from Gordon, Barcza & Bush (1977), and ballpark values for the differences in Z_C of proteins in hypoxia and cancer are from the present study. Notably, the lipid:protein weight ratio in hypoxia (0.19) is higher than in normal cells (i.e., 0.15 using data from Gordon, Barcza & Bush, 1977 or 0.16 using data compiled by Milo & Phillips, 2015 for E. coli). The calculation indicates that an increase of the lipid:protein weight ratio in cancer cells by ca. 20% over that in hypoxic normal cells could provide an electron sink that is large enough to take up the electrons released by oxidation of the proteome in hypoxic normal cells to generate that in hypoxic cancer cells. That proteomic transformation is quantified here by an increase of ΔZ_C from ca. −0.03 to 0.03, both relative to non-hypoxic normal cells (Fig. 4).

Figure 4: A computer-aided “back of the envelope” calculation to estimate the lipid to protein ratio (L:P) in cancer cells and the percent difference from normal cells in hypoxic conditions.
Bold text indicates function definitions (R code) or numerical results (comments/results (rounded)). Numerical values are taken from [1] the chemical formula of 1-palmitoyl-2,3-dioleoyl-glycerol, given as an example of a triacylglycerol (triglyceride) in the chapter on lipid metabolism in Voet, Voet & Pratt (2013), [2] the average chemical formula of proteins in the UniProt human proteome, for which amino acid compositions are stored in human_base.Rdata in the **canprot** package, [3] this study, and [4] Table 2 of Gordon, Barcza & Bush (1977) (mouse cells grown in hypoxic conditions).

Download full-size image

DOI: 10.7717/peerj.3421/fig-4

As found by Raman spectroscopy, levels of both lipids and proteins are elevated in colorectal cancer (Stone et al., 2004). Lipid droplets are formed extensively in CRC stem cells (Tirinato et al., 2015), suggestive of a higher lipid:protein ratio than either cancer or normal epithelial cells. In contrast to CRC, lipids are decreased in breast cancer compared to normal breast tissue (Frank, McCreery & Redd, 1995; Stone et al., 2004). Given a lower lipid content, and therefore smaller electron sink, one might expect that proteomes in breast cancer are oxidized to a lesser extent than those in CRC and pancreatic cancer. Other factors that affect the systemic redox balance, such as a more reduced gut microbiome in CRC (Dick, 2016) and metabolic coupling between epithelial and stromal cells, may be important for an accurate account of the compositional relationships among biomacromolecules.

These compositional and thermodynamic analyses support the notion that changes in bulk chemical composition of cells and the microenvironment have a significant role in shaping the differential expression of proteins. The analysis done here is primarily concerned with top-down causal factors (physical constraints on protein synthesis and degradation), but does not preclude a major role for bottom-up factors (e.g., regulation of gene expression). Speculatively, further applications of these methods could be used to predict the ability of chemotherapy or other treatments to reduce or reverse the potential for formation of the proteins required by cancer cells. Based on the current findings, a decreased proteomic oxidation and/or hydration state may emerge as one aspect of beneficial treatments.

This approach to the data differs from conventional interpretations of proteomic data that are based on the functions of proteins. Nevertheless, the scope of explanations dealing with functions and molecular interactions offers limited insight on the high-level organization of proteomes in a cellular and microenvironmental context. Although a variety of bioinformatics tools are available for functional interpretations (Laukens, Naulaerts & Berghe, 2015), none so far addresses the overall chemical requirements of proteomic transformations. The compositional and thermodynamic descriptions presented here encourage a fresh look at the question, “What is cancer made of?”

Conclusion

Although many hypoxia experiments induce the formation of proteins with lower oxidation state of carbon (Z_C), the up-expressed proteins in colorectal and pancreatic cancer are often relatively oxidized compared to the down-expressed ones. Hyperosmotic stress in the laboratory leads to the formation of proteins with relatively low water demand per residue ( ${\bar{n}}_{H_{2} O}$ ), but cancer proteomes often show the opposite trend, with up-expressed proteins having higher average ${\bar{n}}_{H_{2} O}$ than down-expressed ones.

The global proteomic differences can be described as compositional changes in terms of chemical basis species and quantified in a thermodynamic framework. A positive thermodynamic potential for each proteomic transformation is predicted in a specific range of oxidation and hydration potential. However, the distribution of biomolecules other than proteins should also be considered to account for changes in cellular redox balance. An electron sink associated with a ca. 20% greater lipid to protein ratio in cancer compared to normal hypoxic cells would be sufficient to balance the electrons released by the formation of more oxidized proteins in CRC and pancreatic cancer. It thus appears possible that a redox disproportionation develops in some cancers, leading to pools of both more reduced and more oxidized macromolecules compared to normal conditions.

Supplemental Information

R source package including protein expression and amino acid composition data (canprot_0.0.5.tar.gz)

DOI: 10.7717/peerj.3421/supp-1

Download

Project code file, to be used with R, the canprot package (this study), and CHNOSZ version 1.1.0

DOI: 10.7717/peerj.3421/supp-2

Download

Compositional summaries: mean values of Z_C and n_H2O and corresponding mean differences, p-values, and common-language effect sizes (CLES)

DOI: 10.7717/peerj.3421/supp-3

Download

Comparison of basis species: scatterplots of n_O2 vs Z_C and n_H2O vs Z_C for proteins in the UniProt human proteome

DOI: 10.7717/peerj.3421/supp-4

Download

Average compositions of down- and up-expressed proteins in each dataset, plotted as point symbols and arrowheads, on n_H2O–Z_C diagrams

DOI: 10.7717/peerj.3421/supp-5

Download

Potential diagrams for each dataset. These diagrams were merged to make the diagrams in Fig. 3

DOI: 10.7717/peerj.3421/supp-6

Download

[1] Abramczyk H, Brozek-Pluska B, Krzesniak M, Kopec M, Morawiec-Sztandera A. 2014. The cellular environment of cancerous human tissue. Interfacial and dangling water as a “hydration fingerprint”. Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy 129:609-623

[2] Albrethsen J, Knol JC, Piersma SR, Pham TV, De Wit M, Mongera S, Carvalho B, Verheul HMW, Fijneman RJA, Meijer GA, Jimenez CR. 2010. Subnuclear proteomics in colorectal cancer: identification of proteins enriched in the nuclear matrix fraction and regulation in adenoma to carcinoma progression. Molecular & Cellular Proteomics 9(5):988-1005

[3] Amend JP, Helgeson HC. 1997. Calculation of the standard molal thermodynamic properties of aqueous biomolecules at elevated temperatures and pressures. Part 1. l- α-amino acids. Journal of the Chemical Society, Faraday Transactions 93(10):1927-1941

[4] Amend JP, LaRowe DE, McCollom TM, Shock EL. 2013. The energetics of organic synthesis inside and outside the cell. Philosophical Transactions of the Royal Society, B: Biological Sciences 368(1622):20120255

[5] Ansari D, Aronsson L, Sasor A, Welinder C, Rezeli M, Marko-Varga G, Andersson R. 2014. The role of quantitative mass spectrometry in the discovery of pancreatic cancer biomarkers for translational science. Journal of Translational Medicine 12(1):1-15

[6] Baldock JA, Masiello CA, Gélinas Y, Hedges JI. 2004. Cycling and composition of organic matter in terrestrial and marine ecosystems. Marine Chemistry 92(1–4):39-64

[7] Besson D, Pavageau A-H, Valo I, Bourreau A, Bélanger A, Eymerit-Morin C, Moulière A, Chassevent A, Boisdron-Celle M, Morel A, Solassol J, Campone M, Gamelin E, Barré B, Coqueret O, Guette C. 2011. A quantitative proteomic approach of the different stages of colorectal cancer establishes OLFM4 as a new nonmetastatic tumor marker. Molecular & Cellular Proteomics 10(12):M111.009712

[8] Blankley RT, Robinson NJ, Aplin JD, Crocker IP, Gaskell SJ, Whetton AD, Baker PN, Myers JE. 2010. A gel-free quantitative proteomics analysis of factors released from hypoxic-conditioned placentae. Reproductive Sciences 17(3):247-257

[9] Bousquet PA, Sandvik JA, Arntzen MØ, Jeppesen Edin NF, Christoffersen S, Krengel U, Pettersen EO, Thiede B. 2015. Hypoxia strongly affects mitochondrial ribosomal proteins and translocases, as shown by quantitative proteomics of HeLa cells. International Journal of Proteomics 2015:678527

[10] Chen J-Y, Chou H-C, Chen Y-H, Chan H-L. 2013. High glucose-induced proteome alterations in hepatocytes and its possible relevance to diabetic liver disease. Journal of Nutritional Biochemistry 24(11):1889-1910

[11] Chen L, Li J, Guo T, Ghosh S, Koh SK, Tian D, Zhang L, Jia D, Beuerman RW, Aebersold R, Chan ECY, Zhou L. 2015. Global metabonomic and proteomic analysis of human conjunctival epithelial cells (IOBA-NHC) in response to hyperosmotic stress. Journal of Proteome Research 14(9):3982-3995

[12] Chen R, Brentnall TA, Pan S, Cooke K, Moyes KW, Lane Z, Crispin DA, Goodlett DR, Aebersold R, Bronner MP. 2007. Quantitative proteomics analysis reveals that proteins differentially expressed in chronic pancreatitis are also frequently involved in pancreatic cancer. Molecular & Cellular Proteomics 6(8):1331-1342

[13] Chen R, Yi EC, Donohoe S, Pan S, Eng J, Cooke K, Crispin DA, Lane Z, Goodlett DR, Bronner MP, Aebersold R, Brentnall TA. 2005. Pancreatic cancer proteome: the proteins that underlie invasion, metastasis, and immunologic escape. Gastroenterology 129(4):1187-1197

[14] Chen Y-H, Chen J-Y, Chen Y-W, Lin S-T, Chan H-L. 2012. High glucose-induced proteome alterations in retinal pigmented epithelium cells and its possible relevance to diabetic retinopathy. Molecular Biosystems 8(12):3107-3124

[15] Cifani P, Bendz M, Wårell K, Hansson K, Levander F, Sandin M, Krogh M, Ovenberger M, Fredlund E, Vaapil M, Pietras A, Påhlman S, James P. 2011. Hunting for protein markers of hypoxia by combining plasma membrane enrichment with a new approach to membrane protein analysis. Journal of Proteome Research 10(4):1645-1656

[16] Crnogorac-Jurcevic T, Gangeswaran R, Bhakta V, Capurso G, Lattimore S, Akada M, Sunamura M, Prime W, Campbell F, Brentnall TA, Costello E, Neoptolemos J, Lemoine NR. 2005. Proteomic analysis of chronic pancreatitis and pancreatic adenocarcinoma. Gastroenterology 129(5):1454-1463

[17] Cui Y, Tian M, Zong M, Teng M, Chen Y, Lu J, Jiang J, Liu X, Han J. 2009. Proteomic analysis of pancreatic ductal adenocarcinoma compared with normal adjacent pancreatic tissue and pancreatic benign cystadenoma. Pancreatology 9(1–2):89-98

[18] Da Silva Rodrigues LN, De Almeida Brito W, Parente AFA, Weber SS, Bailão AM, Casaletti L, Borges CL, De Almeida Soares CM. 2016. Osmotic stress adaptation of Paracoccidioides lutzii, Pb01, monitored by proteomics. Fungal Genetics and Biology 95:13-23

[19] Datta A, Park JE, Li X, Zhang H, Ho ZS, Heese K, Lim SK, Tam JP, Sze SK. 2010. Phenotyping of an in vitro model of ischemic penumbra by iTRAQ-based shotgun quantitative proteomics. Journal of Proteome Research 9(1):472-484

[20] DelNero P, Lane M, Verbridge SS, Kwee B, Kermani P, Hempstead B, Stroock A, Fischbach C. 2015. 3D culture broadly regulates tumor cell hypoxia response and angiogenesis via pro-inflammatory pathways. Biomaterials 55:110-118

[21] De Wit M, Kant H, Piersma SR, Pham TV, Mongera S, Van Berkel MPA, Boven E, Pontén F, Meijer GA, Jimenez CR, Fijneman RJA. 2014. Colorectal cancer candidate biomarkers identified by tissue secretome proteome profiling. Journal of Proteomics 99:26-39

[22] Dick JM. 2008. Calculation of the relative metastabilities of proteins using the CHNOSZ software package. Geochemical Transactions 9 Article 10

[23] Dick JM. 2014. Average oxidation state of carbon in proteins. Journal of the Royal Society Interface 11:20131095

[24] Dick JM. 2016. Proteomic indicators of oxidation and hydration state in colorectal cancer. PeerJ 4:e2238

[25] Dick JM, LaRowe DE, Helgeson HC. 2006. Temperature, pressure, and electrochemical constraints on protein speciation: group additivity calculation of the standard molal thermodynamic properties of ionized unfolded proteins. Biogeosciences 3(3):311-336

[26] Drack M, Wolkenhauer O. 2011. System approaches of Weiss and Bertalanffy and their relevance for systems biology today. Seminars in Cancer Biology 21(3):150-155

[27] Dutta B, Yan R, Lim SK, Tam JP, Sze SK. 2014. Quantitative profiling of chromatome dynamics reveals a novel role for HP1BP3 in hypoxia-induced oncogenesis. Molecular & Cellular Proteomics 13(12):3236-3249

[28] Ellis G. 2015. Recognising top-down causation. In: Aguirre A, Foster B, Merali Z, eds. Questioning the foundations of physics: which of our fundamental assumptions are wrong?. Cham: Springer International Publishing. 17-44

[29] Emmeche C, Koppe S, Stjernfelt F. 2000. Levels, emergence, and three versions of downward causation. In: Andersen P, Emmeche C, Finnemann N, Christiansen P, eds. Downward causation. Aarhus: University of Aarhus Press. 322-348

[30] Enver T, Pera M, Peterson C, Andrews PW. 2009. Stem cell states, fates, and the rules of attraction. Cell Stem Cell 4(5):387-397

[31] Frank CJ, McCreery RL, Redd DCB. 1995. Raman spectroscopy of normal and diseased human breast tissues. Analytical Chemistry 67(5):777-783

[32] Fuhrmann DC, Wittig I, Heide H, Dehne N, Brüne B. 2013. Chronic hypoxia alters mitochondrial composition in human macrophages. Biochimica et Biophysica Acta (BBA)—Proteins and Proteomics 1834(12):2750-2760

[33] Giardina BJ, Stanley BA, Chiang H-L. 2014. Glucose induces rapid changes in the secretome of Saccharomyces cerevisiae. Proteome Science 12(1) Article 9

[34] Gibbs JW. 1875. On the equilibrium of heterogeneous substances (first part) Transactions of the Connecticut Academy of Arts and Sciences 3:108-248

[35] Gordon GB, Barcza MA, Bush ME. 1977. Lipid accumulation in hypoxic tissue culture cells. American Journal of Pathology 88(3):663-678

[36] Han Y-H, Xia L, Song L-P, Zheng Y, Chen W-L, Zhang L, Huang Y, Chen G-Q, Wang L-S. 2006. Comparative proteomic analysis of hypoxia-treated and untreated human leukemic U937 cells. Proteomics 6(11):3262-3274

[37] Helgeson HC. 1979. Mass transfer among minerals and hydrothermal solutions. In: Barnes HL, ed. Geochemistry of hydrothermal ore deposits (2nd edition). New York: Wiley. 568-610

[38] Helgeson HC, Kirkham DH, Flowers GC. 1981. Theoretical prediction of the thermodynamic behavior of aqueous electrolytes at high pressures and temperatures: IV. Calculation of activity coefficients, osmotic coefficients, and apparent molal and standard and relative partial molal properties to 600 °C and 5 Kb. American Journal of Science 281(10):1249-1516

[39] Ho JJD, Wang M, Audas T, Kwon D, Carlsson S, Timpano S, Evagelou S, Brothers S, Gonzalgo M, Krieger J, Chen S, Uniacke J, Lee S. 2016. Systemic reprogramming of translation efficiencies on oxygen stimulus. Cell Reports 14(6):1293-1300

[40] Höckel M, Vaupel P. 2001. Tumor hypoxia: definitions and current clinical, biologic, and molecular aspects. Journal of the National Cancer Institute 93(4):266-276

[41] Iuga C, Seicean A, Iancu C, Buiga R, Sappa PK, Völker U, Hammer E. 2014. Proteomic identification of potential prognostic biomarkers in resectable pancreatic ductal adenocarcinoma. Proteomics 14(7–8):945-955

[42] Jankova L, Chan C, Fung CLS, Song X, Kwun SY, Cowley MJ, Kaplan W, Dent OF, Bokey EL, Chapuis PH, Baker MS, Robertson GR, Clarke SJ, Molloy MP. 2011. Proteomic comparison of colorectal tumours and non-neoplastic mucosa from paired patient samples using iTRAQ mass spectrometry. Molecular Biosystems 7(11):2997-3005

[43] Jimenez CR, Knol JC, Meijer GA, Fijneman RJA. 2010. Proteomics of colorectal cancer: overview of discovery studies and identification of commonly identified cancer-associated proteins and candidate CRC serum markers. Journal of Proteomics 73(10):1873-1895

[44] Johnson JW, Oelkers EH, Helgeson HC. 1992. SUPCRT92: a software package for calculating the standard molal thermodynamic properties of minerals, gases, aqueous species, and reactions from 1 to 5000 bar and 0 to 1000 °C. Computers & Geosciences 18(7):899-947

[45] Kang U-B, Yeom J, Kim H-J, Kim H, Lee C. 2012. Expression profiling of more than 3500 proteins of MSS-type colorectal cancer by stable isotope labeling and mass spectrometry. Journal of Proteomics 75(10):3050-3062

[46] Kawahara T, Hotta N, Ozawa Y, Kato S, Kano K, Yokoyama Y, Nagino M, Takahashi T, Yanagisawa K. 2013. Quantitative proteomic profiling identifies DPYSL3 as pancreatic ductal adenocarcinoma-associated molecule that regulates cell adhesion and migration by stabilization of focal adhesion complex. PLOS ONE 8(12):e79654

[47] Kelley KK. 1960. Contributions to the data in theoretical metallurgy XIII: high temperature heat content, heat capacities and entropy data for the elements and inorganic compounds. In: Bulletin 584. Washington, D.C.: U. S. Bureau of Mines.

[48] Kim H-J, Kang U-B, Lee H, Jung J-H, Lee S-T, Yu M-H, Kim H, Lee C. 2012. Profiling of differentially expressed proteins in stage IV colorectal cancers with good and poor outcomes. Journal of Proteomics 75(10):2983-2997

[49] Knol JC, De Wit M, Albrethsen J, Piersma SR, Pham TV, Mongera S, Carvalho B, Fijneman RJA, Meijer GA, Jiménez CR. 2014. Proteomics of differential extraction fractions enriched for chromatin-binding proteins from colon adenoma and carcinoma tissues. Biochimica et Biophysica Acta (BBA)—Proteins and Proteomics 1844(5):1034-1043

[50] Kocharunchitt C, King T, Gobius K, Bowman JP, Ross T. 2012. Integrated transcriptomic and proteomic analysis of the physiological response of Escherichia coli O157:H7 Sakai to steady-state conditions of cold and water activity stress. Molecular & Cellular Proteomics 11(1):M111.009019

[51] Kohler C, Lourenço RF, Bernhardt J, Albrecht D, Schüler J, Hecker M, Gomes SL. 2015. A comprehensive genomic, transcriptomic and proteomic analysis of a hyperosmotic stress sensitive α-proteobacterium. BMC Microbiology 15(1):1-15

[52] Koizume S, Miyagi Y. 2016. Lipid droplets: a key cellular organelle associated with cancer cell survival under normoxia and hypoxia. International Journal of Molecular Sciences 17(9) Article 1430

[53] Kojima K, Bowersock GJ, Kojima C, Klug CA, Grizzle WE, Mobley JA. 2012. Validation of a robust proteomic analysis carried out on formalin-fixed paraffin-embedded tissues of the pancreas obtained from mouse and human. Proteomics 12(22):3393-3402

[54] Kosanam H, Prassas I, Chrystoja CC, Soleas I, Chan A, Dimitromanolakis A, Blasutig IM, Rückert F, Gruetzmann R, Pilarsky C, Maekawa M, Brand R, Diamandis EP. 2013. Laminin, gamma 2 (LAMC2): a promising new putative pancreatic cancer biomarker identified by proteomic analysis of pancreatic adenocarcinoma tissues. Molecular & Cellular Proteomics 12(10):2820-2832

[55] Kroll JH, Lim CY, Kessler SH, Wilson KR. 2015. Heterogeneous oxidation of atmospheric organic aerosol: kinetics of changes to the amount and oxidation state of particle-phase organic carbon. The Journal of Physical Chemistry A 119(44):10767-10783

[56] Kuo K-K, Kuo C-J, Chiu C-Y, Liang S-S, Huang C-H, Chi S-W, Tsai K-B, Chen C-Y, Hsi E, Cheng K-H, Chiou S-H. 2016. Quantitative proteomic analysis of differentially expressed protein profiles involved in pancreatic ductal adenocarcinoma. Pancreas 45(1):71-83

[57] Lai M-C, Chang C-M, Sun HS. 2016. Hypoxia induces autophagy through translational up-regulation of lysosomal proteins in human colon cancer cells. PLOS ONE 11(4):1-21

[58] LaRowe DE, Dick JM. 2012. Calculation of the standard molal thermodynamic properties of crystalline peptides. Geochimica et Cosmochimica Acta 80:70-91

[59] Laukens K, Naulaerts S, Berghe WV. 2015. Bioinformatics approaches for the functional interpretation of protein lists: from ontology term enrichment to network analysis. Proteomics 15(5–6):981-996

[60] Li M, Peng F, Li G, Fu Y, Huang Y, Chen Z, Chen Y. 2016. Proteomic analysis of stromal proteins in different stages of colorectal cancer establishes tenascin-C as a stromal biomarker for colorectal cancer metastasis. Oncotarget 7(24):37226-37237

[61] Li X, Arslan F, Ren Y, Adav SS, Poh KK, Sorokin V, Lee CN, De Kleijn D, Lim SK, Sze SK. 2012. Metabolic adaptation to a disruption in oxygen supply during myocardial ischemia and reperfusion is underpinned by temporal and quantitative changes in the cardiac proteome. Journal of Proteome Research 11(4):2331-2346

[62] Liu Z, Dai S, Bones J, Ray S, Cha S, Karger BL, Li JJ, Wilson L, Hinckle G, Rossomando A. 2015. A quantitative proteomic analysis of cellular responses to high glucose media in Chinese hamster ovary cells. Biotechnology Progress 31(4):1026-1038

[63] Liu X, Xu Y, Meng Q, Zheng Q, Wu J, Wang C, Jia W, Figeys D, Chang Y, Zhou H. 2016. Proteomic analysis of minute amount of colonic biopsies by enteroscopy sampling. Biochemical and Biophysical Research Communications 476(4):286-292

[64] Lu Z, Hu L, Evers S, Chen J, Shen Y. 2004. Differential expression profiling of human pancreatic adenocarcinoma and healthy pancreatic tissue. Proteomics 4(12):3975-3988

[65] McGraw KO, Wong SP. 1992. A common language effect size statistic. Psychological Bulletin 111(2):361-365

[66] McIntyre GI. 2006. Cell hydration as the primary factor in carcinogenesis: a unifying concept. Medical Hypotheses 66(3):518-526

[67] McKinney KQ, Lee Y-Y, Choi H-S, Groseclose G, Iannitti DA, Martinie JB, Russo MW, Lundgren DH, Han DK, Bonkovsky HL, Hwang S-I. 2011. Discovery of putative pancreatic cancer biomarkers using subcellular proteomics. Journal of Proteomics 74(1):79-88

[68] McMahon KM, Volpato M, Chi HY, Musiwaro P, Poterlowicz K, Peng Y, Scally AJ, Patterson LH, Phillips RM, Sutton CW. 2012. Characterization of changes in the proteome in different regions of 3D multicell tumor spheroids. Journal of Proteome Research 11(5):2863-2875

[69] Mikula M, Rubel T, Karczmarski J, Goryca K, Dadlez M, Ostrowski J. 2011. Integrating proteomic and transcriptomic high-throughput surveys for search of new biomarkers of colon tumors. Functional and Integrative Genomics 11(2):215-224

[70] Milo R, Phillips R. 2015. Cell biology by the numbers. New York: Garland Science.

[71] Model MA. 2014. Possible causes of apoptotic volume decrease: an attempt at quantitative review. American Journal of Physiology: Cell Physiology 306(5):C417-C424

[72] Morrison BJ, Hastie ML, Grewal YS, Bruce ZC, Schmidt C, Reynolds BA, Gorman JJ, Lopez JA. 2012. Proteomic comparison of MCF-7 tumoursphere and monolayer cultures. PLOS ONE 7(12):e52692

[73] Moulton CR. 1923. Age and chemical development in mammals. Journal of Biological Chemistry 57(1):79-97

[74] Mu Y, Chen Y, Zhang G, Zhan X, Li Y, Liu T, Li G, Li M, Xiao Z, Gong X, Chen Z. 2013. Identification of stromal differentially expressed proteins in the colon carcinoma by quantitative proteomics. Electrophoresis 34(11):1679-1692

[75] Oswald ES, Brown LM, Bulinski JC, Hung CT. 2011. Label-free protein profiling of adipose-derived human stem cells under hyperosmotic treatment. Journal of Proteome Research 10(7):3050-3059

[76] Pan S, Brentnall TA, Kelly K, Chen R. 2013. Tissue proteomics in pancreatic cancer study: discovery, emerging technologies, and challenges. Proteomics 13(3–4):710-721

[77] Pan S, Chen R, Stevens T, Bronner MP, May D, Tamura Y, McIntosh MW, Brentnall TA. 2011. Proteomics portrait of archival lesions of chronic pancreatitis. PLOS ONE 6(11):1-12

[78] Paulo JA, Kadiyala V, Brizard S, Banks PA, Steen H, Conwell DL. 2013. A proteomic comparison of formalin-fixed paraffin-embedded pancreatic tissue from autoimmune pancreatitis, chronic pancreatitis, and pancreatic cancer. Journal of the Pancreas 14(4):405-414

[79] Peng F, Huang Y, Li M-Y, Li G-Q, Huang H-C, Guan R, Chen Z-C, Liang S-P, Chen Y-H. 2016. Dissecting characteristics and dynamics of differentially expressed proteins during multistage carcinogenesis of human colorectal cancer. World Journal of Gastroenterology 22(18):4515-4528

[80] Pham TK, Wright PC. 2008. The proteomic response of Saccharomyces cerevisiae in very high glucose conditions with amino acid supplementation. Journal of Proteome Research 7(11):4766-4774

[81] R Core Team. 2016. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. software

[82] Rajcevic U, Knol J, Piersma S, Bougnaud S, Fack F, Sundlisaeter E, Sondenaa K, Myklebust R, Pham T, Niclou S, Jimenez C. 2014. Colorectal cancer derived organotypic spheroids maintain essential tissue characteristics but adapt their metabolism in culture. Proteome Science 12(1) Article 39

[83] Ren Y, Hao P, Dutta B, Cheow ESH, Sim KH, Gan CS, Lim SK, Sze SK. 2013. Hypoxia modulates A431 cellular pathways association to tumor radioresistance and enhanced migration revealed by comprehensive proteomic and functional studies. Molecular & Cellular Proteomics 12(2):485-498

[84] Riis S, Stensballe A, Emmersen J, Pennisi CP, Birkelund S, Zachar V, Fink T. 2016. Mass spectrometry analysis of adipose-derived stem cells reveals a significant effect of hypoxia on pathways regulating extracellular matrix. Stem Cell Research & Therapy 7(1):1-14

[85] Semenza GL. 2003. Targeting HIF-1 for cancer therapy. Nature Reviews. Cancer 3(10):721-732

[86] Sethi MK, Thaysen-Andersen M, Kim H, Park CK, Baker MS, Packer NH, Paik Y-K, Hancock WS, Fanayan S. 2015. Quantitative proteomic analysis of paired colorectal cancer and non-tumorigenic tissues reveals signature proteins and perturbed pathways involved in CRC progression and metastasis. Journal of Proteomics 126:54-67

[87] Shock EL. 2009. Minerals as energy sources for microorganisms. Economic Geology 104(8):1235-1248

[88] Stone N, Kendall C, Smith J, Crow P, Barr H. 2004. Raman spectroscopy for identification of epithelial cancers. Faraday Discussions 126:141-157

[89] Tcherkas YV, Denisenko AD. 2001. Simultaneous determination of several amino acids, including homocysteine, cysteine and glutamic acid, in human plasma by isocratic reversed-phase high-performance liquid chromatography with fluorimetric detection. Journal of Chromatography A 913(1–2):309-313

[90] Timasheff SN. 2002. Protein hydration, thermodynamic binding, and preferential hydration. Biochemistry 41(46):13473-13482

[91] Tirinato L, Liberale C, Di Franco S, Candeloro P, Benfante A, La Rocca R, Potze L, Marotta R, Ruffilli R, Rajamanickam VP, Malerba M, De Angelis F, Falqui A, Carbone E, Todaro M, Medema JP, Stassi G, Di Fabrizio E. 2015. Lipid droplets: a new player in colorectal cancer stem cells unveiled by spectroscopic imaging. Stem Cells 33(1):35-44

[92] Tse WKF, Sun J, Zhang H, Law AYS, Yeung BHY, Chow SC, Qiu J-W, Wong CKC. 2013. Transcriptomic and iTRAQ proteomic approaches reveal novel short-term hyperosmotic stress responsive proteins in the gill of the Japanese eel (Anguilla japonica) Journal of Proteomics 89:81-94

[93] Turtoi A, Musmeci D, Wang Y, Dumont B, Somja J, Bevilacqua G, De Pauw E, Delvenne P, Castronovo V. 2011. Identification of novel accessible proteins bearing diagnostic and therapeutic potential in human pancreatic ductal adenocarcinoma. Journal of Proteome Research 10(9):4302-4313

[94] Uzozie A, Nanni P, Staiano T, Grossmann J, Barkow-Oesterreicher S, Shay JW, Tiwari A, Buffoli F, Laczko E, Marra G. 2014. Sorbitol dehydrogenase overexpression and other aspects of dysregulated protein expression in human precancerous colorectal neoplasms: a quantitative proteomics study. Molecular & Cellular Proteomics 13(5):1198-1218

[95] Van den Beucken T, Magagnin MG, Jutten B, Seigneuric R, Lambin P, Koritzinsky M, Wouters BG. 2011. Translational control is a major contributor to hypoxia induced gene expression. Radiotherapy and Oncology 99(3):379-384

[96] Villeneuve L, Tiede LM, Morsey B, Fox HS. 2013. Quantitative proteomics reveals oxygen-dependent changes in neuronal mitochondria affecting function and sensitivity to rotenone. Journal of Proteome Research 12(10):4599-4606

[97] Voet D, Voet JG, Pratt CW. 2013. Fundamentals of biochemistry (4th edition). Hoboken: John Wiley & Sons.

[98] Waanders LF, Chwalek K, Monetti M, Kumar C, Lammert E, Mann M. 2009. Quantitative proteomic analysis of single pancreatic islets. Proceedings of the National Academy of Sciences of the United States of America 106(45):18902-18907

[99] Wagman DD, Evans WH, Parker VB, Schumm RH, Halow I, Bailey SM, Churney KL, Nuttall RL. 1982. The NBS tables of chemical thermodynamic properties. Selected values for inorganic and C₁ and C₂ organic substances in SI units. Journal of Physical and Chemical Reference Data 11(Suppl. 2):1-392

[100] Wang W-S, Liu X-H, Liu L-X, Jin D-Y, Yang P-Y, Wang X-L. 2013a. Identification of proteins implicated in the development of pancreatic cancer-associated diabetes mellitus by iTRAQ-based quantitative proteomics. Journal of Proteomics 84:52-60

[101] Wang W-S, Liu X-H, Liu L-X, Lou W-H, Jin D-Y, Yang P-Y, Wang X-L. 2013b. iTRAQ-based quantitative proteomics reveals myoferlin as a novel prognostic predictor in pancreatic adenocarcinoma. Journal of Proteomics 91:453-465

[102] Warn JRW, Peters APH. 1996. Concise chemical thermodynamics (2nd edition). London: CRC Press.

[103] Watanabe M, Takemasa I, Kawaguchi N, Miyake M, Nishimura N, Matsubara T, Matsuo E-I, Sekimoto M, Nagai K, Matsuura N, Monden M, Nishimura O. 2008. An application of the 2-nitrobenzenesulfenyl method to proteomic profiling of human colorectal carcinoma: a novel approach for biomarker discovery. Proteomics: Clinical Applications 2(6):925-935

[104] Wiśniewski JR, Duś-Szachniewicz K, Ostasiewicz P, Ziółkowski P, Rakus D, Mann M. 2015. Absolute proteome analysis of colorectal mucosa, adenoma, and cancer reveals drastic changes in fatty acid metabolism and plasma membrane transporters. Journal of Proteome Research 14(9):4005-4018

[105] Wiśniewski JR, Ostasiewicz P, Duś K, Zielińska DF, Gnad F, Mann M. 2012. Extensive quantitative remodeling of the proteome between normal colon tissue and adenocarcinoma. Molecular Systems Biology 8(1) Article 611

[106] Wrzesinski K, Rogowska-Wrzesinska A, Kanlaya R, Borkowski K, Schwämmle V, Dai J, Joensen KE, Wojdyla K, Carvalho VB, Fey SJ. 2014. The cultural divide: exponential growth in classical 2D and metabolic equilibrium in 3D environments. PLOS ONE 9(9):1-15

[107] Xie L-Q, Zhao C, Cai S-J, Xu Y, Huang L-Y, Bian J-S, Shen C-P, Lu H-J, Yang P-Y. 2010. Novel proteomic strategy reveal combined α1 antitrypsin and cathepsin D as biomarkers for colorectal cancer early screening. Journal of Proteome Research 9(9):4701-4709

[108] Xu Z-W, Chen X, Jin X-H, Meng X-Y, Zhou X, Fan F-X, Mao S-Y, Wang Y, Zhang W-C, Shan N-N, Li Y-M, Xu R-C. 2016. SILAC-based proteomic analysis reveals that salidroside antagonizes cobalt chloride-induced hypoxic effects by restoring the tricarboxylic acid cycle in cardiomyocytes. Journal of Proteomics 130:211-220

[109] Yang L-B, Dai X-M, Zheng Z-Y, Zhu L, Zhan X-B, Lin C-C. 2015. Proteomic analysis of erythritol-producing Yarrowia lipolytica from glycerol in response to osmotic pressure. Journal of Microbiology and Biotechnology 25(7):1056-1069

[110] Yao L, Lao W, Zhang Y, Tang X, Hu X, He C, Hu X, Xu LX. 2012. Identification of EFEMP2 as a serum biomarker for the early detection of colorectal cancer with lectin affinity capture assisted secretome analysis of cultured fresh tissues. Journal of Proteome Research 11(6):3281-3294