Atlas of prostate cancer heritability in European and African-American men pinpoints tissue-specific regulation

Gusev, Alexander; Shi, Huwenbo; Kichaev, Gleb; Pomerantz, Mark; Li, Fugen; Long, Henry W.; Ingles, Sue A.; Kittles, Rick A.; Strom, Sara S.; Rybicki, Benjamin A.; Nemesure, Barbara; Isaacs, William B.; Zheng, Wei; Pettaway, Curtis A.; Yeboah, Edward D.; Tettey, Yao; Biritwum, Richard B.; Adjei, Andrew A.; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P.; John, Esther M.; Murphy, Adam B.; Signorello, Lisa B.; Carpten, John; Leske, M. Cristina; Wu, Suh-Yuh; Hennis, Anslem J. M.; Neslund-Dudas, Christine; Hsing, Ann W.; Chu, Lisa; Goodman, Phyllis J.; Klein, Eric A.; Witte, John S.; Casey, Graham; Kaggwa, Sam; Cook, Michael B.; Stram, Daniel O.; Blot, William J.; Eeles, Rosalind A.; Easton, Douglas; Kote-Jarai, ZSofia; Al Olama, Ali Amin; Benlloch, Sara; Muir, Kenneth; Giles, Graham G.; Southey, Melissa C.; Fitzgerald, Liesel M.; Gronberg, Henrik; Wiklund, Fredrik; Aly, Markus; Henderson, Brian E.; Schleutker, Johanna; Wahlfors, Tiina; Tammela, Teuvo L. J.; Nordestgaard, Børge G.; Key, Tim J.; Travis, Ruth C.; Neal, David E.; Donovan, Jenny L.; Hamdy, Freddie C.; Pharoah, Paul; Pashayan, Nora; Khaw, Kay-Tee; Stanford, Janet L.; Thibodeau, Stephen N.; McDonnell, Shannon K.; Schaid, Daniel J.; Maier, Christiane; Vogel, Walther; Luedeke, Manuel; Herkommer, Kathleen; Kibel, Adam S.; Cybulski, Cezary; Wokolorczyk, Dominika; Kluzniak, Wojciech; Cannon-Albright, Lisa; Teerlink, Craig; Brenner, Hermann; Dieffenbach, Aida K.; Arndt, Volker; Park, Jong Y.; Sellers, Thomas A.; Lin, Hui-Yi; Slavov, Chavdar; Kaneva, Radka; Mitev, Vanio; Batra, Jyotsna; Spurdle, Amanda; Clements, Judith A.; Teixeira, Manuel R.; Pandha, Hardev; Michael, Agnieszka; Paulo, Paula; Maia, Sofia; Kierzek, Andrzej; Conti, David V.; Albanes, Demetrius; Berg, Christine; Berndt, Sonja I.; Campa, Daniele; Crawford, E. David; Diver, W. Ryan; Gapstur, Susan M.; Gaziano, J. Michael; Giovannucci, Edward; Hoover, Robert; Hunter, David J.; Johansson, Mattias; Kraft, Peter; Le Marchand, Loic; Lindström, Sara; Navarro, Carmen; Overvad, Kim; Riboli, Elio; Siddiq, Afshan; Stevens, Victoria L.; Trichopoulos, Dimitrios; Vineis, Paolo; Yeager, Meredith; Trynka, Gosia; Raychaudhuri, Soumya; Schumacher, Frederick R.; Price, Alkes L.; Freedman, Matthew L.; Haiman, Christopher A.; Pasaniuc, Bogdan

doi:10.1038/ncomms10979

Download PDF

Article
Open access
Published: 07 April 2016

Atlas of prostate cancer heritability in European and African-American men pinpoints tissue-specific regulation

Alexander Gusev^1,2,
Huwenbo Shi³,
Gleb Kichaev³,
Mark Pomerantz⁴,
Fugen Li^5,6,
Henry W. Long^4,5,
Sue A. Ingles⁷,
Rick A. Kittles⁸,
Sara S. Strom⁹,
Benjamin A. Rybicki¹⁰,
Barbara Nemesure¹¹,
William B. Isaacs¹²,
Wei Zheng¹³,
Curtis A. Pettaway¹⁴,
Edward D. Yeboah^15,16,
Yao Tettey^15,16,
Richard B. Biritwum^15,16,
Andrew A. Adjei^15,16,
Evelyn Tay^15,16,
Ann Truelove¹⁷,
Shelley Niwa¹⁷,
Anand P. Chokkalingam¹⁸,
Esther M. John^19,20,
Adam B. Murphy²¹,
Lisa B. Signorello^1,22,
John Carpten²³,
M. Cristina Leske¹¹,
Suh-Yuh Wu¹¹,
Anslem J. M. Hennis^11,24,
Christine Neslund-Dudas¹⁰,
Ann W. Hsing^19,20,
Lisa Chu^19,20,
Phyllis J. Goodman²⁵,
Eric A. Klein²⁶,
John S. Witte^27,28,
Graham Casey⁷,
Sam Kaggwa²⁹,
Michael B. Cook³⁰,
Daniel O. Stram⁷,
William J. Blot^13,22,
Rosalind A. Eeles^31,32,
Douglas Easton³³,
ZSofia Kote-Jarai³¹,
Ali Amin Al Olama³³,
Sara Benlloch³³,
Kenneth Muir^34,35,
Graham G. Giles^36,37,
Melissa C. Southey³⁸,
Liesel M. Fitzgerald³⁶,
Henrik Gronberg³⁹,
Fredrik Wiklund³⁹,
Markus Aly^39,40,
Brian E. Henderson⁴¹,
Johanna Schleutker^42,43,
Tiina Wahlfors⁴³,
Teuvo L. J. Tammela⁴⁴,
Børge G. Nordestgaard^45,46,
Tim J. Key^47,48,
Ruth C. Travis^47,48,
David E. Neal^49,50,
Jenny L. Donovan⁵¹,
Freddie C. Hamdy^52,53,
Paul Pharoah ORCID: orcid.org/0000-0001-8494-732X⁵⁴,
Nora Pashayan^54,55,
Kay-Tee Khaw⁵⁶,
Janet L. Stanford^57,58,
Stephen N. Thibodeau⁵⁹,
Shannon K. McDonnell⁵⁹,
Daniel J. Schaid⁵⁹,
Christiane Maier⁶⁰,
Walther Vogel⁶⁰,
Manuel Luedeke⁶¹,
Kathleen Herkommer⁶²,
Adam S. Kibel⁶³,
Cezary Cybulski⁶⁴,
Dominika Wokolorczyk⁶⁴,
Wojciech Kluzniak⁶⁴,
Lisa Cannon-Albright^65,66,
Craig Teerlink^65,66,
Hermann Brenner^67,68,
Aida K. Dieffenbach^67,68,
Volker Arndt⁶⁷,
Jong Y. Park⁶⁹,
Thomas A. Sellers⁶⁹,
Hui-Yi Lin⁷⁰,
Chavdar Slavov⁷¹,
Radka Kaneva⁷²,
Vanio Mitev⁷²,
Jyotsna Batra⁷³,
Amanda Spurdle⁷⁴,
Judith A. Clements⁷³,
Manuel R. Teixeira^75,76,
Hardev Pandha⁷⁷,
Agnieszka Michael⁷⁷,
Paula Paulo⁷⁵,
Sofia Maia⁷⁵,
Andrzej Kierzek⁷⁷,
The PRACTICAL consortium,
David V. Conti⁷⁸,
Demetrius Albanes⁷⁹,
Christine Berg⁸⁰,
Sonja I. Berndt³⁰,
Daniele Campa⁸¹,
E. David Crawford⁸²,
W. Ryan Diver⁸³,
Susan M. Gapstur⁸³,
J. Michael Gaziano^1,84,85,
Edward Giovannucci^1,86,
Robert Hoover³⁰,
David J. Hunter¹,
Mattias Johansson^87,88,
Peter Kraft^1,89,
Loic Le Marchand⁹⁰,
Sara Lindström^1,89,
Carmen Navarro^91,92,
Kim Overvad⁸⁰,
Elio Riboli⁹³,
Afshan Siddiq⁹⁴,
Victoria L. Stevens⁸³,
Dimitrios Trichopoulos^1,95,96,
Paolo Vineis^97,98,
Meredith Yeager³⁰,
Gosia Trynka ORCID: orcid.org/0000-0002-6955-9529^99,100,
Soumya Raychaudhuri^2,99,101,
Frederick R. Schumacher⁷⁸,
Alkes L. Price^1,2,
Matthew L. Freedman^2,4,5,
Christopher A. Haiman⁷⁸ &
…
Bogdan Pasaniuc^3,102,103

Nature Communications volume 7, Article number: 10979 (2016) Cite this article

7541 Accesses
37 Citations
9 Altmetric
Metrics details

Subjects

Abstract

Although genome-wide association studies have identified over 100 risk loci that explain ∼33% of familial risk for prostate cancer (PrCa), their functional effects on risk remain largely unknown. Here we use genotype data from 59,089 men of European and African American ancestries combined with cell-type-specific epigenetic data to build a genomic atlas of single-nucleotide polymorphism (SNP) heritability in PrCa. We find significant differences in heritability between variants in prostate-relevant epigenetic marks defined in normal versus tumour tissue as well as between tissue and cell lines. The majority of SNP heritability lies in regions marked by H3k27 acetylation in prostate adenoc7arcinoma cell line (LNCaP) or by DNaseI hypersensitive sites in cancer cell lines. We find a high degree of similarity between European and African American ancestries suggesting a similar genetic architecture from common variation underlying PrCa risk. Our findings showcase the power of integrating functional annotation with genetic data to understand the genetic basis of PrCa.

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Genome-wide association studies

Article 26 August 2021

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Introduction

Family history is a well-established risk factor for prostate cancer (PrCa), which has an estimated heritability of 58%—one of the highest across common cancers¹. Genome-wide association studies (GWAS) have been particularly successful in identifying over 100 risk loci that capture ∼33% of the estimated familial risk². Although most of the GWAS PrCa variants overlap prostate-specific regulatory elements (for example, androgen receptor-binding sites (ARBS))^{2,3,4,5,6,7,8}, a quantification of the contribution of genetic variation from various chromatin marks to PrCa risk is currently lacking.

Recent work form the ENCODE/ROADMAP consortia⁹ has shown that a large fraction of the genome plays a role in at least one biochemical event, in at least one tissue. Although this functional atlas of the human genome has greatly enhanced our understanding of regulatory elements, such functional elements are often tissue specific^10,11 making their interpretability in the context of PrCa risk challenging. Existing studies that have integrated PrCa GWAS findings with tissue-specific functional annotations have relied only on the GWAS significant variants (∼100 in the most recent study) or single-nucleotide polymorphisms (SNPs) tagging them^2,7, thus ignoring loci that do not reach genome-wide significance. Recent methodological advances have shown that the entire polygenic architecture of common traits can be interrogated using variance components across all assayed SNPs (typed and/or imputed) to increase power for detecting trait-specific functional annotations¹². In addition to offering superior performance relative to methods that evaluate only GWAS SNPs, the variance components methods also allow for comparison of estimates across different studies and sample sizes. This is because variance components yield an unbiased estimate (under standard assumptions) of SNP heritability —the variance in trait explained by SNPs that reside within elements of a given functional category^12,13,14,15.

Here, we use targeted and genome-wide SNP array data from 59,089 male PrCa cases and controls of European (BPC3 (ref. 16) and iCOGS (ref. 4), respectively, see Methods) and African American (AAPC (ref. 17), see Methods) ancestry to dissect the genetic risk of PrCa. We estimate the SNP heritability of previously implicated regulatory annotations^7,18 and perform a broad analysis of 544 epigenetic marks from ENCODE/ROADMAP (ref. 9). Our approach interrogates the entire common polygenic architecture of PrCa while accounting for potential correlations between related functional categories. First, we find that SNPs near ARBS assayed in prostate tumour explain significantly more of the heritability of PrCa than ARBS SNPs assayed in prostate normal tissue. Second, we localize most of the heritability of PrCa to regions in the genome marked by three functional categories: (i) H3K27ac histone modifications in prostate adenocarcinoma cell lines (LNCaP; typically marking active enhancers¹⁹); (ii) androgen receptors in prostate tissue¹⁸; and (iii) DNase I hypersensitivity sites (DHS) in cancer cell lines. We replicate the LNCaP H3K27ac and DHS results across different ancestries and show that risk prediction from genome-wide SNP data is significantly improved with a predictor that incorporates the functional atlas as prior. Overall, our results suggest a similar genetic architecture from common variation of PrCa risk across men of European and African ancestry and highlight H3k27ac histone mark in LNCaP and ARBS in prostate tissue for follow-up studies of PrCa risk.

Results

Partitioning the genetic risk for prostate cancer

We analysed multiple functional annotations and quantified the fraction of variance in trait explained by SNPs that are localized within each functional class. Our approach models the phenotype (PrCa) of a set of individuals as being drawn from a multivariate normal distribution with variance components estimated based on genetic data (that is, SNPs) plus an environmental term (see Methods)^13,14. For each functional category i, a genetic relationship matrix across all individuals is computed from all the SNPs residing in the given functional category to serve as a variance component. Multiple components are then jointly fitted using the restricted maximum likelihood (REML) as implemented in the GCTA software¹⁴ to estimate variance parameters for each component. The SNP heritability for component i is then estimated as , where the sum in the denominator is across all fitted components including the environmental term. Therefore, we view as an estimate of the variance in trait that can be explained by all the SNPs in the corresponding functional category with a linear model of the trait (that is, SNP heritability)¹². We expect functional categories that are enriched with casual variants for PrCa to attain a higher estimated SNP heritability as compared with functional categories depleted of causal variants for PrCa. To focus our results on noncoding variation and account for potential confounders because of linkage disequilibrium (LD), we explicitly included coding and coding-proximal regulatory variation as ‘background’ components whenever we quantified the effect of each functional annotation tested (see Methods).

The variance component model has previously been shown to yield robust estimates under the assumption that causal variants are typed and uniformly sampled from a given component^13,20,21. Here, we perform additional simulations using the UK10K whole-genome sequence data to confirm the validity of this model for our data, and to assess how representative SNP estimates are of true underlying biology at common sequenced variants. The simulation framework uses real genotype data from the UK10K consortium to generate additive, polygenic phenotypes with a given heritability and then performs heritability estimation with the variance component model (see Methods). Although the UK10K data contains a much smaller set of individuals as the iCOGS data (3,047 versus 42,613 individuals, see Methods), it contains variation from whole-genome sequencing; this allows us to evaluate model performance by simulation when restricting to SNPs genotyped on the iCOGS platform. We focused on the LNCaP: H3k27ac annotation (which was most significant in our data, see below) to evaluate the multiple component models. Over thousands of simulations, we confirmed that the variance components approach correctly recovered the causal contribution to trait from a given functional category when causal variants were typed (Supplementary Table 1, see Methods). Under both null and enriched scenarios the estimates were unbiased and standard errors properly calibrated (Supplementary Table 1). For common sequenced variants not present on the iCOGS platform, relative estimates of noncoding enrichment/depletion were conservative, with the tagged effects distributed across the typed components (Supplementary Table 2). Deviations from the standard variance components model assumptions on the distribution of effect-sizes and ancestry-specific effects in African Americans yielded either well calibrated or conservative estimates of SNP heritability in the focal LNCaP: H3k27ac category (see Methods, Supplementary Tables 1–3).

Our primary functional analyses focus on the densely genotyped iCOGS sample (21,678 cases and 20,935 controls), whose large sample size allowed for highly accurate estimates of component-specific . Although the iCOGS chip is custom built to oversample risk loci, it provides a broad coverage of the common variation genome wide⁴. To showcase the power of the variance components approach, we estimated the total SNP heritability of PrCa at 0.28 (s.e. 0.01) in the iCOGS data (not significantly different from the total SNP heritability estimate of 0.26 (s.e. 0.05) in the BPC3 data), a significant increase from the variance explained only by the known GWAS variants of 0.06 (s.e.m. 0.001) (see Methods; Supplementary Table 4). Interestingly, the total SNP heritability in the African American sample, which was genotyped on a different platform than iCOGS (see Methods), was estimated at 0.32 (s.e. 0.06) indicating a similar aggregate contribution of common variation to PrCa risk across the two ethnicities despite higher overall risk in African Americans²² (Supplementary Table 4).

Enrichment at androgen receptor-binding sites in tumours

We first focused on SNPs localized in the ARBS: an epigenetic profile causally implicated in prostate tumorigensis. In contrast to typical assays that focus on cell lines, the ARBS were defined by chromatin immunoprecipitation and high-throughput sequencing (ChIP-seq) directly in primary human tissue (seven normal and 13 tumour specimens)¹⁸. We observed that variants within 5 kb of tumour-specific ARBS explained 17.0% of the genome-wide (s.e. 1.7%; P=2.6 × 10⁻¹⁶ by Z-test), whereas the variants near normal-specific ARBS explained 0.0% of the (s.e. 0.9%; P=0.11 by Z-test) (Fig. 1). The difference between these two groups was highly significant and demonstrates the importance of assaying functional marks in both normal and tumour tissues. We note that the 5 kb extension may also include other regulatory variants near the tumour/normal-specific ARBS (but not heritability from coding/untranslated region (UTR)/promoter variants, which were explicitly modelled, see Methods). Smaller flanking regions were also investigated but did not include enough markers for the variance components model to converge. We also quantified the proportion of SNP heritability explained directly by all ARBS variants (both normal and tumour without 5 kb flanks) at 10.7% of ; significantly different from the SNP heritability of ARBS variants assayed in prostate adenocarcinoma cancer cell line (LNCaP; 3.2% of ) (P=4.4 × 10⁻⁷ for difference by Z-test) (Fig. 1). This difference is partially explained by the very low number of SNPs within cell line ARBS making their aggregate contribution small but not empowering us to place a strong bound on the enrichment. Overall, these findings highlight the increased complexity of ARBS in a sample of tissues as compared with the single LNCaP cell line.

**Figure 1: Functional partitioning for variants within ARBS for PrCa.**

Identification of functional marks relevant to PrCa risk

Next, we looked for marks that contribute to the heritability of PrCa across a broad spectrum of functional annotations without prior assumptions on relevance to disease. We investigated 544 epigenetic annotations spanning six major classes (DHS; H3k4me1; H3k4me3; H3k9ac; H3k27ac; and computationally predicted functional classes or ‘segmentations’^23,24) averaging 101 cell types per class (see Methods). After accounting for multiple testing, we identified 82 annotations that exhibited statistically significant deviations in SNP heritability from what was expected based on the proportion of the genome covered by that particular annotation (see Fig. 2 and Supplementary Data).

**Figure 2: Functional partitioning of heritability across six main epigenetic classes.**

We first focused on 17 functional marks measured in the prostate, of which 14 were statistically significant (Supplementary Table 5). The single most significant enrichment was observed for H3k27ac marks in LNCaP (P=1 × 10⁻³² by Z-test), which localized 22% of the total to the 2.9% of genotyped SNPs within the annotation. This was followed by variants in DHS marks in LNCaP (P=2 × 10⁻¹⁸ by Z-test; 16.7% of localized in 3.1% of genome). The DHS annotations allowed us to compare estimates across three major prostate cell lines: LNCaP; normal prostate epithelial (PrEC); and immortalized prostate epithelial (RWPE1) (overlapping by 25–50% with ARBS, Supplementary Fig. 1). We observed heritability explained by LNCaP DHS to be nominally significantly higher than PrEC (P=0.01 by Z-test); and both LNCaP and PrEC to be significantly higher than RWPE1 (P=1.5 × 10⁻⁹, P=1.2 × 10⁻⁵, respectively, by Z-test) (Fig. 3). More broadly, 10 out of 16 DHS marks measured in cancer cell lines were observed as significant, with colorectal cancer as the next most significant cancer (P=6.0 × 10⁻¹⁰ by Z-test; 9.4% of heritability localized in 2.0% of genome; Supplementary Data). H3k27ac in LNCaP remained the most significantly enriched mark across all 544 annotations (presented in detail in the Supplementary Data). The most depleted categories were repressed regions computationally predicted by Segway-chromHMM in HepG2 cells (P=1.3 × 10⁻¹⁹ by Z-test; 51.9% of from 74.3% of SNPs; Supplementary Data), with similar levels of depletion in repressed regions from other cell types. These regions are typically associated with decreased gene expression and repressive histone marks^23,24,25, further emphasizing the importance of active regulation.

**Figure 3: Pairwise analysis of DHS marks in three prostate cell types.**

As H3k27ac typically marks active enhancers, we further evaluated variants with respect to their enhancer or ‘super’-enhancer status (large clusters of enhancers that are enriched for genes involved in cell identity²⁶) (see Methods). We did not observe differences in average heritability explained by SNPs within the two marks across 49 cell lines (see Methods), with an average of 1.51 (1.47)-fold increase over random SNPs for enhancers (super enhancers) (Fig. 4). Surprisingly, we observed an individually significant difference only in LNCaP, with 4.9 (1.7)-fold enrichment at enhancers (super enhancers), in contrast to previous hypotheses²⁶ (Fig. 4).

**Figure 4: Comparison of enhancers and super enhancers across 49 cell types.**

Genomic functional atlas of prostate cancer SNP heritability

Although the results above showcase the power of the variance component approach in finding epigenetic marks relevant for PrCa, such marks often overlap making the causal mark difficult to identify (Supplementary Fig. 1). To account for the correlation among marks we grouped the 82 marginally significant annotations into 15 biologically relevant, non-overlapping groups organized by mark and cell line, and partitioned across all groups in a joint model (see Methods, Table 1, Fig. 5 and Supplementary Table 6). Five components were nominally significant in the joint model at P<0.05; out of the five components three remained significant after accounting for 15 tests: H3k27ac marks in LNCaP (P=2.5 × 10⁻²⁰ by Z-test); DHS marks in other cancer cell types (P=3.9 × 10⁻⁵ by Z-test); and repressed segmentations (P=2.1 × 10⁻²⁰ by Z-test). To further refine our model, we restricted to the significant annotations (and the background components accounting for LD to coding regions) and re-evaluated them jointly, referred to as the ‘selected’ model. This selected model localized 51.0% of the within 12.1% of SNPs (LNCaP: H3K27ac+ARBS+DHS cancer), whereas coding regions only explained 3.3% (s.e. 1.4%) of within 1.8% of SNPs (Supplementary Table 7). The localization was even stronger with imputed data, where 86% of the was localized to 8.6% of SNPs (Table 1 and Supplementary Tables 8 and 9). Estimates from imputed markers were more representative of underlying enrichment in our simulations (see Methods, Supplementary Table 2) but may include the effects of nearby markers¹² and so we consider them as an upper bound. None of the estimates changed significantly after adjusting for known GWAS associations² (79 of which were typed in this data), underscoring the polygenic nature of this effect.

Table 1 Partitioning of heritability across functional classes in prostate cancer.

Full size table

**Figure 5: Partitioning of heritability across functional classes in prostate cancer.**

Having inferred the selected model, we re-analysed each of the 82 marginally significant categories jointly with the selected model (see Methods). Only three marks remained significant: two H3k27ac annotations in the colon crypt and one H3k27ac annotation in pancreas (Supplementary Data). This implies that the marginal enrichment of the 82 annotations was primarily driven by the overlap with functional marks in the selected model. For example, the H3K4me1 mark in penis foreskin keratinocytes that was previously highly significant (24.6% , P=3.0 × 10⁻¹⁶ by Z-test, Fig. 1) was no longer enriched after conditioning on the selected model (7.1% , P=0.29 by Z-test, Supplementary Data). The reduction to a small number of categories in the selected model with limited loss in signal further emphasizes the extent to which the selected model has localized the functional sources of enrichment. Focusing on the two most enriched categories in the selected model, we found that SNPs present in both the prostate tissue ARBS and LNCaP H3k27ac marks yielded significantly higher average heritability per SNP than either mark individually (Supplementary Table 10). In contrast, the variants specific to ARBS or H3k27ac were comparable in SNP heritability.

Replication of genomic functional atlas across ancestries

We evaluated replication of our model using two separate genome-wide SNP data sets of PrCa, one of European ancestry (BPC3; 6,953 samples) and one of African ancestry (AAPC; 9,522 samples) for PrCa (see Methods). To account for the smaller sample size, we focused on the eight-component selected model, only retaining significant components and three coding-proximal classes (coding, UTR, promoter)¹². Because of platform differences between the populations, we used post-QC imputed variants in each data set, which are most reflective of underlying enrichment in our simulations (see Methods). We replicated the significant deviation in at H3k27ac and the repressed loci across both BPC3 and AAPC (Supplementary Tables 11 and 12). However, cancer DHS was only significant in the BPC3 data and ARBS not significant in either (though the estimates were not significantly different from the iCOGS estimate). The enrichment did not change after restricting to very high-quality imputed markers (Supplementary Table 13). Although the relatively small validation sample size did not provide enough power to test differences between the ancestries, the mean SNP heritability for variants within each mark were remarkably similar (ρ=0.90 between AAPC and BPC3 across eight components), suggesting a similar pattern of aggregate contribution to risk coming from common variants marked by epigenetic classes across European and African American ancestries (though individual risk variants themselves may differ).

H3k27ac mark in LNCaP is specific to PrCa

As a negative control, we evaluated the selected model with imputed SNPs across 11 common non-cancer diseases from the Wellcome Trust Case Control Consortium (WTCCC) (see Methods, Supplementary Table 14) where we observed two main differences: the LNCaP H3k27ac annotation was no longer significantly enriched (1.1% with 2.6% of SNPs); and the repressed regions were much less depleted from the null (28.1% with 87.8% of SNPs) compared with the 0.3% of observed in iCOGS imputed data (P=2.2 × 10⁻⁴ for difference by Z-test). Interestingly, although ARBS were significantly enriched in all 11 traits, the enrichment was no longer significant after excluding autoimmune traits. Overall, these differences indicate that the LNCaP H3k27ac mark is uniquely informative for PrCa, whereas variants near the ARBS and DHS cancer elements (which overlap other DHS annotations by 56%; Supplementary Fig. 2) may play a generally important role across other common diseases¹².

Genomic functional atlas improves polygenic risk prediction

To validate our SNP heritability genomic atlas, we compared the accuracy of predicting case/control status from genetic data with or without the functional atlas. We evaluated three distinct prediction models in the iCOGS sample: (i) a genetic risk score (GRS) from the genome-wide significant SNPs; (ii) the single best linear unbiased predictor (BLUP) using a single variance component from all SNPs; and (iii) the weighted sum of individual BLUPs from each epigenetic category in the selected model (multi-BLUP; see Methods). Evaluated by cross-validation, the GRS yielded an R²=0.029 with true phenotype, whereas the single BLUP yielded an R²=0.065 and the multi-BLUP had an R²=0.071 (Supplementary Table 15). In a joint model with all three predictors, the multi-BLUP was highly significant (P=5.3 × 10⁻³¹ from multiple regression). When we constructed the GRS from SNPs recently discovered in a much larger PrCa GWAS (ref. 2), the resulting prediction R² increased to 0.084. However, including the single BLUP or the multi-BLUP as an additional predictor still increased the prediction R² to 0.096 (joint P=6.7 × 10⁻⁴ from multiple regression) and 0.098 (joint P=1.3 × 10⁻²³ from multiple regression), respectively (Supplementary Table 15). The consistent statistical significance and increased prediction accuracy confirms the validity of the selected model in this data and in larger GWAS.

Discussion

Using large-scale genotype data from over 59,089 men of European and African American ancestries jointly with epigenetic annotations, we identified highly significant differences in SNP heritability of PrCa across variants from different epigenetic classes, tissue types and cell lines. Focusing on marks measured in prostate, we observed significantly higher around tumour-specific ARBS; ARBS measured in primary tissue relative to cell line; and DHS measured in PrCa cell line relative to prostate epithelial cell line. The enrichment at tumour-specific ARBS was consistent with recent findings showing that these sites were enriched for nearby genes highly expressed in tumours¹⁸. These analyses are comprehensive and cover most commonly studied prostate cell lines except for vertebral cancer of the prostate, which were not well represented in the ENCODE/ROADMAP. A search across 544 diverse functional annotations restricted most of the to a small fraction of the genome marked by prostate regulatory elements. Consistent with previous findings in common disease, functionally repressed regions were significantly depleted in heritability, highlighting the role of active regulation in PrCa susceptibility. Subsequent model selection localized the enrichment from 82 individually significant annotations to six that remained significant in a joint model. In particular, the abundance of enrichment in H3k27ac marks (active enhancers) relative to H3k4me1/H3k4me3 (poised enhancers/promoters) underscores their role in PrCa, though further enrichment in super enhancers was not observed. The enrichment within LNCaP: H3K27ac and depletion at repressed regions was replicated across different ancestries and yielded significant improvements in polygenic risk prediction.

With most GWAS associations falling outside coding regions, our analyses offer an important resource for prioritizing potential loci and focusing future studies on the most heritable genomic regions²⁷. The marginal analyses provide a ranking of 544 common functional assays, while the selected model localizes heritability to only those functional classes that are independently enriched. Emerging functional categories may further refine this signal or reveal other relevant epigenetic marks, though little enrichment beyond the selected model was observed in the comprehensive sampling of functional data analysed here. In general, the variance component model offers an opportunity to evaluate biological hypotheses in silico and without strictly relying on individually significant SNPs. However, as with any analysis of array-based data, the estimates will not include the contribution of SNPs that are untyped or poorly tagged, such as rare variants or other contributors to the missing heritability. Future analyses of whole-genome sequencing, additional functional annotations, and larger sample sizes can yield important insights into functional mechanisms that are still not localized. Overall, our results suggest similar patterns of functional enrichment across men of European and African American ancestry and highlight ARBS, H3k27ac marks in LNCaP cell lines and DHS in cancer cell lines for follow-up studies of PrCa risk.

Methods

Epigenetic annotations

Sample collection and processing for functional annotations was made publically available by the ENCODE/ROADMAP consortia²⁸. DHS, H3k4me1, H3k4me3, H3k9ac annotations and genome segmentations^20,29, enhancers and super enhancers²⁶ and PrCa-specific annotations^7,18 were assay and processed as detailed in the original studies. Tumour-only and normal-only ARBS were defined in seven normal and 13 tumour specimens in the original study¹⁸. All annotations curated for this paper (ENCODE/ROADMAP; Pomerantz et al.; and Hazelett et al.) are available at https://data.broadinstitute.org/alkesgroup/ANNOTATIONS/PRCA/. The full list of individual annotations with web-links to the corresponding boundary definitions is provided in the Supplementary Data. Some functional marks are listed multiple times due to multiple independent assays or laboratory protocols.

ARBS ChIP-seq in human tissue specimens

The ARBS assay was performed as described in REF (ref. 18) and summarized here. Fourteen subjects of European American ancestry were selected for ChIP analysis. Their chromatin was incubated overnight with 6 μg antibody AR (N-20, Santa Cruz Biotechnology, Dallas, TX) bound to protein A and protein G beads (Life Technologies, Carlsbad, CA). A fraction of the sample was not exposed to antibody to be used as control (input). The samples were de-crosslinked, treated with RNase and proteinase K, and DNA was extracted. The samples were then re-sheared to 100–300 base pairs using the Covaris ultra-sonicator, and concentrations of the ChIP DNA were quantified by Qubit Fluorometer (Life Technologies). DNA sequencing libraries were prepared using the ThruPLEX-FD Prep Kit (Rubicon Genomics, Ann Arbor, MI). Libraries were sequenced using 50-base pair reads on the Illumina platform (Illumina, San Diego, CA) at Dana-Farber Cancer Institute. AR binding sites were generated using Model-Based Analysis of ChIP-seq 2 (MACS2), with a qvalue (false discovery rate, FDR) threshold of 0.01.

The 13 tumours used in this study were androgen dependent and not exposed to androgen deprivation therapies. All of the tumours were specimens obtained from radical prostatectomies, derived from men with early stage disease. These samples were not selected based on any specific features; therefore, we would expect that the distribution of risk variants would be similar to a random sampling of PrCa cases. Large-scale genetic surveys have shown that somatically acquired alterations in primary localized prostate tumours (the type of tumour evaluated in this study) are infrequent. Based on these previous results, we believe that somatically acquired genetic events in regions related to androgen biology are not common and, therefore, do not influence our results.

Patient material

Informed consent was obtained from all subjects and all studies were approved by local Research Ethics Committees and/or Institutional Review Boards.

Data quality control

Quality control is crucial for accurate heritability estimation, where many small artifacts can add up to large biases. All data sets went through a stringent QC process with the following exclusion criteria: minor allele frequency (MAF)<1%; fraction of missing/uncalled SNPs>5%; Hardy–Weinberg equilibrium P value<0.01; case–control missingness P value<0.05; imputation INFO score>0.30. In addition, close relatives were pruned such that no pair of individuals had genetic relatedness (GRM) coefficients>0.05. The top 10 principal components and a coded study label were always included as fixed-effects. All analysed samples, cases and controls, were males.

iCOGS data

The iCOGS consortium genotyped balanced cases and controls on a custom targeted array⁴. After quality control, 42,613 samples and 153,621 genotyped SNPs remained. Imputation was performed to the 1000 Genomes reference panel using HAPI-UR (ref. 30) for phasing and IMPUTE2 (ref. 31) for imputation. Overall, 1,910,827 imputed and genotyped SNPs passed QC. Because of computational restrictions, the heritability estimation was carried out in two equally sized halves of the ICOGS, with total effects computed by inverse-variance meta-analysis. We partitioned the genotyped SNP heritability by MAF but observed no trend and only slight enrichment of % at high-frequency variants (Supplementary Table 16).

BPC3 data

The National Cancer Institute Breast & Prostate Cancer Cohort Consortium (BPC3) consortium genotyped individuals on the Illumina HumanHap610 quad array³². After quality control, 6,953 samples and 4,004,229 genotyped and imputed SNPs remained. Age was available for all samples and additionally included as a covariate.

AAPC data

The AAPC consortium genotyped individuals of African ancestry on the Illumina Human1M array^2,33,34. After quality control, 9,522 samples and 10,468,389 genotyped and imputed SNPs remained.

WTCCC data

The Wellcome Trust Case Control Consortium Genotyping genotyped cases for 11 traits as well as shared controls on multiple Illumina and Affymetrix arrays^35,36,37. The phenotypes analysed here were ankylosing spondylitis (AS); bipolar disorder (BD); coronary artery disease (CAD); Crohn's disease (CD); hypertension (HT); multiple sclerosis (MS); rheumatoid arthritis (RA); schizophrenia (SP); type 1 diabetes (T1D); type 2 diabetes (T2D); and ulcerative colitis (UC). After quality control, a total of 47,053 samples and 4–5 million genotyped and imputed SNPs remained. Reported values were estimated for each phenotype separately and meta-analysed using inverse-variance weighting.

UK10K data

The UK10K whole-genome sequence data from ALSPAC and TWINSUK (http://www.uk10k.org) was used only for simulation, and so stringent quality control was not applied. After relatedness filtering, 3,047 samples and 15,691,225 non-singleton variants were retained.

Heritability estimation of individual annotations

We estimated the SNP heritability captured by functional categories in a joint variance component model using GCTA as described in REF (ref. 20). Briefly, this model assumes the phenotype is drawn from a multivariate normal distribution with variance-covariance modelled by components computed from the SNPs and a normal residual. For each functional category (for example, DHS) i=1..M where M is the total number of categories in the model, a GRM across all pairs of individuals is computed restricting to SNPs within the functional category. Variance components for all GRMs in the model are then fitted using REML as implemented in GCTA to estimate a variance parameter used to compute . The corresponds to the fraction of trait variance that can be explained by the BLUP restricted to SNPs in the corresponding functional category (or annotation). For a given functional annotation, SNPs were categorized into a hierarchy of seven non-overlapping components: (1) coding; (2) UTR; (3) promoter (functional annotation of interest); (4) DHS; (5) intron; and (6) intergenic. SNPs belonging to multiple categories were partitioned explicitly into the first category in this list. The coding and coding-proximal components were included to ensure that the annotation heritability was not inflated by SNPs that were in high LD with coding variation. A genetic relatedness matrix was computed for each component by first standardizing the corresponding SNPs and then computing a SNP covariance over all pairs of samples. Component-specific σ² and errors were fitted iteratively using the Average Information algorithm³⁸. The analytical standard error for was estimated by transforming the GCTA-inferred and error covariance matrix using the delta method. As in REF (ref. 20) statistical significance was evaluated by comparing the explained by the category and it’s standard error to the %SNPs in the category using a Z-test (comparing nested models using a likelihood ratio test yielded similar results). Total estimates were computed as after transforming to the liability scale assuming a prevalence of 0.14 and using the study-specific case/control ratio.

Hierarchical joint models

For specific models of interest, we extended the individual annotation model described above to test intersecting and non-intersecting components. This allowed us to evaluate precisely which sub-annotations of overlapping components were likely to be causal. For the tumour/normal model, we expanded each tumour/normal mark by 5 kb in both directions from the center to capture nearby genes and other regulatory regions so that tumour (normal) covered 3.3% (1.4%) of the SNPs, respectively. We estimated from the joint hierarchical model: (1) coding; (2) UTR; (3) promoter; (4) normal-only; (5) tumour-only; (6) DHS; (7) intron; and (8) Other. When comparing ARBS from tissue and ARBS LNCaP from cell line, only 59 SNPs (0.03%) overlapped between the two categories, and so we tested two separate models: (1) coding; (2) UTR; (3) promoter; (4) (ARBS tissue/ARBS LNCaP); (5) DHS; (6) intron; and (7) other. For comparisons between LNCAP, PREC and RWPE1 using DHS we tested each pair of cell lines using the joint model: (1) coding; (2) UTR; (3) promoter; (4) DHS particular to one cell line; (5) DHS common to both cell lines; (6) DHS particular to other cell line; (7) DHS other cell lines; (8) Intron; and (9) Other. For comparisons between enhancers and super enhancers, we used the 86 cell-type-specific annotations from REF (ref. 26), testing each enhancer or super enhancer separately in the following joint model: (1) coding; (2) UTR; (3) promoter, (4) (enhancer/super enhancer for cell-type of interest); (5) DHS; (6) intron; (7) other. Of these, 49 cell types yielded model convergence for both the enhancer and corresponding super enhancer and were used to estimate means and correlation. The order and grouping of marginally significant annotations into epigenetic mark and cell type (for example, in Table 1) are listed in the Supplementary Data. For each of the 82 individually significant annotations, we re-evaluated them jointly with the selected model in the following hierarchical joint model: (1) coding; (2) UTR; (3) promoter; (4) LNCaP:H3k27ac; (5) ARBS; (6) DHS cancer; (7) (functional annotation of interest); (8) DHS; (9) intron; and (10) other. Only functional annotations that converged were reported in the Supplementary Data.

Accuracy of estimates from typed variants in simulations

The variance component model has previously been shown to yield robust estimates under the assumption that causal variants are typed and uniformly sampled from a given functional category^13,20,21. Here, we perform simulations using the UK10K whole-genome sequence data to confirm the validity of this model for our annotations, and to assess how representative SNP estimates are of true underlying biology at common sequenced variants. Overall, the simulations involve using real markers to generate additive, polygenic phenotypes with a given heritability and then estimating the heritability with the variance component model. We evaluated the UK10K data for three types of SNPs: (i) common sequenced variants (7,534,538 SNPs); (ii) UK10K SNPs typed by the iCOGS platform (178,509; 95% of iCOGS SNPs); and (iii) UK10K SNPs typed and imputed by the iCOGS platform (1,655,723; 87% of the iCOGS imputed SNPs). We focused on the LNCaP:H3k27ac annotation (which was most significant in our data) to evaluate the main joint model. All phenotypes were simulated by drawing 5,000 causal variants randomly from the specified categories and sampling causal effect-sizes from a normal distribution such that SNPs either explain equal variance (the model assumption) or variance in proportion to their MAF. The phenotype was then generated as the dot product of genotype and effect-size with random noise added to fix heritability at 50%. Phenotypes were simulated thousands of times until the standard error over simulations was low enough to evaluate unbiasedness.

We confirmed that estimates of from a polygenic trait were accurate under the model where causal variants are typed (Supplementary Table S1). Under the null, the LNCaP H3k27ac component is expected to explain 3.22% of the SNP heritability, and the model estimated 3.50% (0.22%) and 3.68% (0.21%) under a low-frequency and high-frequency disease architecture, respectively (Supplementary Table S1). None of the estimates were significantly different from the truth given the number of components tested. Under a scenario where LNCaP H3k27ac explains 50% of the , the model estimated 51.13% (0.40%) and 46.98% (0.35%) under a low-frequency and high-frequency disease architecture, respectively (Supplementary Table S1). Although the high-frequency architecture (where common variants explain more variance in trait than rare variants) represents a substantial model misspecification, our simulations show that this does not introduce substantial bias and is likely to slightly underestimate the SNP heritability at the focal chromatin mark. In all cases, the empirical standard deviation over 500 simulations was similar to the average analytical s.e.m. computed by GCTA (REML algorithm), thus showing that that analytical standard error is well calibrated (Supplementary Table S1). We note that the standard error is inversely related to the sample size^39,40, and is therefore much higher in these simulations than in the iCOGS data which is 14-fold larger.

Lastly, we performed the real data partitioned analysis in subsets of individuals to evaluate biasedness and power to detect significant enrichment. We confirmed that no significant differences were observed between estimates from the entire study compared with those averaged across subsets of the study (Supplementary Fig. 3). As such, we can confidently report estimates and bounds on the enrichment observed in the entire study that will hold for larger studies. Furthermore, all but one of the significant components from the main model remained significant in smaller samples (ARBS), making it unlikely that they were affected by winner’s curse. Recent work has quantified the theoretical relationship between estimation error and effective sample size for individual components^39,40.

Causal variants not tagged on the iCOGS genotyping platform

We used the sequenced UK10K common variants to evaluate how well the iCOGS genotyped and imputed SNPs captured underlying heritability by simulating phenotypes using causal variants from sequencing and estimating heritability from the iCOGS SNPs (that is, hiding variants that were not genotyped or imputed, Supplementary Table S2). 83% of common UK10K SNPs lie within 100 kb of an iCOGS SNP, so some common variation is likely to be partially tagged by the chip. If the imputed and/or genotyped SNPs served as a good proxy for the common sequence variation, then we would expect their estimates of to match the simulated fractions. When no functional category was enriched with causal variants, small but significant differences were observed for genotyped coding variants (4.75% estimated as compared with simulated 0.67%) and imputed intergenic variants (56.09% as compared with 50.52% simulated) but not the focal LNCaP:H3k27ac category. Similar deviations were observed for the disease architecture where common variants explain more variance in trait than rare variants (Supplementary Table S2). When causal variants where enriched within LNCaP:H3k27ac category, deviations between simulated and estimated SNP heritability were larger (Supplementary Table S2). Most of this deviation was due to a significant underestimate at LNCaP:H3k27ac, which was simulated to explain 50% of but explained only 12.55% (s.e.m.0.92%) and 30.92% (s.e.m. 1.09%) from genotyped and imputed SNPs, respectively. This heritability was distributed across all the remaining components, particularly in intergenic SNPs for the genotyped estimate and DHS SNPs for the imputed estimate, which tend to be nearby.

Overall, our simulations showed that the model is highly accurate when all causal variants are typed. When considering enrichment from untyped causal variants, the imputed estimate was consistently closer to the truth than the genotyped estimate. Most importantly, the estimate from the focal category (LNCaP H3k27ac in our simulations) was shown to be highly conservative both in the null and in the enriched scenario and unlikely to be biased due to tagging of untyped markers. We note that previous work has shown estimates from imputed SNPs (but not genotyped SNPs) may be contaminated by markers very close to an enriched annotation¹²; as such we focused our results on the densely genotyped iCOGS variants which are expected to be conservative, and primarily used imputed data for validation across data sets.

Estimates of from African American samples

To assess potential biases in estimating from an admixed population, we performed separate simulations in the AAPC data where causal variants were specifically sampled from varying F_ST bins. This framework evaluated the potential bias resulting from markers that had drifted to different frequencies in the two populations. The F_ST was estimated out-of-sample in the HapMap CEU European and YRI Yoruba populations. We tested the null six-component model (Coding, UTR, Promoter, DHS, Intron, Other) and observed no significant deviations from the null under any class of differentiated SNPs (Supplementary Table S3). However, we note that total was simulated at 0.50 but was inferred at 0.38–0.66 across increasing quintiles of causal F_ST (Supplementary Table 3), indicating that even with well-calibrated estimates of enrichment the total estimate may be biased upwards if the causal SNPs are highly differentiated (observed in this simulation when mean causal F_ST>=0.35).

Genetic prediction

We sought to validate the utility of our functional atlas by applying it to genetic prediction. The aim of genetic prediction is to use training individuals with genetics (for example, SNPs) and diagnosed phenotype to accurately predict the phenotype into individuals with only genetic data available^41,42. Here, we focus on correlation of predicted phenotype with true phenotype (R²), as it has a natural relationship to SNP heritability^12,42. Intuitively, better localization of the true effect-sizes will reduce noise in training the predictor and increase accuracy. If the functional atlas identified regions with increased heritability, this information should significantly improve the prediction. We evaluated three standard models of risk prediction: GRS; BLUP (ref. 43); and multi-component BLUP (ref. 14). The GRS was computed as a sum over SNPs of the log odds-ratios from the training sample⁴¹. The set of SNPs used was either the genome-wide significant markers in the training set (restricted to one per 1 MB locus) or the genome-wide significant markers identified in a recent large GWAS of PrCa². In contrast to the GRS, the BLUP used all markers in the data to form the prediction. The standard BLUP was estimated using GCTA over all SNPs. The multi-component BLUP was estimated using the components in the selected model (jointly) to compute a single score equal to the sum of the predictions from each component weighted by their component-specific . This is analogous to specifying a different prior on the effect-size variance in each component. All predictions were carried out by cross-validation in the full iCOGS data, removing 1,000 individuals in each fold. Prediction R² was then computed from a regression of phenotype on the predictor score with 10 PCs included as covariates to account for ancestry, subsequently subtracting the R²=0.021 from a model with PCs only. P values were estimated for each of the coefficients in the multiple regression of phenotype ∼ GRS+single-BLUP+multi-BLUP+PCs. To ensure that prediction across data sets was independent, we carefully removed all iCOGS individuals with a GRM value of >0.05 to any individual in the BPC3 when computing BLUP coefficients. We separately analysed the predictor in 26,000 iCOGS samples that had age at diagnosis, but did not observe significant differences before/after including age as a covariate.

Additional information

How to cite this article: Gusev, A. et al. Atlas of prostate cancer heritability in European and African-American men pinpoints tissue-specific regulation. Nat. Commun. 7:10979 doi: 10.1038/ncomms10979 (2016).

References

Hjelmborg, J. B. et al. The heritability of prostate cancer in the Nordic twin study of cancer. Cancer Epidemiol. Biomarkers Prev. 23, 2303–2310 (2014).
Article Google Scholar
Al Olama, A. A. et al. A meta-analysis of 87,040 individuals identifies 23 new susceptibility loci for prostate cancer. Nat. Genet. 46, 1103–1109 (2014).
Article CAS Google Scholar
Castro, E. et al. Germline BRCA mutations are associated with higher risk of nodal involvement, distant metastasis, and poor survival outcomes in prostate cancer. J. Clin. Oncol. 31, 1748–1757 (2013).
Article CAS Google Scholar
Eeles, R. A. et al. Identification of 23 new prostate cancer susceptibility loci using the iCOGS custom genotyping array. Nat. Genet. 45, 385–391 (2013).
Article CAS Google Scholar
Saunders, E. J. et al. Fine-mapping the HOXB region detects common variants tagging a rare coding allele: evidence for synthetic association in prostate cancer. PLoS Genet. 10, e1004129 (2014).
Article Google Scholar
Ewing, C. M. et al. Germline mutations in HOXB13 and prostate-cancer risk. N. Engl. J. Med. 366, 141–149 (2012).
Article CAS Google Scholar
Hazelett, D. J. et al. Comprehensive functional annotation of 77 prostate cancer risk loci. PLoS Genet. 10, e1004102 (2014).
Article Google Scholar
Hazelett, D. J., Coetzee, S. G. & Coetzee, G. A. A rare variant, which destroys a FoxA1 site at 8q24, is associated with prostate cancer risk. Cell Cycle 12, 379–380 (2013).
Article CAS Google Scholar
ENCODE Project Consortium. et al. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Stamatoyannopoulos, J. A. What does our genome encode? Genome Res. 22, 1602–1611 (2012).
Article CAS Google Scholar
Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195 (2012).
Article CAS ADS Google Scholar
Gusev, A. et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. Am. J. Hum. Genet. 95, 535–552 (2014).
Article CAS Google Scholar
Yang, J. et al. Common SNPs explain a large proportion of the heritability for human height. Nat. Genet. 42, 565–569 (2010).
Article CAS Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS Google Scholar
Yang, J. et al. Genome partitioning of genetic variation for complex traits using common SNPs. Nat. Genet. 43, 519–525 (2011).
Article CAS Google Scholar
Schumacher, F. R. et al. Genome-wide association study identifies new prostate cancer susceptibility loci. Hum. Mol. Genet. 20, 3867–3875 (2011).
Article CAS Google Scholar
Haiman, C. A. et al. Characterizing genetic risk at known prostate cancer susceptibility loci in African Americans. PLoS Genet. 7, e1001387 (2011).
Article CAS Google Scholar
Pomerantz, M. et al. The androgen receptor cistrome is extensively reprogrammed in human prostate tumorigenesis. Nat. Genet. 47, 1346–1351 (2015).
Shlyueva, D., Stampfel, G. & Stark, A. Transcriptional enhancers: from properties to genome-wide predictions. Nat. Rev. Genet. 15, 272–286 (2014).
Article CAS Google Scholar
Gusev, A. et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. Am. J. Hum. Genet. 95, 535–552.
Lee, S. H. et al. Estimation of SNP heritability from dense genotype data. Am. J. Hum. Genet. 93, 1151–1155 (2013).
Article CAS Google Scholar
Cancer Facts & Figures for African Americans 2009–2010. Accessed on: December 2015 .
Hoffman, M. M. et al. Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res. 41, 827–841 (2013).
Article CAS Google Scholar
Haiman, C. A. et al. Multiple regions within 8q24 independently affect risk for prostate cancer. Nat. Genet. 39, 638–644 (2007).
Article CAS Google Scholar
Hoffman, M. M. et al. Unsupervised pattern discovery in human chromatin structure through genomic segmentation. Nat. Methods 9, 473–476 (2012).
Article CAS Google Scholar
Hnisz, D. et al. Super-enhancers in the control of cell identity and disease. Cell 155, 934–947 (2013).
Article CAS Google Scholar
Farh, K. K. et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature 518, 337–343 (2014).
Article ADS Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Pickrell, J. K. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am. J. Hum. Genet. 94, 559–573 (2014).
Article CAS Google Scholar
Williams, A. L., Patterson, N., Glessner, J., Hakonarson, H. & Reich, D. Phasing of many thousands of genotyped samples. Am. J. Hum. Genet. 91, 238–251 (2012).
Article CAS Google Scholar
Howie, B., Marchini, J. & Stephens, M. Genotype imputation with thousands of genomes. G3 (Bethesda) 1, 457–470 (2011).
Article Google Scholar
Schumacher, F. R. et al. Genome-wide association study identifies new prostate cancer susceptibility loci. Hum. Mol. Genet. 20, 3867–3875 (2011).
Article CAS Google Scholar
Haiman, C. A. et al. Characterizing genetic risk at known prostate cancer susceptibility loci in African Americans. PLoS Genet. 7, e1001387 (2011).
Article CAS Google Scholar
Kolonel, L. N. et al. A multiethnic cohort in Hawaii and Los Angeles: baseline characteristics. Am. J. Epidemiol. 151, 346–357 (2000).
Article CAS Google Scholar
Barrett, J. C. et al. Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region. Nat. Genet. 41, 1330–1334 (2009).
Article CAS Google Scholar
International Multiple Sclerosis Genetics Consortium. et al. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis. Nature 476, 214–219 (2011).
Burton, P. R. et al. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661–678 (2007).
Article CAS ADS Google Scholar
Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nat. Genet. 46, 100–106 (2014).
Article Google Scholar
Visscher, P. M. & Goddard, M. E. A general unified framework to assess the sampling variance of heritability estimates using pedigree or marker-based relationships. Genetics 199, 223–232 (2015).
Article Google Scholar
Visscher, P. M. et al. Statistical power to detect genetic (co)variance of complex traits using SNP data in unrelated samples. PLoS Genet. 10, e1004269 (2014).
Article Google Scholar
International Schizophrenia, C.. et al. Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460, 748–752 (2009).
Wray, N. R. et al. Pitfalls of predicting complex traits from SNPs. Nat. Rev. Genet. 14, 507–515 (2013).
Article CAS Google Scholar
Robinson, G. K. That BLUP is a good thing: the estimation of random effects. Stat. Sci. 15–32 (1991).

Download references

Acknowledgements

This work was supported by NIH fellowship F32 GM106584 (AG), NIH grants R01 MH101244(A.G.), R01 CA188392 (B.P.), U01 CA194393(B.P.), R01 GM107427 (M.L.F.), R01 CA193910 (M.L.F./M.P.) and Prostate Cancer Foundation Challenge Award (M.L.F./M.P.). This study makes use of data generated by the Wellcome Trust Case Control Consortium and the Wellcome Trust Sanger Institute. A full list of the investigators who contributed to the generation of the Wellcome Trust Case Control Consortium data is available on www.wtccc.org.uk. Funding for the Wellcome Trust Case Control Consortium project was provided by the Wellcome Trust under award 076113. This study makes use of data generated by the UK10K Consortium. A full list of the investigators who contributed to the generation of the data is available online (http://www.UK10K.org). The PRACTICAL consortium was supported by the following grants: European Commission's Seventh Framework Programme grant agreement n° 223175 (HEALTH-F2-2009-223175), Cancer Research UK Grants C5047/A7357, C1287/A10118, C5047/A3354, C5047/A10692, C16913/A6135 and The National Institute of Health (NIH) Cancer Post-Cancer GWAS initiative Grant: no. 1 U19 CA 148537-01 (the GAME-ON initiative); Cancer Research UK (C1287/A10118, C1287/A 10710, C12292/A11174, C1281/A12014, C5047/A8384, C5047/A15007 and C5047/A10692), the National Institutes of Health (CA128978) and Post-Cancer GWAS initiative (1U19 CA148537, 1U19 CA148065 and 1U19 CA148112—the GAME-ON initiative), the Department of Defense (W81XWH-10-1-0341), A Linneus Centre (Contract ID 70867902), Swedish Research Council (grant no K2010-70X-20430-04-3), the Swedish Cancer Foundation (grant no 09-0677), grants RO1CA056678, RO1CA082664 and RO1CA092579 from the US National Cancer Institute, National Institutes of Health; US National Cancer Institute (R01CA72818); support from The National Health and Medical Research Council, Australia (126402, 209057, 251533, 396414, 450104, 504700, 504702, 504715, 623204, 940394 and 614296); NIH grants CA63464, CA54281 and CA098758; US National Cancer Institute (R01CA128813, PI: J.Y. Park); Bulgarian National Science Fund, Ministry of Education and Science (contract DOO-119/2009; DUNK01/2–2009; DFNI-B01/28/2012); Cancer Research UK grants [C8197/A10123] and [C8197/A10865]; grant code G0500966/75466; NIHR Health Technology Assessment Programme (projects 96/20/06 and 96/20/99); Cancer Research UK grant number C522/A8649, Medical Research Council of England grant number G0500966, ID 75466 and The NCRI, UK; The US Dept of Defense award W81XWH-04-1-0280; Australia Project Grant [390130, 1009458] and Enabling Grant [614296 to APCB]; the Prostate Cancer Foundation of Australia (Project Grant [PG7] and Research infrastructure grant [to APCB]); NIH grant R01 CA092447; Vanderbilt-Ingram Cancer Center (P30 CA68485); Cancer Research UK [C490/A10124] and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge; Competitive Research Funding of the Tampere University Hospital (9N069 and X51003); Award Number P30CA042014 from the National Cancer Institute.

Author information

Authors and Affiliations

Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, 02115, Massachusetts, USA
Alexander Gusev, Lisa B. Signorello, J. Michael Gaziano, Edward Giovannucci, David J. Hunter, Peter Kraft, Sara Lindström, Dimitrios Trichopoulos & Alkes L. Price
Broad Institute of Harvard and MIT, Cambridge, 02142, Massachusetts, USA
Alexander Gusev, Soumya Raychaudhuri, Alkes L. Price & Matthew L. Freedman
Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, 90095, California, USA
Huwenbo Shi, Gleb Kichaev & Bogdan Pasaniuc
Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School, Boston, 02115, Massachusetts, USA
Mark Pomerantz, Henry W. Long & Matthew L. Freedman
Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, 02115, Massachusetts, USA
Fugen Li, Henry W. Long & Matthew L. Freedman
Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, 02115, Massachusetts, USA
Fugen Li
Department of Preventative Medicine, Keck School of Medicine, University of Southern California/Norris Comprehensive Cancer Center, Los Angeles, 90033, California, USA
Sue A. Ingles, Graham Casey & Daniel O. Stram
University of Arizona College of Medicine and University of Arizona Cancer Center, Tucson, 85721, Arizona, USA
Rick A. Kittles
Department of Epidemiology, University of Texas M.D. Anderson Cancer Center, Houston, 77030, Texas, USA
Sara S. Strom
Department of Public Health Sciences, Henry Ford Hospital, Detroit, 48202, Michigan, USA
Benjamin A. Rybicki & Christine Neslund-Dudas
Department of Preventive Medicine, Stony Brook University, Stony Brook, 11794, New York, USA
Barbara Nemesure, M. Cristina Leske, Suh-Yuh Wu & Anslem J. M. Hennis
James Buchanan Brady Urological Institute, Johns Hopkins Hospital and Medical Institution, Baltimore, 21205, Maryland, USA
William B. Isaacs
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University School of Medicine, Nashville, 37232, Tennessee, USA
Wei Zheng & William J. Blot
Department of Urology, University of Texas M.D. Anderson Cancer Center, Houston, 77030, Texas, USA
Curtis A. Pettaway
Korle Bu Teaching Hospital, Accra, Ghana
Edward D. Yeboah, Yao Tettey, Richard B. Biritwum, Andrew A. Adjei & Evelyn Tay
University of Ghana Medical School, Accra, Ghana
Edward D. Yeboah, Yao Tettey, Richard B. Biritwum, Andrew A. Adjei & Evelyn Tay
Westat, Rockville, 20850, Maryland, USA
Ann Truelove & Shelley Niwa
School of Public Health, University of California, Berkeley, 94720, California, USA
Anand P. Chokkalingam
Cancer Prevention Institute of California, Fremont, 94538, California, USA
Esther M. John, Ann W. Hsing & Lisa Chu
Stanford University School of Medicine and Stanford Cancer Institute, Palo Alto, 94305, California, USA
Esther M. John, Ann W. Hsing & Lisa Chu
Department of Urology, Northwestern University, Chicago, 60611, Illinois, USA
Adam B. Murphy
International Epidemiology Institute, Rockville, 20850, Maryland, USA
Lisa B. Signorello & William J. Blot
The Translational Genomics Research Institute, Phoenix, 85004, Arizona, USA
John Carpten
Chronic Disease Research Centre and Faculty of Medical Sciences, University of the West Indies, Bridgetown, Barbados
Anslem J. M. Hennis
SWOG Statistical Center, Fred Hutchinson Cancer Research Center, Seattle, 98109, Washington, USA
Phyllis J. Goodman
Glickman Urological & Kidney Institute, Cleveland Clinic, Cleveland, 44195, Ohio, USA
Eric A. Klein
Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, 94118, California, USA
John S. Witte
Institute for Human Genetics, University of California, San Francisco, 94118, San Francisco, California, USA
John S. Witte
Department of Surgery, Makerere University College of Health Sciences, Kampala, 94118, Uganda
Sam Kaggwa
Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, 20892, Maryland, USA
Michael B. Cook, Sonja I. Berndt, Robert Hoover & Meredith Yeager
The Institute of Cancer Research, Sutton, SM2 5NG, UK
Rosalind A. Eeles, ZSofia Kote-Jarai, Michelle Guy, Koveela Govindasami, Daniel Leongamornlert, Emma J. Sawyer, Rosemary Wilkinson, Edward J. Saunders, Malgorzata Tymrakiewicz, Tokhir Dadaev, Angela Morgan, Cyril Fisher, Steve Hazel & Naomi Livni
Royal Marsden National Health Service (NHS) Foundation Trust, London and Sutton, UK
Rosalind A. Eeles
Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Strangeways Laboratory, Worts Causeway, Cambridge, CB1 8RN, UK
Douglas Easton, Ali Amin Al Olama, Sara Benlloch & Margaret Cook
Institute of Population Health, University of Manchester, Manchester, M13 9PL, UK
Kenneth Muir & Artitaya Lophatananon
Warwick Medical School, University of Warwick, Coventry, CV4 7AL, UK
Kenneth Muir & Artitaya Lophatananon
Cancer Epidemiology Centre, The Cancer Council Victoria, 615 St Kilda Road, Melbourne, 3004, Victoria, Australia
Graham G. Giles & Liesel M. Fitzgerald
Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Victoria, 3004, Australia
Graham G. Giles & John L. Hopper
Department of Pathology, Genetic Epidemiology Laboratory, The University of Melbourne, Grattan Street, Parkville, 3010, Victoria, Australia
Melissa C. Southey
Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Stockholm, 171 77, Sweden
Henrik Gronberg, Fredrik Wiklund, Markus Aly, Jan Adolfson, Paer Stattin, Jan-Erik Johansson, Carin Cavalli-Bjoerkman, Ami Karlsson & Michael Broms
Department of Clinical Sciences at Danderyds Hospital, Stockholm, 171 77, Sweden
Markus Aly
Department of Preventive Medicine, Keck School of Medicine, University of Southern California/Norris Comprehensive Cancer Center, Los Angeles, 90007, California, USA
Brian E. Henderson
Department of Medical Biochemistry and Genetics Institute of Biomedicine Kiinamyllynkatu 10, University of Turku, Turku, FI-20014, Finland
Johanna Schleutker
BioMediTech, University of Tampere and FimLab Laboratories, Tampere, 33200, Finland
Johanna Schleutker & Tiina Wahlfors
Department of Urology, Tampere University Hospital and Medical School, University of Tampere, Tampere, 33200, Finland
Teuvo L. J. Tammela
Department of Clinical Biochemistry, Herlev Hospital, Copenhagen University Hospital, Herlev Ringvej 75, Herlev, DK-2730, Denmark
Børge G. Nordestgaard, Maren Weischer & Sune F. Nielsen
Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, 1165, Densmark
Børge G. Nordestgaard & Sune F. Nielsen
Nuffield Department of Population Health, Cancer Epidemiology,
Tim J. Key & Ruth C. Travis
University of Oxford, Oxford, OX3 7LF, UK
Tim J. Key & Ruth C. Travis
Department of Oncology, University of Cambridge, Addenbrooke's Hospital, Box 279, Hills Road, Cambridge, CB2 0QQ
David E. Neal
Cancer Research UK Cambridge Research Institute, Li Ka Shing Centre, Cambridge, UK
David E. Neal
School of Social and Community Medicine, University of Bristol, Canynge Hall, 39 Whatley Road, Bristol, BS8 2PS, UK
Jenny L. Donovan, Paul Brown, Anne George, Gemma Marsden, Athene Lane & Michael Davis
Department of Public Health, Section for Epidemiology, Aarhus University, Aarhus, OX1 3PN, Denmark
Freddie C. Hamdy
Faculty of Medical Science, University of Oxford, John Radcliffe Hospital, Oxford, OX1 3PN, UK
Freddie C. Hamdy
Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Strangeways Laboratory, Worts Causeway, CB1 8RN, Cambridge, UK
Paul Pharoah & Nora Pashayan
Department of Applied Health Research, University College London, 1-19 Torrington Place, London, WC1E 7HB, UK
Nora Pashayan
Clinical Gerontology Unit, University of Cambridge, Cambridge, CB1 8RN, UK
Kay-Tee Khaw
Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, 98109-1024, Washington, USA
Janet L. Stanford
Department of Epidemiology, School of Public Health, University of Washington, Seattle, 98109, Washington, USA
Janet L. Stanford
Mayo Clinic, Rochester, 55905, Minnesota, USA
Stephen N. Thibodeau, Shannon K. McDonnell, Daniel J. Schaid, Lori Tillmans, Shaun Riska & Liang Wang
Institute of Human Genetics, University Hospital Ulm, Ulm, 89081, Germany
Christiane Maier & Walther Vogel
Department of Urology, University Hospital Ulm, Ulm, 89081, Germany
Manuel Luedeke & Antje Rinckleb
Department of Urology, Klinikum rechts der Isar der Technischen Universitaet Muenchen, Munich, 81675, Germany
Kathleen Herkommer
Division of Urologic Surgery, Brigham and Womens Hospital, Dana-Farber Cancer Institute, 75 Francis Street, Boston, 02115, Massachusetts, USA
Adam S. Kibel
Department of Genetics and Pathology, International Hereditary Cancer Center, Pomeranian Medical University, Szczecin, Poland
Cezary Cybulski, Dominika Wokolorczyk, Wojciech Kluzniak & Jan Lubiski
Division of Genetic Epidemiology, Department of Medicine, University of Utah School of Medicine, Salt Lake City, 84132, Utah, USA
Lisa Cannon-Albright & Craig Teerlink
George E. Wahlen Department of Veterans Affairs Medical Center, Salt Lake City, 84132, Utah, USA
Lisa Cannon-Albright & Craig Teerlink
Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, 69120, Germany
Hermann Brenner, Aida K. Dieffenbach & Volker Arndt
German Cancer Consortium (DKTK), Heidelberg, 69120, Germany
Hermann Brenner & Aida K. Dieffenbach
Department of Cancer Epidemiology, Moffitt Cancer Center, 12902 Magnolia Drive, Tampa, 33612, Florida, USA
Jong Y. Park, Thomas A. Sellers, Julio Pow-Sang, Hyun Park, Selina Radlein, Maria Rincon, James Haley & Babu Zachariah
Biostatistics Program, Moffitt Cancer Center, 12902 Magnolia Drive, Tampa, 33612, Florida, USA
Hui-Yi Lin
Department of Urology and Alexandrovska University Hospital, Medical University, Sofia, 1431, Bulgaria
Chavdar Slavov & Elenko Popov
Department of Medical Chemistry and Biochemistry, Molecular Medicine Center, Medical University, Sofia, 2 Zdrave Str., Sofia, 1431, Bulgaria
Radka Kaneva, Vanio Mitev, Darina Kachakova & Atanaska Mitkova
Australian Prostate Cancer Research Centre-Qld, Institute of Health and Biomedical Innovation and School of Biomedical Science, Queensland University of Technology, Brisbane, 4000, Queensland, Australia
Jyotsna Batra, Judith A. Clements, Peter Heathcote, Glenn Wood, Greg Malone, Pamela Saunders, Allison Eckert, Trina Yeadon, Kris Kerr, Angus Collins, Megan Turner, Srilakshmi Srinivasan, Mary-Anne Kedda, Kimberly Alexander & Tracy Omara
Molecular Cancer Epidemiology Laboratory, Queensland Institute of Medical Research, Brisbane, 4000, Queensland, Australia
Amanda Spurdle
Department of Genetics, Portuguese Oncology Institute, Porto, 4200, Portugal
Manuel R. Teixeira, Paula Paulo, Sofia Maia, Rui Henrique, Pedro Pinto, Joana Santos & Joao Barros-Silva
Biomedical Sciences Institute (ICBAS), University of Porto, Porto, 4200, Portugal
Manuel R. Teixeira
The University of Surrey, Guildford, GU2 7XH, Surrey, UK
Hardev Pandha, Agnieszka Michael, Andrzej Kierzek, Aleksandrina Vlahova, Tihomir Dikov & Svetlana Christova
Department of Preventive Medicine, Norris Cancer Center, University of Southern California Keck School of Medicine, Los Angeles, 90033, California, USA
David V. Conti, Frederick R. Schumacher & Christopher A. Haiman
Division of Cancer Epidemiology and Genetics, Nutritional Epidemiology Branch, National Cancer Institute, US National Institute of Health, Bethesda, 20892, Maryland, USA
Demetrius Albanes
Department of Radiation Oncology and Molecular Radiation Sciences, Johns Hopkins Medicine, Baltimore, 21287, Maryland, USA
Christine Berg & Kim Overvad
Genomic Epidemiology Group, German Cancer Research Center (DKFZ), Heidelberg, 69121, Germany
Daniele Campa
Urologic Oncology, University of Colorado at Denver Health Sciences Center, Denver, 80230, Colorado, USA
E. David Crawford
Epidemiology Research Program, American Cancer Society, Atlanta, 30303, Georgia, USA
W. Ryan Diver, Susan M. Gapstur & Victoria L. Stevens
Department of Medicine, Harvard Medical School, Boston, 02115, Massachusetts, USA
J. Michael Gaziano
Division of Aging, Brigham and Women's Hospital, Boston, 02115, Massachusetts, USA
J. Michael Gaziano
Department of Nutrition, Harvard School of Public Health, Boston, 02115, Massachusetts, USA
Edward Giovannucci
International Agency for Research on Cancer, Lyon, 69008, France
Mattias Johansson
Department of Surgical and Perioperative Sciences, Urology and Andrology, Umeå University, Umeå, 907 36, Sweden
Mattias Johansson
Department of Biostatistics, Harvard School of Public Health, Boston, 02115, Massachusetts, USA
Peter Kraft & Sara Lindström
Epidemiology Program, University of Hawaii Cancer Center, Honolulu, 96813, Hawaii, USA
Loic Le Marchand
Department of Epidemiology, Regional Health Authority, Murcia, 30009, Spain
Carmen Navarro
CIBER Epidemiología y Salud Pública (CIBERESP), Barcelona, 28029, Spain
Carmen Navarro
Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, SW7 2AZ, UK
Elio Riboli
Department of Genomics of Common Disease, School of Public Health, Imperial College London, London, SW7 2AZ, UK
Afshan Siddiq
Bureau of Epidemiologic Research, Academy of Athens, Athens, 106 79, Greece
Dimitrios Trichopoulos
Hellenic Health Foundation, Athens, 106 79, Greece
Dimitrios Trichopoulos
HuGeF Foundation, Torino, 10126, Italy
Paolo Vineis
School of Public Health, Imperial College London, London, SW7 2AZ, UK
Paolo Vineis
Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA
Gosia Trynka & Soumya Raychaudhuri
Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, CB10 1SA, UK
Gosia Trynka
Institute of Inflammation and Repair, University of Manchester, Manchester, M13 9PT, UK
Soumya Raychaudhuri
Departments of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, California, USA
Bogdan Pasaniuc
Department of Human Genetics, University of California Los Angeles, Los Angeles, 90095, California, USA
Bogdan Pasaniuc
Tissupath Pty Ltd., Melbourne, 3122, Victoria, Australia
John Pedersen
Department of Epidemiology, School of Health Sciences, University of Tampere, Tampere, 33014, Finland
Anssi Auvinen
Fimlab Laboratories, Tampere University Hospital, Tampere, Finland
Paula Kujala
Finnish Cancer Registry, Helsinki, Finland
Liisa Maeaettaenen
School of Medicine, University of Tampere, Tampere, Finland
Teemu Murtola
Department of Urology, Tampere University Hospital, Tampere, Finland
Teemu Murtola
Department of Urology, Helsinki University Central Hospital and University of Helsinki, Helsinki, 00100, Finland
Kimmo Taari
Department of Urology, Herlev Hospital, Copenhagen University Hospital, Herlev Ringvej 75, Herlev, DK-230, Denmark
Peter Klarskov
Department of Urology, Copenhagen Prostate Cancer Center, Rigshospitalet, Copenhagen University Hospital, Tagensvej 20, 7521, Copenhagen, DK-2200, Denmark
Andreas Roder & Peter Iversen
Department of Epidemiology and Biostatistics, School of Public Health, Imperial College, London, SW7 2AZ, UK
Hans Wallinder & Sven Gustafsson
CR-UK/YCR Sheffield Cancer Research Centre, University of Sheffield, Sheffield, S10 2TN, UK
Angela Cox
Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, 2525 West End Avenue, Suite 800, Nashville, 37232, Tennessee, USA
Wei Zheng
National Cancer Institute, NIH, 9609 Medical Center Drive, Suite 2W-172, MSC 9712, Bethesda, MD 20892-9712 (mail), Rockville, 20850 (FedEx/Courier), Maryland, USA
Lisa B. Signorello
International Epidemiology Institute, 1555 Research Blvd., Suite 550, Rockville, 20850, Maryland, USA
William J. Blot
Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, 37232, Tennessee, USA
William J. Blot
Saarland Cancer Registry, Saarbrücken, 66119, Germany
Christa Stegmaier
The University of Surrey, Guildford, GU2 7XH, Surrey
Huihai Wu

Authors

Alexander Gusev
View author publications
You can also search for this author in PubMed Google Scholar
Huwenbo Shi
View author publications
You can also search for this author in PubMed Google Scholar
Gleb Kichaev
View author publications
You can also search for this author in PubMed Google Scholar
Mark Pomerantz
View author publications
You can also search for this author in PubMed Google Scholar
Fugen Li
View author publications
You can also search for this author in PubMed Google Scholar
Henry W. Long
View author publications
You can also search for this author in PubMed Google Scholar
Sue A. Ingles
View author publications
You can also search for this author in PubMed Google Scholar
Rick A. Kittles
View author publications
You can also search for this author in PubMed Google Scholar
Sara S. Strom
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin A. Rybicki
View author publications
You can also search for this author in PubMed Google Scholar
Barbara Nemesure
View author publications
You can also search for this author in PubMed Google Scholar
William B. Isaacs
View author publications
You can also search for this author in PubMed Google Scholar
Wei Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Curtis A. Pettaway
View author publications
You can also search for this author in PubMed Google Scholar
Edward D. Yeboah
View author publications
You can also search for this author in PubMed Google Scholar
Yao Tettey
View author publications
You can also search for this author in PubMed Google Scholar
Richard B. Biritwum
View author publications
You can also search for this author in PubMed Google Scholar
Andrew A. Adjei
View author publications
You can also search for this author in PubMed Google Scholar
Evelyn Tay
View author publications
You can also search for this author in PubMed Google Scholar
Ann Truelove
View author publications
You can also search for this author in PubMed Google Scholar
Shelley Niwa
View author publications
You can also search for this author in PubMed Google Scholar
Anand P. Chokkalingam
View author publications
You can also search for this author in PubMed Google Scholar
Esther M. John
View author publications
You can also search for this author in PubMed Google Scholar
Adam B. Murphy
View author publications
You can also search for this author in PubMed Google Scholar
Lisa B. Signorello
View author publications
You can also search for this author in PubMed Google Scholar
John Carpten
View author publications
You can also search for this author in PubMed Google Scholar
M. Cristina Leske
View author publications
You can also search for this author in PubMed Google Scholar
Suh-Yuh Wu
View author publications
You can also search for this author in PubMed Google Scholar
Anslem J. M. Hennis
View author publications
You can also search for this author in PubMed Google Scholar
Christine Neslund-Dudas
View author publications
You can also search for this author in PubMed Google Scholar
Ann W. Hsing
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Chu
View author publications
You can also search for this author in PubMed Google Scholar
Phyllis J. Goodman
View author publications
You can also search for this author in PubMed Google Scholar
Eric A. Klein
View author publications
You can also search for this author in PubMed Google Scholar
John S. Witte
View author publications
You can also search for this author in PubMed Google Scholar
Graham Casey
View author publications
You can also search for this author in PubMed Google Scholar
Sam Kaggwa
View author publications
You can also search for this author in PubMed Google Scholar
Michael B. Cook
View author publications
You can also search for this author in PubMed Google Scholar
Daniel O. Stram
View author publications
You can also search for this author in PubMed Google Scholar
William J. Blot
View author publications
You can also search for this author in PubMed Google Scholar
Rosalind A. Eeles
View author publications
You can also search for this author in PubMed Google Scholar
Douglas Easton
View author publications
You can also search for this author in PubMed Google Scholar
ZSofia Kote-Jarai
View author publications
You can also search for this author in PubMed Google Scholar
Ali Amin Al Olama
View author publications
You can also search for this author in PubMed Google Scholar
Sara Benlloch
View author publications
You can also search for this author in PubMed Google Scholar
Kenneth Muir
View author publications
You can also search for this author in PubMed Google Scholar
Graham G. Giles
View author publications
You can also search for this author in PubMed Google Scholar
Melissa C. Southey
View author publications
You can also search for this author in PubMed Google Scholar
Liesel M. Fitzgerald
View author publications
You can also search for this author in PubMed Google Scholar
Henrik Gronberg
View author publications
You can also search for this author in PubMed Google Scholar
Fredrik Wiklund
View author publications
You can also search for this author in PubMed Google Scholar
Markus Aly
View author publications
You can also search for this author in PubMed Google Scholar
Brian E. Henderson
View author publications
You can also search for this author in PubMed Google Scholar
Johanna Schleutker
View author publications
You can also search for this author in PubMed Google Scholar
Tiina Wahlfors
View author publications
You can also search for this author in PubMed Google Scholar
Teuvo L. J. Tammela
View author publications
You can also search for this author in PubMed Google Scholar
Børge G. Nordestgaard
View author publications
You can also search for this author in PubMed Google Scholar
Tim J. Key
View author publications
You can also search for this author in PubMed Google Scholar
Ruth C. Travis
View author publications
You can also search for this author in PubMed Google Scholar
David E. Neal
View author publications
You can also search for this author in PubMed Google Scholar
Jenny L. Donovan
View author publications
You can also search for this author in PubMed Google Scholar
Freddie C. Hamdy
View author publications
You can also search for this author in PubMed Google Scholar
Paul Pharoah
View author publications
You can also search for this author in PubMed Google Scholar
Nora Pashayan
View author publications
You can also search for this author in PubMed Google Scholar
Kay-Tee Khaw
View author publications
You can also search for this author in PubMed Google Scholar
Janet L. Stanford
View author publications
You can also search for this author in PubMed Google Scholar
Stephen N. Thibodeau
View author publications
You can also search for this author in PubMed Google Scholar
Shannon K. McDonnell
View author publications
You can also search for this author in PubMed Google Scholar
Daniel J. Schaid
View author publications
You can also search for this author in PubMed Google Scholar
Christiane Maier
View author publications
You can also search for this author in PubMed Google Scholar
Walther Vogel
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Luedeke
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen Herkommer
View author publications
You can also search for this author in PubMed Google Scholar
Adam S. Kibel
View author publications
You can also search for this author in PubMed Google Scholar
Cezary Cybulski
View author publications
You can also search for this author in PubMed Google Scholar
Dominika Wokolorczyk
View author publications
You can also search for this author in PubMed Google Scholar
Wojciech Kluzniak
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Cannon-Albright
View author publications
You can also search for this author in PubMed Google Scholar
Craig Teerlink
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Brenner
View author publications
You can also search for this author in PubMed Google Scholar
Aida K. Dieffenbach
View author publications
You can also search for this author in PubMed Google Scholar
Volker Arndt
View author publications
You can also search for this author in PubMed Google Scholar
Jong Y. Park
View author publications
You can also search for this author in PubMed Google Scholar
Thomas A. Sellers
View author publications
You can also search for this author in PubMed Google Scholar
Hui-Yi Lin
View author publications
You can also search for this author in PubMed Google Scholar
Chavdar Slavov
View author publications
You can also search for this author in PubMed Google Scholar
Radka Kaneva
View author publications
You can also search for this author in PubMed Google Scholar
Vanio Mitev
View author publications
You can also search for this author in PubMed Google Scholar
Jyotsna Batra
View author publications
You can also search for this author in PubMed Google Scholar
Amanda Spurdle
View author publications
You can also search for this author in PubMed Google Scholar
Judith A. Clements
View author publications
You can also search for this author in PubMed Google Scholar
Manuel R. Teixeira
View author publications
You can also search for this author in PubMed Google Scholar
Hardev Pandha
View author publications
You can also search for this author in PubMed Google Scholar
Agnieszka Michael
View author publications
You can also search for this author in PubMed Google Scholar
Paula Paulo
View author publications
You can also search for this author in PubMed Google Scholar
Sofia Maia
View author publications
You can also search for this author in PubMed Google Scholar
Andrzej Kierzek
View author publications
You can also search for this author in PubMed Google Scholar
David V. Conti
View author publications
You can also search for this author in PubMed Google Scholar
Demetrius Albanes
View author publications
You can also search for this author in PubMed Google Scholar
Christine Berg
View author publications
You can also search for this author in PubMed Google Scholar
Sonja I. Berndt
View author publications
You can also search for this author in PubMed Google Scholar
Daniele Campa
View author publications
You can also search for this author in PubMed Google Scholar
E. David Crawford
View author publications
You can also search for this author in PubMed Google Scholar
W. Ryan Diver
View author publications
You can also search for this author in PubMed Google Scholar
Susan M. Gapstur
View author publications
You can also search for this author in PubMed Google Scholar
J. Michael Gaziano
View author publications
You can also search for this author in PubMed Google Scholar
Edward Giovannucci
View author publications
You can also search for this author in PubMed Google Scholar
Robert Hoover
View author publications
You can also search for this author in PubMed Google Scholar
David J. Hunter
View author publications
You can also search for this author in PubMed Google Scholar
Mattias Johansson
View author publications
You can also search for this author in PubMed Google Scholar
Peter Kraft
View author publications
You can also search for this author in PubMed Google Scholar
Loic Le Marchand
View author publications
You can also search for this author in PubMed Google Scholar
Sara Lindström
View author publications
You can also search for this author in PubMed Google Scholar
Carmen Navarro
View author publications
You can also search for this author in PubMed Google Scholar
Kim Overvad
View author publications
You can also search for this author in PubMed Google Scholar
Elio Riboli
View author publications
You can also search for this author in PubMed Google Scholar
Afshan Siddiq
View author publications
You can also search for this author in PubMed Google Scholar
Victoria L. Stevens
View author publications
You can also search for this author in PubMed Google Scholar
Dimitrios Trichopoulos
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Vineis
View author publications
You can also search for this author in PubMed Google Scholar
Meredith Yeager
View author publications
You can also search for this author in PubMed Google Scholar
Gosia Trynka
View author publications
You can also search for this author in PubMed Google Scholar
Soumya Raychaudhuri
View author publications
You can also search for this author in PubMed Google Scholar
Frederick R. Schumacher
View author publications
You can also search for this author in PubMed Google Scholar
Alkes L. Price
View author publications
You can also search for this author in PubMed Google Scholar
Matthew L. Freedman
View author publications
You can also search for this author in PubMed Google Scholar
Christopher A. Haiman
View author publications
You can also search for this author in PubMed Google Scholar
Bogdan Pasaniuc
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

The PRACTICAL consortium

Margaret Cook
, Michelle Guy
, Koveela Govindasami
, Daniel Leongamornlert
, Emma J. Sawyer
, Rosemary Wilkinson
, Edward J. Saunders
, Malgorzata Tymrakiewicz
, Tokhir Dadaev
, Angela Morgan
, Cyril Fisher
, Steve Hazel
, Naomi Livni
, Artitaya Lophatananon
, John Pedersen
, John L. Hopper
, Jan Adolfson
, Paer Stattin
, Jan-Erik Johansson
, Carin Cavalli-Bjoerkman
, Ami Karlsson
, Michael Broms
, Anssi Auvinen
, Paula Kujala
, Liisa Maeaettaenen
, Teemu Murtola
, Kimmo Taari
, Maren Weischer
, Sune F. Nielsen
, Peter Klarskov
, Andreas Roder
, Peter Iversen
, Hans Wallinder
, Sven Gustafsson
, Angela Cox
, Paul Brown
, Anne George
, Gemma Marsden
, Athene Lane
, Michael Davis
, Wei Zheng
, Lisa B. Signorello
, William J. Blot
, Lori Tillmans
, Shaun Riska
, Liang Wang
, Antje Rinckleb
, Jan Lubiski
, Christa Stegmaier
, Julio Pow-Sang
, Hyun Park
, Selina Radlein
, Maria Rincon
, James Haley
, Babu Zachariah
, Darina Kachakova
, Elenko Popov
, Atanaska Mitkova
, Aleksandrina Vlahova
, Tihomir Dikov
, Svetlana Christova
, Peter Heathcote
, Glenn Wood
, Greg Malone
, Pamela Saunders
, Allison Eckert
, Trina Yeadon
, Kris Kerr
, Angus Collins
, Megan Turner
, Srilakshmi Srinivasan
, Mary-Anne Kedda
, Kimberly Alexander
, Tracy Omara
, Huihai Wu
, Rui Henrique
, Pedro Pinto
, Joana Santos
& Joao Barros-Silva

Contributions

A.G., A.L.P., M.L.F., C.A.H. and B.P. planned the study and wrote the paper. A.G., H.S., G.K. and B.P. performed primary analysis. All authors contributed to study design, data collection and analysis of individual data and annotations.

Corresponding authors

Correspondence to Alexander Gusev or Bogdan Pasaniuc.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-3, Supplementary Tables 1-16 and Supplementary Note 1 (PDF 276 kb)

Supplementary Data 1

Estimates of partitioned heritability from all analyzed annotations, with corresponding web resource (XLSX 83 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Gusev, A., Shi, H., Kichaev, G. et al. Atlas of prostate cancer heritability in European and African-American men pinpoints tissue-specific regulation. Nat Commun 7, 10979 (2016). https://doi.org/10.1038/ncomms10979

Download citation

Received: 26 December 2014
Accepted: 03 February 2016
Published: 07 April 2016
DOI: https://doi.org/10.1038/ncomms10979

This article is cited by

Algorithmic fairness in artificial intelligence for medicine and healthcare
- Richard J. Chen
- Judy J. Wang
- Faisal Mahmood
Nature Biomedical Engineering (2023)
Family history of prostate cancer and prostate tumor aggressiveness in black and non-black men;results from an equal access biopsy study
- Kimberly R. Jenkins
- Taofik Oyekunle
- Emma H. Allott
Cancer Causes & Control (2021)
Behandeling van prostaatkanker bij mannen met een somatische of BRCA-kiembaanmutatie
- Niven Mehra
Tijdschrift voor Urologie (2020)
Prostate cancer reactivates developmental epigenomic programs during metastatic progression
- Mark M. Pomerantz
- Xintao Qiu
- Matthew L. Freedman
Nature Genetics (2020)
Probabilistic fine-mapping of transcriptome-wide association studies
- Nicholas Mancuso
- Malika K. Freund
- Bogdan Pasaniuc
Nature Genetics (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.