Large eQTL meta-analysis reveals differing patterns between cerebral cortical and cerebellar brain regions

Sieberts, Solveig K.; Perumal, Thanneer M.; Carrasquillo, Minerva M.; Allen, Mariet; Reddy, Joseph S.; Hoffman, Gabriel E.; Dang, Kristen K.; Calley, John; Ebert, Philip J.; Eddy, James; Wang, Xue; Greenwood, Anna K.; Mostafavi, Sara; Omberg, Larsson; Peters, Mette A.; Logsdon, Benjamin A.; De Jager, Philip L.; Ertekin-Taner, Nilüfer; Mangravite, Lara M.

doi:10.1038/s41597-020-00642-8

Download PDF

Analysis
Open access
Published: 12 October 2020

Large eQTL meta-analysis reveals differing patterns between cerebral cortical and cerebellar brain regions

Scientific Data volume 7, Article number: 340 (2020) Cite this article

11k Accesses
46 Citations
4 Altmetric
Metrics details

Subjects

Abstract

The availability of high-quality RNA-sequencing and genotyping data of post-mortem brain collections from consortia such as CommonMind Consortium (CMC) and the Accelerating Medicines Partnership for Alzheimer’s Disease (AMP-AD) Consortium enable the generation of a large-scale brain cis-eQTL meta-analysis. Here we generate cerebral cortical eQTL from 1433 samples available from four cohorts (identifying >4.1 million significant eQTL for >18,000 genes), as well as cerebellar eQTL from 261 samples (identifying 874,836 significant eQTL for >10,000 genes). We find substantially improved power in the meta-analysis over individual cohort analyses, particularly in comparison to the Genotype-Tissue Expression (GTEx) Project eQTL. Additionally, we observed differences in eQTL patterns between cerebral and cerebellar brain regions. We provide these brain eQTL as a resource for use by the research community. As a proof of principle for their utility, we apply a colocalization analysis to identify genes underlying the GWAS association peaks for schizophrenia and identify a potentially novel gene colocalization with lncRNA RP11-677M14.2 (posterior probability of colocalization 0.975).

Single-cell long-read sequencing-based mapping reveals specialized splicing patterns in developing and adult mouse and human brain

Article Open access 09 April 2024

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Tissue-specific enhancer–gene maps from multimodal single-cell data identify causal disease alleles

Article 09 April 2024

Introduction

Defining the landscape of genetic regulation of gene expression in a tissue-specific manner is useful for understanding both normal gene regulation and how variation in gene expression can alter disease risk. In the latter case, a variety of approaches now leverage the association between genetic variants and gene expression changes, including colocalization analysis^{1,2,3,4,5,6,7}, transcriptome-wide association studies (TWAS)^8,9, and gene regulatory network inference^{10,11,12,13,14,15,16}.

There has been a relative lack of expression quantitative trait loci (eQTL) studies from the brain. Because of the more accessible nature of tissues such as blood or lymphoblastoid cell lines (LCLs), much of the large-scale identification of expression quantitative trait loci (eQTL) has occurred in these tissues^17,18,19,20. For most other tissues, obtaining samples for RNA sequencing (RNA-seq) requires invasive biopsy, and brain tissues are typically only available in post-mortem brain samples. One effort, the Genotype-Tissue Expression (GTEx) project^21,22, has profiled a broad range of tissues (42 distinct) for eQTL discovery, however, samples sizes in brain have been small (typically 100–150). Recently, efforts to understand gene expression changes in neuropsychiatric^23,24,25,26 and neurodegenerative diseases^{27,28,29,30,31,32,33,34} have generated brain RNA-seq from disease and normal tissue, as well as genome-wide genotypes. These analyses have found little evidence for widespread disease-specific eQTL, as well as high cross-cohort overlap^24,35, meaning that most eQTL detected are disease-condition independent. This implies that meta-analysis across disease-based cohorts will capture eQTL which are unconfounded by disease state despite differences in disease ascertainment of the samples, and leverages thousands of available samples to produce a well-powered brain eQTL resource for use in downstream research.

Here we generate a public eQTL resource from cerebral cortical tissue using 1433 samples from 4 cohorts from the CommonMind Consortium (CMC)^23,24,26 and the Accelerating Medicines Partnership for Alzheimer’s Disease (AMP-AD) Consortium^30,31, as well as from cerebellum using 261 samples from AMP-AD. We show that eQTL discovered in GTEx, which consists of control individuals (without disease) only, are replicated in this larger brain eQTL resource. We further show widespread differences in regulation between cerebral cortex and cerebellum. To demonstrate one example of the utility of these data, we apply a colocalization analysis, which seeks to identify expression traits whose eQTL association pattern appears to co-occur at the same loci as the clinical trait association, to identify putative genes underlying the GWAS association peaks for schizophrenia³⁶.

Results

We generated eQTL from the publicly available AMP-AD (ROSMAP^27,28,35,37, Mayo RNAseq^29,38,39,40) and CMC^23,24,26 (MSSM-Penn-Pitt^24,26, HBCC²⁶) cohorts with available genotypes and RNA-seq data, using a common analysis pipeline (Supplementary Table 1) (https://www.synapse.org/#!Synapse:syn17015233). Analyses proceeded separately by cohort. Briefly, the RNA-seq data were normalized for gene length and GC content prior to adjustment for clinical confounders, processing batch information, and hidden confounders using Surrogate Variable Analysis (SVA)⁴¹. Genes having at least 1 count per million (CPM) in at least 50% of samples were retained for downstream analysis (Supplementary Table 2). Genotypes were imputed to the Haplotype Reference Consortium (HRC) reference panel⁴². eQTL were generated adjusting for diagnosis (AD, control, other for AMP-AD cohorts and schizophrenia, control, other for CMC cohorts) and principal components of ancestry separately for ROSMAP, Mayo temporal cortex (TCX), Mayo cerebellum (CER), MSSM-Penn-Pitt, and HBCC. For HBCC, which had a small number of samples derived from infant and adolescents, we excluded samples with age-of-death less than 18, to limit heterogeneity due to differences between the mature and developing brain.

We then performed a meta-analysis using the eQTL from cortical brain regions from the individual cohorts (dorsolateral prefrontal cortex (DLPFC) from ROSMAP, MSSM-Penn-Pitt, and HBCC and TCX from Mayo). The meta-analysis identifies substantially more eQTL than the individual cohorts (Table 1, Fig. 1). There is a strong relationship between the sample size in the individual cohorts and meta-analysis and the number of significant eQTL and genes with eQTL (Fig. 1b,c). Notably, the meta-analysis identified significant eQTL (at FDR ≤ 0.05) in >1000 genes for which no eQTL were observed in any individual cohort. Additionally, we find significant eQTL for 18,295 (18,433 when considering markers with minor allele frequency (MAF) down to 1%) of the 19,392 genes included in the analysis.

Table 1 eQTL results from individual cohorts and meta-analysis.

Full size table

We then compared our cortical eQTL to those from GTEx (v7)²¹, which is the most comprehensive brain eQTL database available in terms of number of available brain tissues (Table 1, Table 2). Due to the substantially larger power in these data, we find >3.8 million eQTL not identified in GTEx cortical regions (Anterior Cingulate Cortex, Cortex or Frontal Cortex) and we find eQTL for >11,000 genes with no eQTL in these cortical regions in GTEx. While GTEx employs a stricter approach to the control of false discovery rate (FDR), we find that re-analysis of the GTEx cortical regions using an approach similar to ours (see Methods) did not account for the number of eQTL and genes with eQTL discovered in this analysis, but not in GTEx (3,619,693 and 6,866 for eQTL and genes, respectively, when using the less conservative approach). Next, we evaluated the replication within our cortical and cerebellar eQTL of the region specific eQTL identified in GTEx. The cortical eQTL generated through the current analyses strongly replicate the eQTL available through GTEx, not only for cortical regions, but for all brain regions including cervical spinal cord (Table 2). Interestingly, the replication in these cortical eQTL of eQTL derived from the two GTEx cerebellar brain regions (cerebellum and cerebellar hemisphere) is consistently lower than for other brain regions represented in GTEx. However, replication of GTEx cerebellar eQTL is high when compared to the cerebellar eQTL generated in this analysis from the Mayo Clinic CER samples. We also performed the reverse comparison, by examining the replication of our eQTL in those region-specific eQTL identified in GTEx. Unsurprisingly, the replication levels were substantially lower, due to the lower power in the GTEx analyses. Replication rates were not substantially changed by using GTEx eQTL discovered using our less conservative approach.

Table 2 Replication rates between GTEx and publicly available eQTL from these analyses.

Full size table

Additionally, we compared our eQTL to a publically available fetal brain eQTL resource⁴³ and found good replication of these eQTL as well (estimated replication rate π₁ = 0.909 for the cortical meta-analysis, and π₁ = 0.861 for cerebellum), though somewhat lower than the replication observed in the GTEx cohorts, which are comprised of adult-derived samples.

Finally, as a proof of concept, we performed a colocalization analysis between our eQTL meta-analysis and the Psychiatric Genomics Consortium (PGC) v2 schizophrenia GWAS summary statistics³⁶. Seventeen genes showed posterior probability of colocalization using coloc⁷ (PP(H₄)) > 0.7 (Table 3), with 3 showing PP(H₄) > 0.95 (FURIN, ZNF823, RP11-677M14.2). FURIN, having previously identified as a candidate through colocalization²⁴ has recently been shown to reduce brain-derived neurotrophic factor (BDNF) maturation and secretion when inhibited by miR-338-3p⁴⁴. ZNF823 has been identified in previous colocalization analyses^45,46. RP11-677M14.2, a lncRNA located inside NRGN, while not previously identified through colocalization analysis, has been shown to be down-regulated in the amygdala of schizophrenia patients⁴⁷. Noteably, NRGN does not appear to show eQTL colocalization (PP(H₄) = 0.006), instead showing strong evidence for the eQTL and GWAS associations occurring independently (PP(H₃) = 0.994). Two additional strong colocalizations THOC7 (PP(H₄) = 0.943) and FAM85B (PP(H₄) = 0.948) show other potential candidates in the region (Supplementary Table 3). At the THOC7 locus, the competing gene, C3orf49 shows slightly lower strength for colocalization (PP(H₄) = 0.820), and the associations do not appear to be independent (R² between best SNPs = 0.979). At the FAM85B locus, the competing pseudo gene FAM86B3P shows substantially lower evidence for colocalization (PP(H₄) = 0.513) and in this case too, the associations appear to be non-independent (R² = 0.902).

Table 3 Top colocalized genes as inferred between meta-analysis eQTL and the PGC2 schizophrenia GWAS.

Full size table

Discussion

Using resources generated in the AMP-AD and CMC consortia, we have generated a well-powered brain eQTL resource for use by the scientific community. Unsurprisingly, we see a strong relationship between the number of significant eQTL, as well as genes with significant eQTL, and sample size using analyses from the individual cohorts as well as the meta-analysis of those cohorts. This result has previously been shown for lower sample sizes²¹. We also show higher replication of GTEx eQTL in the meta-analysis relative to the individual cohorts. These conclusions appear to be independent of methodological differences between our analysis and the one done by GTEx.

Notably, we find significant eQTL for nearly every gene in our analysis, which include all but very lowly expressed genes (less than 1 cpm in more than 50% of samples). The wide discovery of eQTL is potentially beneficial for analyses utilizing these results, such as colocalization analysis or TWAS imputation, because more genes with significant eQTL means more genes can be evaluated with these approaches. Because we have discovered eQTL for most genes, further increasing sample size will not substantially increase the number of genes with significant eQTL. However it is likely that the number of significant eQTL associations within each gene would continue to increase, which may include additional associations tagging the same regulator or independent associations tagging weaker regulators, along with the accuracy of estimated effect sizes. This will result in a more accurate landscape of regulatory association, which will improve the ability to fine-map causal regions, and colocalize eQTL signal with clinical traits of interest. Thus, it will be valuable to continue to update this meta-analysis with additional data from these consortia and other resources as they become available, and continue to improve this resource as future data permits. Future work may also focus on using well-powered analysis to study the landscape of causal variation and co-variation in gene regulation.

We found distinct eQTL patterns across cerebral cortical and cerebellar brain regions in our resource. Specifically, comparison of eQTL from our resource with those from GTEx shows high replication for the majority of brain regions. However, cerebellar regions show consistently lower replication with the cerebral cortical eQTL generated here. In contrast, the cerebellar eQTL generated from the Mayo Clinic study replicate GTEx cerebellar eQTL at a substantially higher rate, suggesting a different pattern of regulatory variation affecting expression in cerebellum versus other brain regions. Indeed, epigenomic analyses show substantial differences between cerebellar and cerebral cortical regions^48,49,50,51, particularly in methylation patterns, which could drive different eQTL association patterns. This is further corroborated by the observation of substantial coexpression differences between cerebellar and other brain regions⁵². These effects could be due to differences in cell type composition, with cerebellar regions consisting of substantially more neurons than other brain regions⁵³. This is supported by a gene enrichment analysis of genes showing different eQTL association patterns between cerebellum and cortex, which showed that many of the top gene sets were neuron or signaling related (Supplementary Table 4). One recent report suggests that there are also widespread differences in histone modifications within cell types derived from cerebellar and cortical regions⁵⁴, though this effect had not been noted in other studies. In particular, Ma et al.⁵⁴ observed that both neuronal and non-neuronal cell types show differing histone modifications across tissue of origin. Further work is necessary to confirm this finding and to develop models to deconvolve the cell-type specific regulatory effects in different brain regions^55,56,57, however our analysis demonstrates that this meta-analysis is representative of eQTL across the majority of brain regions, with the exception of cerebellum. Future meta-analytic analyses may also cast a wider net in terms of brain regions included.

The replication of fetal eQTL, while significant, is somewhat lower than the replication of adult eQTL represented in GTEx. This may be due to multiple factors. The fetal eQTL analysis was generated from brain homogenate, rather than dissected brain regions, though the lower replication likely also reflects broad transcriptional differences between developing and mature brain⁵⁸. These transcriptional differences may also explain why we find substantially more eQTL than a recently published, similarly sized eQTL analysis which uses samples from across developmental and adult timepoints²⁵, and why this meta-analysis shows higher replication of GTEx eQTL.

Previous studies report a lack of widespread disease-specific eQTL observed in schizophrenia (CMC)²⁴ and Alzheimer’s (ROSMAP)³⁵. In accordance, we find a strong overlap among eQTL across disparate disease samples, particularly those with neuropsychiatric and neurodegenerative disorders, as well as normal individuals from these and other cohorts such as GTEx^24,35. This suggests that disease-specific eQTL, if they exist, are likely few in number and/or small in effect size, relative to condition-independent eQTL in general. If they do exist, disease-specific eQTL discovery may be successful in more targeted analyses or with larger sample sizes or meta-analyses, but was not explored for the purpose of this general resource. Thus, the heterogeneous samples derived from different disease-based cohorts can be meta-analyzed to create a general-purpose brain eQTL resource representing adult gene regulation, despite comprising samples with different disease backgrounds, along with normal controls. Therefore, these eQTL will be useful both within and outside these specific disease contexts. For example, since these eQTL are not disease specific they may be used to understand healthy gene expression regulation in the brain, as well as to infer colocalization of eQTL signatures with disease risk for any disease whose tissue etiology is from the brain, since these signatures are reflective of normal brain regulation. It should be stated that while many eQTL are not disease specific, i.e. they are identified under various central nervous system (CNS) disease diagnoses and in control brains, they may still contribute to common CNS diseases as previously demonstrated^{24,32,33,34,45,46}. While we have demonstrated a proof-of-concept colocalization analysis with a previously published schizophrenia GWAS, these eQTL are a broadly useful resource for studying neuropsychiatric and neurodegenerative disorders, as well as for understanding the landscape of gene regulation in brain.

Methods

RNA-seq Re-alignment

For the CMC studies (MSSM-Penn-Pitt, HBCC), RNA-seq reads were aligned to GRCh37 with STAR v2.4.0g1⁵⁹ from the original FASTQ files. Uniquely mapping reads overlapping genes were counted with featureCounts v1.5.2⁶⁰ using annotations from ENSEMBL v75.

For the AMP-AD studies (ROSMAP, Mayo RNAseq), Picard v2.2.4 (https://broadinstitute.github.io/picard/) was used to generate FASTQ files from the available BAM files, using the Picard SamToFastq function. Picard SortSam was first applied to ensure that R1 and R2 reads were correctly ordered in the intermediate SAM file before converting to FASTQ. The converted FASTQs were aligned to the GENCODE24 (GRCh38) reference genome using STAR v2.5.1b, with twopassMode set as Basic. Gene counts were computed for each sample by STAR by setting quantMode as GeneCounts.

RNA-seq normalization

To account for differences between samples, studies, experimental batch effects and unwanted RNA-seq-specific technical variations, we performed library normalization and covariate adjustments for each study separately using fixed/mixed effects modeling. A mixed effect model was required to jointly normalize both tissues from the Mayo cohort. All other cohorts contained only one tissue, so a fixed effect model was used. The workflow consisted of the following steps:

1.
Gene filtering: Out of ~56 K aligned and quantified genes, only genes showing at least modest expression were used in this analysis. Genes that were expressed more than 1 CPM (read Counts Per Million total reads) in at least 50% of samples in each tissue and diagnosis category were retained for analysis. Additionally, genes with available gene length and percentage GC content from BioMart December 2016 archive were subselected from the above list. This resulted in approximately 14 K to 16 K genes in each study.
2.
Calculation of normalized expression values: Sequencing reads were then normalized in two steps. First, conditional quantile normalization (CQN)⁶¹ was applied to account for variations in gene length and GC content. In the second step, the confidence of sampling abundance was estimated using a weighted linear model using the voom-limma package in bioconductor^62,63. The normalized observed read counts, along with the corresponding weights, were used in the following steps.
3.
Outlier detection: Based on normalized log2(CPM) of expression values, outlier samples were detected using principal component analysis (PCA)^64,65 and hierarchical clustering. Samples identified as outliers using both the above methods were removed from further analysis.
4.
Covariate imputation: Before identifying associated covariates, important missing covariates were imputed. Principally, post-mortem interval (PMI), or the latency between death and tissue collection, which is frequently an important covariate for the analysis of gene expression from post-mortem brain tissue, was imputed for a portion of samples in Mayo RNAseq data for which true values were unavailable. Genomic predictors of PMI were estimated using ROSMAP and MSSM (an additional RNA-seq study available through AMP-AD) samples and were used to impute missing values as necessary.
5.
Covariate identification: Normalized log2(CPM) counts were then explored to determine which known covariates (both biological and technical) should be adjusted. Except for the HBCC study, we used a stepwise (weighted) fixed/mixed effect regression modeling approach to select the relevant covariates having a significant association with gene expression. Here, covariates were sequentially added to the model if they were significantly associated with any of the top principal components, explaining more than 1% of variance of expression residuals. For HBCC, we used a model selection based on Bayesian information criteria (BIC) to identify the covariates that improve the model in more than 50% of genes.
6.
SVA adjustments: After identifying the relevant known confounders, hidden-confounders were identified using the Surrogate Variable Analysis (SVA)⁴¹. We used a similar approach as previously defined²⁴ to find the number of surrogate variables (SVs), which is more conservative than the default method provided by the SVA package in R⁶⁶. The basic idea of this approach is that for an eigenvector decomposition of permuted residuals each eigenvalue should explain an equal amount of the variation. By the nature of eigenvalues, however, there will always be at least one that exceeds the expected value. Thus, from a series of 100 permutations of residuals (white noise) we identified the number of covariates as shown in Supplementary Table 1. We applied the “irw” (iterative re-weighting) version of SVA to the normalized gene expression matrix, along with the covariate model described above to obtain residual gene expression for eQTL analysis.
7.
Covariate adjustments: We performed a variant of fixed/mixed effect linear regression, choosing mixed-effect models when multiple tissues or samples, were available per individual, as shown here: gene expression ~ Diagnosis + Sex + covariates + (1|Donor), where each gene was linearly regressed independently. Here Donor (individual) was modeled as a random effect when multiple tissues from the same individual were present. Observation weights (if any) were calculated using the voom-limma^62,63 pipeline, which has a net effect of up-weighting observations with inferred higher precision in the linear model fitting process to adjust for the mean-variance relationship in RNA-seq data. The diagnosis component was then added back to the residuals to generate covariate-adjusted expression for eQTL analysis.

This workflow was applied separately for each study. For the AMP-AD studies, gene locations were lifted over to GRCh37 for comparison with the genotype imputation panel (described below). For HBCC, samples with age <18 were excluded prior to analysis. Supplementary Table 1 shows the covariates and surrogate variables identified in each study.

AD diagnosis harmonization

Prior to RNA-seq normalization, we harmonized the LOAD definition across AMP-AD studies. AD controls were defined as patients with a low burden of plaques and tangles, as well as lack of evidence of cognitive impairment. For the ROSMAP study, we defined AD cases to be individuals with a Braak⁶⁷ greater than or equal to 4, CERAD score⁶⁸ less than or equal to 2, and a cognitive diagnosis of probable AD with no other causes (cogdx = 4), and controls to be individuals with Braak less than or equal to 3, CERAD score greater than or equal to 3, and cognitive diagnosis of ‘no cognitive impairment’ (cogdx = 1). For the Mayo Clinic study, we defined disease status based on neuropathology, where individuals with Braak score greater than or equal to 4 were defined to be AD cases, and individuals with Braak less than or equal to 3 were defined to be controls. Individuals not meeting “AD case” or “control” criteria were retained for analysis, and were categorized as “other” for the purposes of RNA-seq adjustment.

Genotype QC and imputation

Genotype QC was performed using PLINK v1.9⁶⁹. Markers with zero alternate alleles, genotyping call rate ≤ 0.98, Hardy-Weinberg p-value < 5e−5 were removed, as well as individuals with genotyping call rate < 0.90. Samples were then imputed to HRC (Version r1.1 2016)⁴² as follows: marker positions were lifted-over to GRCh37, if necessary. Markers were then aligned to the HRC loci using HRC-1000G-check-bim-v4.2 (http://www.well.ox.ac.uk/~wrayner/tools/), which checks the strand, alleles, position, reference/alternate allele assignments and frequencies of the markers, removing A/T & G/C single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) > 0.4, SNPs with differing alleles, SNPs with > 0.2 allele frequency difference between the genotyped samples and the HRC samples, and SNPs not in the reference panel. Imputation was performed via the Michigan Imputation Server⁷⁰ using the Eagle v2.3⁷¹ phasing algorithm. Imputation was done separately by cohort and by chip within cohort, and markers with R² ≥ 0.7 and minor allele frequency (MAF) ≥ 0.01 (within cohort) were retained for analysis.

Genetic ancestry inference

GEMTOOLs⁷² was used to infer ancestry and compute ancestry components separately by cohort. The GEMTOOLs algorithm uses a significance test to estimate the number of eigenvectors (ancestry components) necessary to represent the variability in the data⁷³. For each cohort, we used the top components as estimated by the GEMTOOLs algorithm, which resulted in some variation in the number of components selected. For MSSM-Penn-Pitt and HBCC, which are multi-ethnic cohorts, only Caucasian samples were retained for eQTL analysis.

eQTL analysis

eQTL were generated separately in each cohort and tissue using MatrixEQTL⁷⁴ adjusting for harmonized Diagnosis and inferred Ancestry components using “cis” gene-marker comparisons: Expression ~ Genotype + Diagnosis + PC₁ + … + PC_n,, where PC_k is the k^th ancestry component, using Expression variables which were previously covariate adjusted as described above. Here we define “cis” as ± 1 MB around the gene, and GRCh37 gene locations were used for consistency with the marker imputation panel. Meta-analysis was performed via fixed-effect model⁷⁵ using an adaptation of the metareg function in the gap package in R. In order to assess potential inflation of Type 1 error, we performed 5 permutations of the gene expression values, relative to genotype and ancestry components, within diagnosis for each cohort, and repeated the regression analyses as described above. For each of the 5 iterations of permutation, a meta-analysis was then performed across the 4 cohorts. We found that Type 1 error was well controlled (Fig. 1a). Given that multiple tissues were present, we also evaluated a random-effect model, but found substantially deflated p-values (less significant) in the permutations, relative to the expected distribution, suggesting that this model is over-conservative.

Comparison with GTEx and fetal eQTL

Full summary statistics for the GTEx v7²¹ eQTL for all available brain regions were obtained from the GTEx Portal (https://gtexportal.org/), and fetal eQTL were obtained from Figshare⁷⁶. For each replication comparison (e.g. meta-analysis vs GTEx or meta-analysis vs. fetal eQTL), only markers and genes present in both the external eQTL and our analysis were retained for comparison. As this was done separately for GTEx and for the fetal eQTL resource, the list of genes and SNPs varies slightly for each comparison. The replication rate was estimated as the π₁ statistic using the qvalue package⁷⁷ in R as follows: we extracted the meta-analysis p-values for all SNP-gene pairs, which were significant in GTEx at FDR ≤ 0.05. We then applied the ‘qvalue’ command to the meta-analysis p-values to generate \({\widehat{\pi }}_{1}=1-{\widehat{\pi }}_{0}\), which corresponds to estimated proportion of non-null p-values⁷⁷. The ‘smoother’ option was used to estimate \({\widehat{\pi }}_{0}\) as a function of the tuning parameter λ as it approaches 1. The variance around this estimate is relatively small (see Supplementary Figures 1 and 2 for example) and does not materially affect the observations in this manuscript.

Conversely, we estimated the replication rate of significant meta-analysis eQTL SNP-gene pairs in GTEx. Analogous methods were used to estimate all other replication rates. For the purposes of reporting the total number of eQTL not present in GTEx, and genes without eQTL in GTEx (Table 1), we have included genes and SNP-gene pairs not present in GTEx in the count, however this accounts for a relatively small proportion of the difference (472,995 eQTL and 1481 genes).

GTEx eQTL generation

In order to verify that the observed power increase and replication imbalances were not due to methodological differences between this manuscript and those performed by GTEx, we obtained access to the GTEx v7 data, and generated eQTL for cortex, anterior cingulate cortex, and frontal cortex using our approach. We used gene expression and imputed genotypes as provided, as well as the provided covariates, which included 3 ancestry covariates, 14-15 surrogate variable covariates, sex and platform. We then repeated the comparisons with the meta-analysis described in the previous section, using a MAF cutoff of 0.03, which best appeared to control Type 1 error, as observed by permutation between genotype and gene expression, while maximizing the number of significant eQTL in the true data. Results did not change materially.

Pathway analysis of cerebellar eQTL genes

In order to identify whether genes showing cerebellar-specific eQTL patterns showed any biological coherence, we performed a pathway analysis as follows. For genes with at least 5 significant cerebellar eQTL, we computed the Spearman correlation of effect-size between cerebellum eQTL and cortical eQTL for the loci that were significant in cerebellum. We then selected genes for which the effect-sizes did not show positive correlation (Spearman’s ρ < 0.1) between the two tissues as showing different eQTL association patterns across the gene and performed a pathway analysis with GO biological processes Fisher’s exact test. Note that due to the (power-mediated) greater detection of eQTL in cortex, we did not perform the reverse comparison. The results were relatively robust to the choice of minimum number of significant eQTL, correlation cutoff, and choice of correlation statistic (Spearman vs Pearson).

Coloc analysis

We applied Approximate Bayes Factor colocalization (coloc.abf)⁷ from the coloc R package to the summary statistics from the PGC2 Schizophrenia GWAS³⁶ downloaded from the PGC website (http://pgc.unc.edu), and the summary statistics from the eQTL meta-analysis. Each gene present in the meta-analysis was compared to the GWAS in turn, and suggestive and significant GWAS peaks with p-value < 5e-6 were considered for analysis.

Data availability

Data for the ROSMAP³⁷ and Mayo cohorts⁴⁰ are available through the AMP-AD Knowledge Portal³¹. Data for the MSSM-Penn-Pitt and HBCC cohorts are available through the CommonMind Knowledge Portal²³.

eQTL results for the ROSMAP⁷⁸, Mayo TCX⁷⁹, Mayo CER⁸⁰ and cortical meta-analysis⁸¹ are available through the AMP-AD Knowledge Portal. These results include SNP (location, rsid, alleles, and allele frequency) and gene (location, gene symbol, strand and biotype) information, as well as estimated effect size (beta), statistic (z), p-value, FDR, and expression-increasing allele.

Code availability

An R package with all code for the gene expression normalization is available at https://github.com/Sage-Bionetworks/ampad-diffexp. All other analyses were generated using packages publicly available from their respective authors.

References

Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481–487 (2016).
Article CAS PubMed Google Scholar
Nica, A. C. et al. Candidate Causal Regulatory Effects by Integration of Expression QTLs with Complex Trait Genetic Associations. PLoS Genet 6, e1000895 (2010).
Article PubMed PubMed Central CAS Google Scholar
Ongen, H. et al. Estimating the causal tissues for complex traits and diseases. Nat. Genet. 49, 1676–1683 (2017).
Article CAS PubMed Google Scholar
Chun, S. et al. Limited statistical evidence for shared genetic effects of eQTLs and autoimmune-disease-associated loci in three major immune-cell types. Nat. Genet. 49, 600–605 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hormozdiari, F., Kostem, E., Kang, E. Y., Pasaniuc, B. & Eskin, E. Identifying Causal Variants at Loci with Multiple Signals of Association. Genetics 198, 497–508 (2014).
Article CAS PubMed PubMed Central Google Scholar
He, X. et al. Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS. Am. J. Hum. Genet. 92, 667–680 (2013).
Article CAS Google Scholar
Giambartolomei, C. et al. Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics. PLoS Genet 10, e1004383 (2014).
Article PubMed PubMed Central CAS Google Scholar
Gamazon, E. R. et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098 (2015).
Article CAS PubMed PubMed Central Google Scholar
Barbeira, A. N. et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat. Commun. 9, 1825 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhu, J. et al. An integrative genomics approach to the reconstruction of gene networks in segregating populations. Cytogenet. Genome Res. 105, 363–74 (2004).
Article CAS PubMed Google Scholar
Schadt, E. et al. Mapping the Genetic Architecture of Gene Expression in Human Liver. PLoS Biol 6, e107 (2008).
Article PubMed PubMed Central CAS Google Scholar
Zhu, J. et al. Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks. Nat. Genet. 40, 854–61 (2008).
Article CAS PubMed PubMed Central Google Scholar
Greenawalt, D. M. et al. A survey of the genetics of stomach, liver, and adipose gene expression from a morbidly obese cohort. Genome Res. 21, (2011).
Zhang, B. et al. Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer’s disease. Cell 153, 707–20 (2013).
Article CAS PubMed PubMed Central Google Scholar
Franzén, O. et al. Cardiometabolic risk loci share downstream cis- and trans-gene regulation across tissues and diseases. Science 353, 827–30 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Peters, L. A. et al. A functional genomics predictive network model identifies regulators of inflammatory bowel disease. Nat. Genet. 49, 1437–1449 (2017).
Article CAS PubMed PubMed Central Google Scholar
Battle, A. et al. Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals. Genome Res 24, 14–24 (2014).
Article CAS PubMed PubMed Central Google Scholar
Westra, H.-J. et al. Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet. 45, 1238–1243 (2013).
Article CAS PubMed PubMed Central Google Scholar
Võsa, U. et al. Unraveling the polygenic architecture of complex traits using blood eQTL meta-analysis. bioRxiv Preprint at, http://biorxiv.org/content/early/2018/10/19/447367.abstract (2018).
Qi, T. et al. Identifying gene targets for brain-related traits using transcriptomic and methylomic data from blood. Nat. Commun. 9, 2282 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Aguet, F. et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Article ADS Google Scholar
Ardlie, K. G. et al. The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans. Science (80-.) 348, 648–660 (2015).
Article ADS CAS Google Scholar
CommonMind Consortium Data Release Portal. Synapse https://doi.org/10.7303/syn2759792 (2015).
Fromer, M. et al. Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nature Neuroscience 19, 1442–1453 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wang, D. et al. Comprehensive functional genomic resource and integrative model for the human brain. Science (80-.). 362, eaat8464 (2018).
Article ADS CAS Google Scholar
Hoffman, G. E. et al. CommonMind Consortium provides transcriptomic and epigenomic data for Schizophrenia and Bipolar Disorder. Sci. Data 6, 180 (2019).
Article PubMed PubMed Central CAS Google Scholar
Chibnik, L. B. et al. Susceptibility to neurofibrillary tangles: role of the PTPRD locus and limited pleiotropy with other neuropathologies. Mol. Psychiatry 23, 1521 (2017).
Article PubMed PubMed Central CAS Google Scholar
Mostafavi, S. et al. A molecular network of the aging human brain provides insights into the pathology and cognitive decline of Alzheimer’s disease. Nat. Neurosci. 21, 811–819 (2018).
Article CAS PubMed PubMed Central Google Scholar
Allen, M. et al. Human whole genome genotype and transcriptome data for Alzheimer’s and other neurodegenerative diseases. Sci. Data 3, 160089 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wan, Y.W. et al. Meta-Analysis of the Alzheimer’s Disease Human Brain Transcriptome and Functional Dissection in Mouse Models. Cell Rep. 32(2), 107908 (2020).
AMP AD Target Discovery Data Portal. Synapse https://doi.org/10.7303/syn2580853 (2015).
Allen, M. et al. Novel late-onset Alzheimer disease loci variants associate with brain gene expression. Neurology 79, 221–8 (2012).
Article CAS PubMed PubMed Central Google Scholar
Zou, F. et al. Brain expression genome-wide association study (eGWAS) identifies human disease-associated variants. PLoS Genet 8, e1002707 (2012).
Article CAS PubMed PubMed Central Google Scholar
Allen, M. et al. Late-onset Alzheimer disease risk variants mark brain regulatory loci. Neurol. Genet. 1, e15 (2015).
Article PubMed PubMed Central CAS Google Scholar
Ng, B. et al. An xQTL map integrates the genetic architecture of the human brain’s transcriptome and epigenome. Nat. Neurosci. 20, 1418–1426 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
Article ADS CAS PubMed Central Google Scholar
ROSMAP Study. Synapse https://doi.org/10.7303/syn3219045 (2016).
Allen, M. et al. Conserved brain myelination networks are altered in Alzheimer’s and other neurodegenerative diseases. Alzheimers. Dement. 14, 352–366 (2018).
Article PubMed Google Scholar
Allen, M. et al. Divergent brain gene expression patterns associate with distinct cell-specific tau neuropathology traits in progressive supranuclear palsy. Acta Neuropathol 136, 709–727 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mayo RNAseq Study. Synapse https://doi.org/10.7303/syn5550404 (2016).
Leek, J. T. & Storey, J. D. Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis. Plos Genet 3, e161 (2007).
Article PubMed Central CAS Google Scholar
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–83 (2016).
Article CAS PubMed PubMed Central Google Scholar
O’Brien, H. E. et al. Expression quantitative trait loci in the developing human brain and their enrichment in neuropsychiatric disorders. Genome Biol 19, 194 (2018).
Article PubMed PubMed Central CAS Google Scholar
Hou, Y. et al. Schizophrenia-associated rs4702 G allele-specific downregulation of FURIN expression by miR-338-3p reduces BDNF production. Schizophr. Res. 199, 176–180 (2018).
Article PubMed Google Scholar
Dobbyn, A. et al. Landscape of Conditional eQTL in Dorsolateral Prefrontal Cortex and Co-localization with Schizophrenia GWAS. Am. J. Hum. Genet. 102, 1169–1184 (2018).
Article CAS Google Scholar
Pardiñas, A. F. et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat. Genet. 50, 381–389 (2018).
Article PubMed PubMed Central CAS Google Scholar
Liu, Y. et al. Non-coding RNA dysregulation in the amygdala region of schizophrenia patients contributes to the pathogenesis of the disease. Transl. Psychiatry 8, 44 (2018).
Article PubMed PubMed Central CAS Google Scholar
Lu, A. T. et al. Genetic architecture of epigenetic and neuronal ageing rates in human brain regions. Nat. Commun. 8, 15353 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Davies, M. N. et al. Functional annotation of the human brain methylome identifies tissue-specific epigenetic variation across brain and blood. Genome Biol 13, R43 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hannon, E., Lunnon, K., Schalkwyk, L. & Mill, J. Interindividual methylomic variation across blood, cortex, and cerebellum: implications for epigenetic studies of neurological and neuropsychiatric phenotypes. Epigenetics 10, 1024–1032 (2015).
Article PubMed PubMed Central Google Scholar
Guintivano, J., Aryee, M. J. & Kaminsky, Z. A. A cell epigenotype specific model for the correction of brain cellular heterogeneity bias and its application to age, brain region and major depression. Epigenetics 8, 290–302 (2013).
Article CAS PubMed PubMed Central Google Scholar
Negi, S. K. & Guda, C. Global gene expression profiling of healthy human brain and its application in studying neurological disorders. Sci. Rep. 7, 897 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Azevedo, F. A. C. et al. Equal numbers of neuronal and nonneuronal cells make the human brain an isometrically scaled-up primate brain. J. Comp. Neurol. 513, 532–541 (2009).
Article PubMed Google Scholar
Ma, S., Hsieh, Y.-P., Ma, J. & Lu, C. Low-input and multiplexed microfluidic assay reveals epigenomic variation across cerebellum and prefrontal cortex. Sci. Adv. 4, eaar8187 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Westra, H.-J. et al. Cell Specific eQTL Analysis without Sorting Cells. Plos Genet 11, e1005223 (2015).
Article PubMed PubMed Central CAS Google Scholar
van der Wijst, M. G. P. et al. Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs. Nat. Genet. 50, 493–497 (2018).
Article PubMed PubMed Central CAS Google Scholar
Wang, J., Devlin, B. & Roeder, K. Using multiple measurements of tissue to estimate individual- and cell-type-specific gene expression via deconvolution. bioRxiv 379099, https://doi.org/10.1101/379099 (2018).
Li, M. et al. Integrative functional genomic analysis of human brain development and neuropsychiatric risks. Science (80-.). 362, eaat7615 (2018).
Article ADS CAS Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS PubMed Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–30 (2014).
Article CAS PubMed Google Scholar
Hansen, K. D., Irizarry, R. A. & WU, Z. Removing technical variability in RNA-seq data using conditional quantile normalization. Biostatistics 13, 204–216 (2012).
Article PubMed PubMed Central MATH Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43, e47–e47 (2015).
Article PubMed PubMed Central CAS Google Scholar
Law, C. W., Chen, Y., Shi, W. & Smyth, G. K. Voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014).
Article PubMed PubMed Central CAS Google Scholar
Pearson, K. LIII. On lines and planes of closest fit to systems of points in space. London, Edinburgh, Dublin Philos. Mag. J. Sci 2, 559–572 (1901).
Article MATH Google Scholar
Hotelling, H. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 24, 417–441 (1933).
Article MATH Google Scholar
Leek, J. T. et al. Bioconductor - sva., https://doi.org/10.18129/B9.bioc.sva (2018).
Braak, H. & Braak, E. Neuropathological stageing of Alzheimer-related changes. Acta Neuropathol. 82, 239–259 (1991).
Article CAS PubMed Google Scholar
Chandler, M. J. et al. A total score for the CERAD neuropsychological battery. Neurology 65, 102–106 (2005).
Article CAS PubMed Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
Article PubMed PubMed Central CAS Google Scholar
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Loh, P.-R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443 (2016).
Article CAS PubMed PubMed Central Google Scholar
Klei, L., Kent, B. P., Melhem, N., Devlin, B. & Roeder, K. GemTools: A fast and efficient approach to estimating genetic ancestry. (2011).
Lee, A. B., Luca, D., Klei, L., Devlin, B. & Roeder, K. Discovering genetic ancestry using spectral graph theory. Genet. Epidemiol. 34, 51–59 (2010).
Article CAS PubMed PubMed Central Google Scholar
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358 (2012).
Article CAS PubMed PubMed Central Google Scholar
Begum, F., Ghosh, D., Tseng, G. C. & Feingold, E. Comprehensive literature review and statistical considerations for GWAS meta-analysis. Nucleic Acids Res 40, 3777–3784 (2012).
Article CAS PubMed PubMed Central Google Scholar
O’Brien, H. & Bray, N. J. Summary statistics for expression quantitative trait loci in the developing human brain and their enrichment in neuropsychiatric disorders. Figshare https://doi.org/10.6084/m9.figshare.6881825.v1 (2018).
Storey, J.D., Bass, A.J., Dabney, A. & Robinson, D. qvalue: Q-value estimation for false discovery rate control http://github.com/jdstorey/qvalue (2020).
Sieberts, S. ROSMAP DLPFC eQTL. Synapse https://doi.org/10.7303/syn16984409.1 (2019).
Sieberts, S. Mayo Temporal Cortex eQTL. Synapse https://doi.org/10.7303/syn16984410.1 (2019).
Sieberts, S. Mayo Cerebellum eQTL. Synapse https://doi.org/10.7303/syn16984411.1 (2019).
Sieberts, S. Cortical eQTL Meta-analysis. Synapse https://doi.org/10.7303/syn16984815.1 (2019).

Download references

Acknowledgements

For the ROSMAP and Mayo RNAseq studies, the results published here are in whole or in part based on data obtained from the AMP-AD Knowledge Portal (doi:10.7303/syn2580853). ROSMAP study data were provided by the Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago. Data collection was supported through funding by NIA grants P30AG10161, R01AG15819, R01AG17917, R01AG30146, R01AG36836, U01AG32984, U01AG46152, the Illinois Department of Public Health, and the Translational Genomics Research Institute. Mayo RNA-seq study data were provided by the following sources: The Mayo Clinic Alzheimers Disease Genetic Studies, led by Dr. Nilufer Ertekin-Taner and Dr. Steven G. Younkin, Mayo Clinic, Jacksonville, FL using samples from the Mayo Clinic Study of Aging, the Mayo Clinic Alzheimer’s Disease Research Center, and the Mayo Clinic Brain Bank. Data collection was supported through funding by NIA grants P50 AG016574, R01 AG032990, U01 AG046139, R01 AG018023, U01 AG006576, U01 AG006786, R01 AG025711, R01 AG017216, R01 AG003949, NINDS grant R01 NS080820, CurePSP Foundation, and support from Mayo Foundation. Study data includes samples collected through the Sun Health Research Institute Brain and Body Donation Program of Sun City, Arizona. The Brain and Body Donation Program is supported by the National Institute of Neurological Disorders and Stroke (U24 NS072026 National Brain and Tissue Resource for Parkinsons Disease and Related Disorders), the National Institute on Aging (P30 AG19610 Arizona Alzheimers Disease Core Center), the Arizona Department of Health Services (contract 211002, Arizona Alzheimers Research Center), the Arizona Biomedical Research Commission (contracts 4001, 0011, 05-901 and 1001 to the Arizona Parkinson’s Disease Consortium) and the Michael J. Fox Foundation for Parkinsons Research. This study was in part supported by NIH RF1 AG051504 and R01 AG061796 (NET). For CommonMind, data were generated as part of the CommonMind Consortium supported by funding from Takeda Pharmaceuticals Company Limited, F. Hoffmann-La Roche Ltd and NIH grants R01MH085542, R01MH093725, P50MH066392, P50MH080405, R01MH097276, RO1-MH-075916, P50M096891, P50MH084053S1, R37MH057881, AG02219, AG05138, MH06692, R01MH110921, R01MH109677, R01MH109897, U01MH103392, and contract HHSN271201300031C through IRP NIMH. Brain tissue for the study was obtained from the following brain bank collections: the Mount Sinai NIH Brain and Tissue Repository, the University of Pennsylvania Alzheimer’s Disease Core Center, the University of Pittsburgh NeuroBioBank and Brain and Tissue Repositories, and the NIMH Human Brain Collection Core. CMC Leadership: Panos Roussos, Joseph Buxbaum, Andrew Chess, Schahram Akbarian, Vahram Haroutunian (Icahn School of Medicine at Mount Sinai), Bernie Devlin, David Lewis (University of Pittsburgh), Raquel Gur, Chang-Gyu Hahn (University of Pennsylvania), Enrico Domenici (University of Trento), Mette A. Peters, Solveig Sieberts (Sage Bionetworks), Thomas Lehner, Geetha Senthil, Stefano Marenco, Barbara K. Lipska (NIMH). SKS, TP, KKD, JE, AKG, LO, BAL, and LMM were additionally supported by NIA grants U24 AG61340, U01 AG46170, U01 AG 46161, R01 AG46171, R01 AG 46174. All data used in this manuscript have been previously released through their respective consortia and have been reviewed by IRBs at their institution of origin. Informed consent has been obtained from all individuals.

Author information

Authors and Affiliations

Sage Bionetworks, Seattle, WA, 98121, USA
Solveig K. Sieberts, Thanneer M. Perumal, Kristen K. Dang, James Eddy, Anna K. Greenwood, Kelsey Montgomery, Yooree Chae, Kelsey Montgomery, Sumit Mukherjee, Phil Snyder, Kara H. Woo, Larsson Omberg, Mette A. Peters, Benjamin A. Logsdon & Lara M. Mangravite
Department of Neuroscience, Mayo Clinic Florida, Jacksonville, FL, 32224, USA
Minerva M. Carrasquillo, Mariet Allen, Joseph S. Reddy, Xue Wang, Steven G. Younkin & Nilüfer Ertekin-Taner
Pamela Sklar Division of Psychiatric Genomics, Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Gabriel E. Hoffman
Icahn Institute for Data Science and Genomic Technology, Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Gabriel E. Hoffman
Lilly Research Labs, Eli Lilly and Company, Indianapolis, IN, 46225, USA
John Calley, Philip J. Ebert, Laura Addis, David Charles Airey, Anna Cavalla, David Andrew Collier, Jeffrey L. Dage, Brian J. Eastwood, Basavaraj Hooli, Neil Humphryes-Kirilov, Corey James, Pascale N. Lacor, Emma Laing, Yupeng Li, Yushi Liu, Bradley Miller, Neil Pearson, Cara Lee Ann Ruble, James E. Scherschel, Yaming Wang, Hong Wang & Wenling Zhang
Departments of Statistics and Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
Sara Mostafavi
Centre for Molecular Medicine and Therapeutics, Vancouver, British Columbia, Canada
Sara Mostafavi
Canadian Institute for Advanced Research, CIFAR Program in Child and Brain Development, Toronto, Ontario, Canada
Sara Mostafavi
Center for Translational & Computational Neuroimmunology, Department of Neurology, Columbia University Medical Center, New York, NY, 10032, USA
Elizabeth Bradhsaw, Daniel Felsky, Hans-Ulrich Klein, Yiyi Ma, Vilas Menon, Mariko Taga & Philip L. De Jager
Department of Neurology, Mayo Clinic Florida, Jacksonville, FL, 32224, USA
Nilüfer Ertekin-Taner
Division of Psychiatric Epigenomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Schahram Akbarian, Leanne Brown, Lizette Couto, Olivia Devillers, Elie Flatow, Sergio Espeso Gil, Rivky Jacobov, Yan Jiang, Bibi Kassim, Royce Park, Panagiotis Roussos, Jennifer Wiseman & Elizabeth Zharovsky
Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Jaroslav Bendl, Kristen Brennand, Andrew Browne, Joseph D. Buxbaum, Alexander Charney, Amanda Dobbyn, John Fullard, Kiran Girdhar, Vahram Haroutunian, Mads Engel Hauberg, Laura Huckins, Jessica S. Johnson, Yungil Kim, Laura Sloofman, Eli Stahl, Wen Zhang & Vahram Haroutunian
Seaver Autism for Research and Treatment, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Michael S. Breen & Joseph D. Buxbaum
Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Andrew Browne, Joseph D. Buxbaum, Panagiotis Roussos & Hardik R. Shah
Institute for Genomics and Multiscale Biology, Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Andrew Chess, Panagiotis Roussos & Ying-Chih Wang
Department of Cell, Developmental and Regenerative Biology, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Andrew Chess, Attila Gulyás-Kovács & Chaggai Rosenbluh
Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Andrew Chess, Vahram Haroutunian & Vahram Haroutunian
Center for Genomic & Computational Biology, Duke University, Durham, North Carolina, USA
Greg Crawford & Lingyun Song
Division of Medical Genetics, Department of Pediatrics, Duke University, Durham, North Carolina, USA
Greg Crawford
Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
Bernie Devlin, Lambertus Klei & David A. Lewis
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA
Amanda Dobbyn, Nancy Francoeur, John Fullard, Eva Xia, Michelle Ehrlich, Eric Schadt, Minghui Wang, Bin Zhang & Jun Zhu
Laboratory of Neurogenomic Biomarkers, Centre for Integrative Biology (CIBIO), University of Trento, Trento, Italy
Enrico Domenici, Michele Filosi & Roberto Visintainer
Neuropsychiatry Section, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Raquel Gur
Neuropsychiatric Signaling Program, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
Chang-Gyu Hahn
Department of Biomedicine, Aarhus University, Aarhus, Denmark
Mads Engel Hauberg
Human Brain Collection Core, National Institutes of Health, NIMH, Bethesda, Maryland, USA
Robin Kramer & Barbara K. Lipska
Department of Mathematics, University of Trento, Trento, Italy
Mario Lauria
National Institute of Mental Health, Bethesda, Maryland, USA
Thomas Lehner & Geetha Senthil
Psychiatry, JJ Peters VA Medical Center, Bronx, New York, USA
Panagiotis Roussos
Department of Medicine, Psychiatry and Biomedical Informatics, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
Douglas M. Ruderfer
Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Patrick Sullivan
Department of Biostatistics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Jiebiao Wang
AbbVie Inc, Chicago, IL, USA
Sadiya N. Addo, Yingtao Bi, Knut Biber, Sherry Cao, Jie Cheng, Justin Wade Davis, Benjamin Ellingson, Brett W. Engelmann, Sahar Esmaeelinieh, Shaun E. Grosskurth, Rishi R. Gupta, Paul M. Jung, Markus Kummer, Samantha Lipsky, Thomas P. Misko, Jennifer E. Mollon, Danjuma X. Quarless, Brinda Ravikumar, Janina S. Ried, Holly Soares, Gyan P. Srivastava, Henning Stockmann, Aparna Vasanthakumar, Astrid Wachter, Jessica W. Wu & Hualin S. Xi
Helmholtz Zentrum München, Neuherberg, Germany
Matthias Arnold & Gabi Kastenmuller
Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago, IL, USA
David A. Bennett, Chris Gaiteri, Shinya Tasaki & Lei Yu
Duke University, Durham, NC, USA
Colette Blach, Rima Kaddurah-Daouk, Gregory Louie & Jessie Tenenbaum
Nuffield Department of Medicine, Oxford University, Headington, UK
Paul Brennan, John Davis & Opher Gileadi
Foundation for the National Institutes of Health (FNIH), Bethesda, MD, USA
Rosa Canet-Aviles
Biogen Inc, Cambridge, MA, USA
William W. Chen, Jimmy Liu, Michelle Penny, Heiko Runz, Christopher D. Whelan, Maria Zavodszky & Guoqiang Zhang
Emory University, Atlanta, GA, USA
Eric B. Dammer, Duc Duong, James Lah, Allan Levey & Nicholas Seyfried
Department of Genetics, Harvard Medical School, Boston, MA, USA
Derek Drake, Tao Lu & Bruce A. Yankner
Institute for Systems Biology, Seattle, WA, USA
Cory Funk & Nathan Price
Neurology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Samuel Gandy
Picower Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
Fan Gao, Ping-Chieh Pao & Li-Huei Tsai
Department of Neuroscience, University of Florida, Gainesville, FL, USA
Todd Golde
GlaxoSmithKline, Brentford, UK
Alex X. Gutteridge, Yasuji Y. Matsuoka & Paul Wren
National Center for Geriatrics and Gerontology, Nagoya, Japan
Koichi Iijima
Baylor College of Medicine, Houston, TX, USA
Zhandong Liu & Joshua M. Shulman
New York Stem Cell Foundation, New York, NY, USA
Scott Noggle
Brigham & Women’s Hospital, Boston, MA, USA
Tracy Young Pearce
Pacific Northwest National Laboratory, Richland, WA, USA
Vladislav A. Petyuk
Indiana University School of Medicine, Indianapolis, IN, USA
Andrew J. Saykin
Broad Institute, Cambridge, MA, USA
Charles White

Authors

Solveig K. Sieberts
View author publications
You can also search for this author in PubMed Google Scholar
Thanneer M. Perumal
View author publications
You can also search for this author in PubMed Google Scholar
Minerva M. Carrasquillo
View author publications
You can also search for this author in PubMed Google Scholar
Mariet Allen
View author publications
You can also search for this author in PubMed Google Scholar
Joseph S. Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel E. Hoffman
View author publications
You can also search for this author in PubMed Google Scholar
Kristen K. Dang
View author publications
You can also search for this author in PubMed Google Scholar
John Calley
View author publications
You can also search for this author in PubMed Google Scholar
Philip J. Ebert
View author publications
You can also search for this author in PubMed Google Scholar
James Eddy
View author publications
You can also search for this author in PubMed Google Scholar
Xue Wang
View author publications
You can also search for this author in PubMed Google Scholar
Anna K. Greenwood
View author publications
You can also search for this author in PubMed Google Scholar
Sara Mostafavi
View author publications
You can also search for this author in PubMed Google Scholar
Larsson Omberg
View author publications
You can also search for this author in PubMed Google Scholar
Mette A. Peters
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin A. Logsdon
View author publications
You can also search for this author in PubMed Google Scholar
Philip L. De Jager
View author publications
You can also search for this author in PubMed Google Scholar
Nilüfer Ertekin-Taner
View author publications
You can also search for this author in PubMed Google Scholar
Lara M. Mangravite
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

The CommonMind Consortium (CMC)

Schahram Akbarian
, Jaroslav Bendl
, Michael S. Breen
, Kristen Brennand
, Leanne Brown
, Andrew Browne
, Joseph D. Buxbaum
, Alexander Charney
, Andrew Chess
, Lizette Couto
, Greg Crawford
, Olivia Devillers
, Bernie Devlin
, Amanda Dobbyn
, Enrico Domenici
, Michele Filosi
, Elie Flatow
, Nancy Francoeur
, John Fullard
, Sergio Espeso Gil
, Kiran Girdhar
, Attila Gulyás-Kovács
, Raquel Gur
, Chang-Gyu Hahn
, Vahram Haroutunian
, Mads Engel Hauberg
, Laura Huckins
, Rivky Jacobov
, Yan Jiang
, Jessica S. Johnson
, Bibi Kassim
, Yungil Kim
, Lambertus Klei
, Robin Kramer
, Mario Lauria
, Thomas Lehner
, David A. Lewis
, Barbara K. Lipska
, Kelsey Montgomery
, Royce Park
, Chaggai Rosenbluh
, Panagiotis Roussos
, Douglas M. Ruderfer
, Geetha Senthil
, Hardik R. Shah
, Laura Sloofman
, Lingyun Song
, Eli Stahl
, Patrick Sullivan
, Roberto Visintainer
, Jiebiao Wang
, Ying-Chih Wang
, Jennifer Wiseman
, Eva Xia
, Wen Zhang
& Elizabeth Zharovsky

The AMP-AD Consortium

Laura Addis
, Sadiya N. Addo
, David Charles Airey
, Matthias Arnold
, David A. Bennett
, Yingtao Bi
, Knut Biber
, Colette Blach
, Elizabeth Bradhsaw
, Paul Brennan
, Rosa Canet-Aviles
, Sherry Cao
, Anna Cavalla
, Yooree Chae
, William W. Chen
, Jie Cheng
, David Andrew Collier
, Jeffrey L. Dage
, Eric B. Dammer
, Justin Wade Davis
, John Davis
, Derek Drake
, Duc Duong
, Brian J. Eastwood
, Michelle Ehrlich
, Benjamin Ellingson
, Brett W. Engelmann
, Sahar Esmaeelinieh
, Daniel Felsky
, Cory Funk
, Chris Gaiteri
, Samuel Gandy
, Fan Gao
, Opher Gileadi
, Todd Golde
, Shaun E. Grosskurth
, Rishi R. Gupta
, Alex X. Gutteridge
, Vahram Haroutunian
, Basavaraj Hooli
, Neil Humphryes-Kirilov
, Koichi Iijima
, Corey James
, Paul M. Jung
, Rima Kaddurah-Daouk
, Gabi Kastenmuller
, Hans-Ulrich Klein
, Markus Kummer
, Pascale N. Lacor
, James Lah
, Emma Laing
, Allan Levey
, Yupeng Li
, Samantha Lipsky
, Yushi Liu
, Jimmy Liu
, Zhandong Liu
, Gregory Louie
, Tao Lu
, Yiyi Ma
, Yasuji Y. Matsuoka
, Vilas Menon
, Bradley Miller
, Thomas P. Misko
, Jennifer E. Mollon
, Kelsey Montgomery
, Sumit Mukherjee
, Scott Noggle
, Ping-Chieh Pao
, Tracy Young Pearce
, Neil Pearson
, Michelle Penny
, Vladislav A. Petyuk
, Nathan Price
, Danjuma X. Quarless
, Brinda Ravikumar
, Janina S. Ried
, Cara Lee Ann Ruble
, Heiko Runz
, Andrew J. Saykin
, Eric Schadt
, James E. Scherschel
, Nicholas Seyfried
, Joshua M. Shulman
, Phil Snyder
, Holly Soares
, Gyan P. Srivastava
, Henning Stockmann
, Mariko Taga
, Shinya Tasaki
, Jessie Tenenbaum
, Li-Huei Tsai
, Aparna Vasanthakumar
, Astrid Wachter
, Yaming Wang
, Hong Wang
, Minghui Wang
, Christopher D. Whelan
, Charles White
, Kara H. Woo
, Paul Wren
, Jessica W. Wu
, Hualin S. Xi
, Bruce A. Yankner
, Steven G. Younkin
, Lei Yu
, Maria Zavodszky
, Wenling Zhang
, Guoqiang Zhang
, Bin Zhang
& Jun Zhu

Contributions

S.K.S., S.M., M.P., P.L.D.J., N.E.T. and L.M.M., the AMP-AD Consortium, and the CMC Consortium contributed to the design and generation of the study data. S.K.S., T.P., M.C., M.A., J.S.R., G.H., K.D.D., J.C., P.J.E. and J.E. contributed the data analysis. S.K.S., T.P., G.H., A.K.G., L.O., M.P., B.A.L., L.M.M. contributed to the manuscript preparation and data sharing.

Corresponding authors

Correspondence to Solveig K. Sieberts or Lara M. Mangravite.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Supplementary Table 4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sieberts, S.K., Perumal, T.M., Carrasquillo, M.M. et al. Large eQTL meta-analysis reveals differing patterns between cerebral cortical and cerebellar brain regions. Sci Data 7, 340 (2020). https://doi.org/10.1038/s41597-020-00642-8

Download citation

Received: 28 May 2019
Accepted: 24 August 2020
Published: 12 October 2020
DOI: https://doi.org/10.1038/s41597-020-00642-8

This article is cited by

DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
- Pramod Bharadwaj Chandrashekar
- Sayali Alatkar
- Daifeng Wang
Genome Medicine (2023)
Sex differences in brain protein expression and disease
- Aliza P. Wingo
- Yue Liu
- Thomas S. Wingo
Nature Medicine (2023)
Prioritization of potential causative genes for schizophrenia in placenta
- Gianluca Ursini
- Pasquale Di Carlo
- Daniel R. Weinberger
Nature Communications (2023)
Expanding causal genes for Parkinson’s disease via multi-omics analysis
- Xiao-Jing Gu
- Wei-Ming Su
- Yong-Ping Chen
npj Parkinson's Disease (2023)
The IPDGC/GP2 Hackathon - an open science event for training in data science, genomics, and collaboration using Parkinson’s disease data
- Hampton L. Leonard
- Ruqaya Murtadha
- Alastair J. Noyce
npj Parkinson's Disease (2023)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Discussion

Methods

RNA-seq Re-alignment

RNA-seq normalization

AD diagnosis harmonization

Genotype QC and imputation

Genetic ancestry inference

eQTL analysis

Comparison with GTEx and fetal eQTL

GTEx eQTL generation

Pathway analysis of cerebellar eQTL genes

Coloc analysis

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

The CommonMind Consortium (CMC)

The AMP-AD Consortium

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links