Trans-ethnic kidney function association study reveals putative causal genes and effects on kidney-specific disease aetiologies

Morris, Andrew P.; Le, Thu H.; Wu, Haojia; Akbarov, Artur; van der Most, Peter J.; Hemani, Gibran; Smith, George Davey; Mahajan, Anubha; Gaulton, Kyle J.; Nadkarni, Girish N.; Valladares-Salgado, Adan; Wacher-Rodarte, Niels; Mychaleckyj, Josyf C.; Dueker, Nicole D.; Guo, Xiuqing; Hai, Yang; Haessler, Jeffrey; Kamatani, Yoichiro; Stilp, Adrienne M.; Zhu, Gu; Cook, James P.; Ärnlöv, Johan; Blanton, Susan H.; de Borst, Martin H.; Bottinger, Erwin P.; Buchanan, Thomas A.; Cechova, Sylvia; Charchar, Fadi J.; Chu, Pei-Lun; Damman, Jeffrey; Eales, James; Gharavi, Ali G.; Giedraitis, Vilmantas; Heath, Andrew C.; Ipp, Eli; Kiryluk, Krzysztof; Kramer, Holly J.; Kubo, Michiaki; Larsson, Anders; Lindgren, Cecilia M.; Lu, Yingchang; Madden, Pamela A. F.; Montgomery, Grant W.; Papanicolaou, George J.; Raffel, Leslie J.; Sacco, Ralph L.; Sanchez, Elena; Stark, Holger; Sundstrom, Johan; Taylor, Kent D.; Xiang, Anny H.; Zivkovic, Aleksandra; Lind, Lars; Ingelsson, Erik; Martin, Nicholas G.; Whitfield, John B.; Cai, Jianwen; Laurie, Cathy C.; Okada, Yukinori; Matsuda, Koichi; Kooperberg, Charles; Chen, Yii-Der Ida; Rundek, Tatjana; Rich, Stephen S.; Loos, Ruth J. F.; Parra, Esteban J.; Cruz, Miguel; Rotter, Jerome I.; Snieder, Harold; Tomaszewski, Maciej; Humphreys, Benjamin D.; Franceschini, Nora

doi:10.1038/s41467-018-07867-7

Download PDF

Article
Open access
Published: 03 January 2019

Trans-ethnic kidney function association study reveals putative causal genes and effects on kidney-specific disease aetiologies

Andrew P. Morris ORCID: orcid.org/0000-0002-6805-6014^1,2,
Thu H. Le³,
Haojia Wu⁴,
Artur Akbarov⁵,
Peter J. van der Most ORCID: orcid.org/0000-0001-8450-3518⁶,
Gibran Hemani ORCID: orcid.org/0000-0003-0920-1055⁷,
George Davey Smith ORCID: orcid.org/0000-0002-1407-8314⁷,
Anubha Mahajan ORCID: orcid.org/0000-0001-5585-3420²,
Kyle J. Gaulton⁸,
Girish N. Nadkarni^9,10,
Adan Valladares-Salgado¹¹,
Niels Wacher-Rodarte¹²,
Josyf C. Mychaleckyj ORCID: orcid.org/0000-0003-2595-0005¹³,
Nicole D. Dueker¹⁴,
Xiuqing Guo¹⁵,
Yang Hai¹⁵,
Jeffrey Haessler¹⁶,
Yoichiro Kamatani ORCID: orcid.org/0000-0001-8748-5597¹⁷,
Adrienne M. Stilp¹⁸,
Gu Zhu¹⁹,
James P. Cook¹,
Johan Ärnlöv^20,21,
Susan H. Blanton^14,22,
Martin H. de Borst ORCID: orcid.org/0000-0002-4127-8733²³,
Erwin P. Bottinger⁹,
Thomas A. Buchanan²⁴,
Sylvia Cechova³,
Fadi J. Charchar ORCID: orcid.org/0000-0002-6164-9941^25,26,27,
Pei-Lun Chu²⁸,
Jeffrey Damman²⁹,
James Eales ORCID: orcid.org/0000-0001-6238-5952⁵,
Ali G. Gharavi ORCID: orcid.org/0000-0003-2801-233X³⁰,
Vilmantas Giedraitis ORCID: orcid.org/0000-0003-3423-2021³¹,
Andrew C. Heath³²,
Eli Ipp^33,34,
Krzysztof Kiryluk³⁰,
Holly J. Kramer³⁵,
Michiaki Kubo³⁶,
Anders Larsson³⁷,
Cecilia M. Lindgren^2,38,39,
Yingchang Lu⁹,
Pamela A. F. Madden³²,
Grant W. Montgomery ORCID: orcid.org/0000-0002-4140-8139⁴⁰,
George J. Papanicolaou⁴¹,
Leslie J. Raffel⁴²,
Ralph L. Sacco^43,44,45,
Elena Sanchez ORCID: orcid.org/0000-0002-7041-5485³⁰,
Holger Stark⁴⁶,
Johan Sundstrom ORCID: orcid.org/0000-0003-2247-8454³⁷,
Kent D. Taylor¹⁵,
Anny H. Xiang⁴⁷,
Aleksandra Zivkovic⁴⁶,
Lars Lind³⁷,
Erik Ingelsson^48,49,50,51,
Nicholas G. Martin ORCID: orcid.org/0000-0003-4069-8020¹⁹,
John B. Whitfield¹⁹,
Jianwen Cai⁵²,
Cathy C. Laurie¹⁸,
Yukinori Okada^17,53,
Koichi Matsuda ORCID: orcid.org/0000-0001-7292-2686⁵⁴,
Charles Kooperberg¹⁶,
Yii-Der Ida Chen¹⁵,
Tatjana Rundek^43,44,
Stephen S. Rich¹³,
Ruth J. F. Loos^9,55,
Esteban J. Parra⁵⁶,
Miguel Cruz¹¹,
Jerome I. Rotter¹⁵,
Harold Snieder⁶,
Maciej Tomaszewski ORCID: orcid.org/0000-0001-8215-6567^5,57,
Benjamin D. Humphreys ORCID: orcid.org/0000-0002-6420-8703⁴ &
…
Nora Franceschini⁵⁸

Nature Communications volume 10, Article number: 29 (2019) Cite this article

9874 Accesses
89 Citations
32 Altmetric
Metrics details

Subjects

Abstract

Chronic kidney disease (CKD) affects ~10% of the global population, with considerable ethnic differences in prevalence and aetiology. We assemble genome-wide association studies of estimated glomerular filtration rate (eGFR), a measure of kidney function that defines CKD, in 312,468 individuals of diverse ancestry. We identify 127 distinct association signals with homogeneous effects on eGFR across ancestries and enrichment in genomic annotations including kidney-specific histone modifications. Fine-mapping reveals 40 high-confidence variants driving eGFR associations and highlights putative causal genes with cell-type specific expression in glomerulus, and in proximal and distal nephron. Mendelian randomisation supports causal effects of eGFR on overall and cause-specific CKD, kidney stone formation, diastolic blood pressure and hypertension. These results define novel molecular mechanisms and putative causal genes for eGFR, offering insight into clinical outcomes and routes to CKD treatment development.

A catalog of genetic loci associated with kidney function from analyses of a million individuals

Article 31 May 2019

Matthias Wuttke, Yong Li, … Cristian Pattaro

Sex-specific and pleiotropic effects underlying kidney function identified from GWAS meta-analysis

Article Open access 23 April 2019

Sarah E. Graham, Jonas B. Nielsen, … Cristen J. Willer

Mapping eGFR loci to the renal transcriptome and phenome in the VA Million Veteran Program

Article Open access 26 August 2019

Jacklyn N. Hellwege, Digna R. Velez Edwards, … Adriana M. Hung

Introduction

Chronic kidney disease (CKD) affects ~10% of the global population, with considerable racial/ethnic differences in prevalence and risk factors^1,2. CKD is associated with premature cardiovascular disease and mortality, and has enormous healthcare costs for treatment, prescriptions and hospitalizations^3,4,5,6. The underlying mechanisms for CKD predisposition and development are unknown, limiting progress in the identification of prognostic biomarkers or the advancement of treatment interventions.

Large-scale genome-wide association studies (GWAS) of estimated glomerular filtration rate (eGFR), a measure of kidney function used to define CKD, have mostly been undertaken in populations of European^7,8,9 and East Asian¹⁰ ancestry. Despite the success of these GWAS in identifying loci contributing to kidney function and risk of CKD, the common single nucleotide variants (SNVs) driving the association signals explain no more than ~4% of the observed-scale heritability of eGFR, and efforts to replicate these findings in other ancestry groups have been limited¹¹. Furthermore, efforts to localise the variants driving eGFR association signals at these loci, and the putative causal genes through which their effects are mediated, have been hampered by the extensive linkage disequilibrium (LD) across common variation in European and East Asian ancestry populations.

To enhance understanding of the genetic contribution to kidney function and CKD across diverse populations, and to inform global public health and personalised medicine, we recently established the Continental Origins and Genetic Epidemiology Network Kidney (COGENT-Kidney) Consortium. We undertook trans-ethnic meta-analysis of eGFR GWAS in 71,638 individuals ascertained from populations of African, East Asian, European and Hispanic/Latino ancestry¹². These investigations provided no evidence of heterogeneity in allelic effects on eGFR association signals between ancestry groups, emphasizing the power of trans-ethnic GWAS meta-analysis for locus discovery that will be relevant to diverse populations.

To further extend characterization of the genetic contribution to eGFR, and determine the molecular mechanisms and putative causal genes through which association signals impact on kidney function, we expand the COGENT-Kidney Consortium in this investigation by assembling GWAS in up to 312,468 individuals of diverse ancestry. With these data, we identify novel loci and distinct associations for kidney function, assess the evidence for heterogeneity in their allelic effects on eGFR, and determine genomic annotations in which these signals are enriched. We identify high-confidence variants driving eGFR association signals through annotation-informed trans-ethnic fine-mapping, and highlight putative causal genes through which their effects are mediated via integration with expression in kidney tissue. Finally, we evaluate the causal effects of eGFR on clinically-relevant renal and cardiovascular outcomes through Mendelian randomisation (MR) with our expanded catalogue of kidney function loci.

Results

Study overview

We assembled GWAS in up to 312,468 individuals from three sources (Methods): (i) 19 studies of diverse ancestry from the COGENT-Kidney Consortium, expanding the previously published trans-ethnic meta-analysis¹² to include additional individuals of Hispanic/Latino descent; (ii) a published meta-analysis of 33 studies of European ancestry from the CKDGen Consortium⁹; and (iii) a published study of East Asian ancestry from the Biobank Japan Project¹⁰. Each GWAS was imputed up to the Phase 1 integrated 1000 Genomes Project reference panel¹³, and SNVs passing quality control were tested for association with eGFR, calculated from serum creatinine, accounting for age, sex and ethnicity, as appropriate (Methods).

The current study represented a 2.2-fold increase in sample size over the largest published GWAS of kidney function¹⁰. Assuming homogeneous allelic effects on eGFR across populations, we had more than 80% power to detect an association (p < 5 × 10⁻⁸) with SNVs explaining at least 0.0127% of the trait variance under an additive genetic model. This corresponded to common/low-frequency SNVs with minor allele frequency (MAF) ≥5%/≥0.5% that decrease eGFR by ≥0.0366/≥0.113 standard deviations.

Trans-ethnic meta-analysis

To discover novel loci contributing to kidney function in diverse populations, we first aggregated eGFR association summary statistics across studies through trans-ethnic meta-analysis (Methods). We employed Stouffer’s method, implemented in METAL¹⁴, because allelic effect sizes were reported on different scales in each of the three sources contributing to the meta-analysis. We identified 93 loci attaining genome-wide significant evidence of association with eGFR (p < 5 × 10⁻⁸), including 20 mapping outside regions previously implicated in kidney function (Supplementary Figure 1, Supplementary Table 1). The strongest novel associations (Table 1) mapped to/near MYPN (rs7475348, p = 8.6 × 10⁻¹⁹), SHH (rs6971211, p = 6.5 × 10⁻¹³), XYLB (rs36070911, p = 2.3 × 10⁻¹¹) and ORC4 (rs13026220, p = 3.1 × 10⁻¹¹).

Table 1 Novel loci attaining genome-wide significant evidence (p < 5 × 10⁻⁸) of association with eGFR in trans-ethnic meta-analysis of up to 312,468 individuals of diverse ancestry

Full size table

Across the 93 loci, we then delineated 127 distinct association signals (at locus-wide significance, p < 10⁻⁵) through approximate conditional analyses implemented in GCTA¹⁵ (Methods), each arising from different underlying causal variants and/or haplotype effects (Supplementary Tables 1 and 2). The most complex genetic architecture was observed at SLC22A2 and UMOD-PDILT, where the eGFR association was delineated to four distinct signals at each locus (Supplementary Figure 2). Genome-wide, application of LD Score regression¹⁶ to a meta-analysis of only European ancestry studies revealed the observed scale heritability of eGFR to be 7.6%, of which 44.7%/5.4% was attributable to variation in the known/novel loci reported here (Methods).

Trans-ethnic heterogeneity in eGFR association signals

To assess the evidence for a genetic contribution to ethnic differences in CKD prevalence, we investigated differences in eGFR associations across the diverse populations contributing to our meta-analysis. We performed trans-ethnic meta-regression of allelic effect sizes obtained from GWAS contributing to the COGENT-Kidney Consortium, implemented in MR-MEGA¹⁷, including two axes of genetic variation that separate population groups as covariates to account for heterogeneity that is correlated with ancestry (Methods, Supplementary Figure 3). Despite substantial differences in allele frequencies at index SNVs for the distinct associations across ethnicities, we observed no significant evidence (p < 0.00039, Bonferroni correction for 127 signals) of heterogeneity in allelic effects on eGFR that was correlated with ancestry (Supplementary Tables 2 and 3). Furthermore, all index SNVs had minor allele frequencies >1% in multiple ethnic groups, indicating that the distinct eGFR association signals were not ancestry-specific. These observations are consistent with a model in which causal variants for eGFR as a measure of kidney function are shared across global populations and arose prior to human population migration out of Africa.

Enrichment of eGFR associations for genomic annotations

To gain insight into the molecular mechanisms that underlie the genetic contribution to kidney function, we investigated genomic signatures of functional and regulatory annotation that were enriched for eGFR associations across the 127 distinct signals. Specifically, we compared the odds of eGFR association for SNVs mapping to each annotation with those that did not map to the annotation (Methods). We began by considering genic regions, as defined by the GENCODE Project¹⁸, and observed significant enrichment (p < 0.05) of eGFR associations in protein-coding exons (p = 0.0049), but not in 3’ or 5’ UTRs. We then interrogated chromatin immuno-precipitation sequence (ChIP-seq) binding sites for 161 transcription factors from the ENCODE Project¹⁹, which revealed significant joint enrichment of eGFR associations for HDAC2 (p = 0.0088) and EZH2 (p = 0.030). Class I histone deacetylases (including HDAC2) are required for embryonic kidney gene expression, growth and differentiation²⁰, whilst EZH2 participates in histone methylation and transcriptional repression²¹. Finally, we considered ten groups of cell-type-specific regulatory annotations for histone modifications (H3K4me1, H3K4me3, H3K9ac and H3K27ac)^22,23. Significant enrichment of eGFR associations was observed only for kidney-specific annotations (p = 7.4 × 10⁻¹⁴). In a joint model of these four enriched annotations, the odds of eGFR association for SNVs mapping to protein-coding exons, binding sites for HDAC2 and EZH2, and kidney-specific histone modifications were increased by 3.06-, 2.13-, 1.76- and 4.29-fold, respectively (Supplementary Figure 4).

Annotation-informed trans-ethnic fine-mapping

We performed trans-ethnic fine-mapping to localise putative causal variants for distinct eGFR association signals that were shared across global populations by taking advantage of differences in the structure of LD between ancestry groups²⁴. To further enhance fine-mapping resolution, we incorporated an annotation-informed prior model for causality, upweighting SNVs mapping to the globally enriched genomic signatures of eGFR associations (Methods). Under this prior, we derived credible sets of variants for each distinct signal, which together account for 99% of the posterior probability (π) of driving the eGFR association (Supplementary Table 4). For 40 signals, a single SNV accounted for more than 50% of the posterior probability of driving the eGFR association, which we defined as high-confidence for causality (Supplementary Table 5). We assessed the evidence of association of these high-confidence SNVs with other measures of kidney function and damage in published GWAS^9,10,25 (Supplementary Table 6). Several SNVs demonstrated nominal associations (p < 0.05) with eGFR calculated from cystatin C, blood urea nitrogen and urine albumin creatinine ratio, with the expected direction of effect of the eGFR decreasing allele.

Putative causal genes at eGFR association signals

We sought to identify the most likely target gene(s) through which the effects of each of the 40 high-confidence SNVs on eGFR were mediated via functional annotation and colocalisation with expression quantitative trait loci (eQTLs) in kidney tissue.

Only four of the SNVs were missense variants (Table 2), encoding CACNA1S p.Arg1539Cys (rs3850625, p = 2.5 × 10⁻⁹, π = 99.0%), CPS1 p.Thr1406Asn (rs1047891, p = 1.5 × 10⁻²⁹, π = 98.1%), GCKR p.Leu446Pro (rs1260326, p = 2.0 × 10⁻³⁵, π = 86.1%) and CERS2 p.Glu115Ala (rs267738, p = 1.7 × 10⁻¹⁰, π = 55.3%). Functional annotation of these high-confidence missense variants highlighted predicted deleterious impact of CPS1 p.Thr1406Asn and CERS2 p.Glu115Ala (Methods). CACNA1S (Calcium Voltage-Gated Channel Subunit Alpha 1s) encodes a subunit of L-type calcium channel located within the glomerular afferent arteriole, is the target of anti-hypertensive dihydropyridine calcium channel blockers (such as amlodipine and nifedipine), and regulates arteriolar tone and intra-glomerular pressure²⁶. CACNA1S missense mutations cause hypokalemic periodic paralysis^27,28, malignant hyperthermia²⁹ and congenital myopathy³⁰. CACNA1S is highly expressed in skeletal muscle tissue, raising the possibility that the high-confidence missense variant may influence eGFR through creatinine production. CPS1 (Carbamoyl-Phosphate Synthase 1) is involved in the urea cycle, where the enzyme plays an important role in removing excess ammonia from cells³¹. GCKR (Glucokinase Regulator) produces a regulatory protein that inhibits glucokinase, and the p.Leu446Pro substitution is a highly pleiotropic variant with reported effects on a wide range of phenotypes, including metabolic traits and type 2 diabetes³².

Table 2 High confidence SNVs driving eGFR associations and putative causal genes through which their effects on kidney function are mediated

Full size table

CERS2 (Ceramide Synthase 2) variants have previously been associated with albuminuria in individuals with diabetes³³, and interrogation of the Human Protein Atlas³⁴ revealed that the CERS2 protein is abundantly expressed in the glomerulus and tubules of the kidney. Cers2-deficient mice exhibit changes in the structure of the kidney³⁵. We verified that Cers2 mRNA is expressed in primary podocytes isolated from the mouse using a previously published method³⁶ (Methods, Supplementary Figure 5). To gain insight into the potential role of CERS2 in podocyte motility and function, we isolated and grew primary murine podocytes in culture, and exposed them to the CERS2 inhibitor, ST-1074^37,38 (Methods). We compared the podocyte migration rate among treated and untreated cells using the scratch wound-healing assay (Supplementary Figure 6). Primary podocytes treated with 3 µM concentration of the CESRS2 inhibitor had a lower migration rate than untreated cells, with significantly higher percentages of uncovered areas remaining at 18 h after wound-scratch. Podocytes treated with ST-1074 appeared much more elongated at 18 h. Although we cannot rule out off-target effects of the inhibitor, these preliminary results suggest that CERS2 may have a functional impact on podocyte biology. However, further studies are needed to determine the specific role of the gene in the kidney, in vivo, in health and disease states.

The remaining 36 high-confidence SNVs mapped to non-coding regions, which we assessed for colocalisation with eQTL from two resources: (i) non-cancer affected healthy kidney tissue obtained from 260 individuals from the TRANScriptome of renaL humAn TissuE (TRANSLATE) Study^39,40 and The Cancer Genome Atlas (TCGA)⁴¹; and (ii) kidney biopsies obtained from 134 healthy donors from the TransplantLines Study⁴² (Methods). We observed that high-confidence eGFR SNVs colocalised with lead renal eQTL variants in the TRANSLATE Study and TGCA (Table 2, Supplementary Table 7) for FGF5 (rs12509595, p = 4.7 × 10⁻¹⁶, π = 57.1%), TBX2 (rs887258, p = 2.7 × 10⁻¹³, π = 62.2%), and both UMOD and GP2 for the same signal at the UMOD-PDILT locus (rs77924615, p = 1.5 × 10⁻⁵⁴, π = 100.0%). Of these three high-confidence SNVs, rs8872528 was a significant eQTL (defined by 5% false discovery rate) for TBX2 across multiple tissues in the GTEx Project⁴³, whilst the associations of rs12509595 and rs77924615 with an expression of FGF5 and UMOD/GP2, respectively, were specific to kidney. FGF5 (Fibroblast Growth Factor 5) is expressed during kidney development, but knockout models have not shown a kidney phenotype⁴⁴. FGF5 has been implicated in GWAS of blood pressure and hypertension⁴⁵, and other fibroblast growth factors are increasingly recognised as contributors to blood pressure regulation through renal mechanisms⁴⁰. TBX2 (T-Box 2) plays a role in defining the pronephric nephron in experimental models⁴⁶. UMOD encodes uromodulin (Tamm-Horsfall protein), the most abundant urinary protein. The eGFR lowering allele at the high-confidence SNV is associated with increased UMOD expression (Supplementary Figure 7), which is consistent with previous investigations that demonstrated uromodulin overexpression in transgenic mice leads to salt-sensitive hypertension and the presence of age-dependent renal lesions⁴⁷.

Mapping genes to kidney cells

Kidney cells are highly specialised in function based on their location in nephron segments. Previous investigations in mouse and human have revealed that genes at kidney trait-related loci are expressed in a cell-specific manner^48,49. To provide insight into cellular specificity of the signals at the UMOD-PDILT, FGF5 and TBX2 loci, we mapped the four genes identified through eQTL analyses to cell types from single nucleus RNA-sequencing (snRNA-seq) data obtained from a healthy human kidney donor (4254 cells, with an average of 1803 detected genes per cell)⁴⁹. UMOD and GP2 demonstrated expression specific to epithelial cells of the ascending loop of Henle (Fig. 1). Uromodulin is involved in protection against urinary tract infections⁵⁰, and the global distribution of UMOD regulatory variants in humans correlates with pathogen diversity and prevalence in urine⁵¹. Glycoprotein 2 is a protein involved in innate immunity. These findings suggest a role for these two proteins in kidney physiology and potential host defence immunity to uropathogens at the UMOD-PDILT locus.

By localising high-confidence SNVs to introns and UTRs (Methods), we identified eight additional genes with differential expression across nephron single cell-types (Fig. 1, Table 2): LRP2, SLC34A1 and DPEP1 (specific to proximal tubule); SPTBN1 (specific to glomeruli endothelial cells); PIP5K1B (specific to glomeruli mesangial cells); and LARP4B, BCAS3, and MPPED2 (multiple cell types in the distal nephron). Of these, DPEP1, which encodes the protein dipeptidase 1, is implicated in the renal metabolism of glutathione and its conjugates, and regulates leukotriene activity. This localisation fits with the previously suggested connection between glutathione metabolism and defence against chemical injury in proximal tubule cells⁵². Taken together, these findings suggest a potential role of these genes in influencing kidney structure and function through regulation of: (i) glomerular capillary pressure, determining intra-glomerular pressure and glomerular filtration; (ii) proximal tubular reabsorption, affecting tubuloglomerular feedback; or (iii) distal nephron handling of sodium or acid load, influencing kidney disease progression. Additional laboratory-based functional studies will be required to delineate the mechanistic pathways that determine kidney function in healthy and disease states, and potential routes to therapeutic targets for pharmacologic development.

Causal effects of eGFR on clinically-relevant outcomes

We sought to evaluate the causal effect of eGFR on clinically-relevant kidney and cardiovascular outcomes via two-sample MR⁵³ (Methods, Supplementary Tables 8, 9 and 10). Analyses were performed separately in each of the three components of the trans-ethnic meta-analysis because allelic effect sizes were measured on different scales in each. For each trait, we accounted for heterogeneity in causal effects of eGFR via modified Q-statistics⁵⁴, excluding outlying genetic instruments that may reflect pleiotropic SNVs and violate the assumptions of MR (Methods, Supplementary Tables 9 and 10).

In each component, we detected a significant (p < 0.0042, Bonferroni correction for 12 traits) causal effect of lower eGFR on higher risk of all-cause CKD, glomerular diseases and CKD stage 5, based on reported association summary statistics from the CKDGen Consortium⁸ and the UK Biobank (Fig. 2, Supplementary Table 8). We also detected a significant causal effect of lower eGFR on lower risk of calculus of the kidney and ureter, in each component, based on reported association summary statistics from the UK Biobank (Fig. 3, Supplementary Table 8). The lead eGFR SNV at the UMOD-PDILT locus (rs77924615) has been previously associated with kidney stone formation⁵⁵ and is consistent with the role of uromodulin in the inhibition of urine calcium crystallisation⁵⁶. However, this SNV was excluded from the MR analysis due to heterogeneity in effect size and was therefore not driving the causal eGFR association with risk of calculus of the kidney and ureter (Supplementary Table 9).

We also detected a novel causal effect of lower eGFR (at nominal significance, p < 0.05, in each component of the trans-ethnic meta-analysis) on higher diastolic blood pressure (DBP) and higher risk of essential (primary) hypertension, but not on systolic blood pressure, based on reported association summary statistics from automated readings and ICD10 codes from primary care data available in the UK Biobank (Fig. 4, Supplementary Table 8). These results are consistent with a role for reduced functional nephron mass on increased peripheral arterial resistance⁵⁷ and confirm previous findings from observational studies⁵⁸. Although the causal association with DBP could not be replicated using published meta-analysis association summary statistics from the International Consortium for Blood Pressure (ICBP)⁵⁹ (Supplementary Table 11), we note that their blood pressure measures were corrected for body-mass index (in addition to age and sex), and there was significant evidence of heterogeneity in effects of eGFR on outcome across SNVs, indicating potential pleiotropy due to collider bias, and consequently invalidating MR estimates. Despite the large sample sizes available for MR analyses from the CardiogramplusC4D Consortium⁶⁰ and MEGASTROKE Consortium⁶¹, there was no significant evidence of a causal association of eGFR on cardiovascular disease outcomes: coronary heart disease, myocardial infarction or ischemic stroke (Supplementary Table 8).

Discussion

We identified 20 novel loci for eGFR through trans-ethnic meta-analysis, and dissected 127 distinct association signals that together explain an additional 5.3% of the genome-wide observed scale heritability. The effects of index SNVs for these distinct eGFR association signals were homogeneous across major ancestry groups, which is consistent with a model in which the underlying causal variants are shared across diverse populations, and therefore amenable to trans-ethnic fine-mapping. The localisation of causal variants at eGFR association signals was further enhanced through integration with enriched signatures of genomic annotation that included kidney-specific histone modifications.

We localised high-confidence causal variants driving 40 distinct eGFR association signals, the majority of which have not been previously reported. Through a variety of approaches, including colocalisation with eQTLs in human kidney, and identification of differential expression between human kidney cell types through snRNA-seq, these high-confidence variants implicated several putative causal genes that account for eGFR variation at kidney function loci. Therefore, our strategy of utilising multiple kidney tissue-specific resources to uncover likely causal variants and the genes through which their effects are mediated, followed by mapping of these genes to specific cells in the nephron, provides important biological insight and potential targets for drug development. Knowledge of the specificity of gene expression in nephron segments should also inform future experiments to elucidate the function of some of these genes and potentially define causal molecular mechanisms underlying CKD.

MR analyses of lead SNVs at kidney function loci highlighted previously unreported causal effects of lower eGFR on higher risk of primary glomerular diseases, lower risk of kidney stone formation, and higher DBP and risk of hypertension. The causal relationships of eGFR to these outcomes have been demonstrated to be consistent across ancestries, which is essential for the development of potential interventions that would be relevant to diverse global populations. Our MR analyses also identified lead eGFR SNVs with heterogeneous causal effects on these outcomes, indicating potential pleiotropy. However, further work will be required to determine the specific pathways through which these pleiotropic SNVs act, including non-eGFR determinants of serum creatinine-based eGFR estimating equations.

In conclusion, we have undertaken the most comprehensive trans-ethnic GWAS of eGFR, which has significantly enhanced knowledge of the genetic contribution to kidney function. Our investigation emphasizes the importance of genetic studies of eGFR in diverse populations and their integration with cell-type specific kidney expression data for maximising gains in discovery and fine-mapping of kidney function loci. Taken together, these strategies offer the most promising route to treatment development for a disease with major public health impact across the globe.

Methods

Ethics statement

All human research was approved by the relevant institutional review boards and conducted according to the Declaration of Helsinki. All participants provided written informed consent. All mice were maintained on a 12-h light–dark cycle with free access to standard chow and water in the animal facility of the University of Virginia (UVA). Experiments were carried out in accordance with local and NIH guidelines, and the animal protocol was approved by the UVA Institutional Animal Care and Use Committee.

COGENT-Kidney Consortium: study-level analyses

Study sample characteristics for GWAS from the COGENT-Kidney Consortium, which incorporates 81,829 individuals of diverse ancestry (32.4% Hispanic/Latino, 28.8% European, 28.8% East Asian and 10.0% African American) are presented in Supplementary Table 12. These GWAS included those reported previously¹² but were expanded with the addition of further studies of Hispanic/Latino ancestry to increase the diversity of represented population groups. Samples were assayed with a range of GWAS genotyping products, and quality control was undertaken within each study (Supplementary Table 13). Samples were excluded because of low genome-wide call rate, extreme heterozygosity, sex discordance, cryptic relatedness, and outlying ethnicity. SNVs were excluded because of low call rate across samples and extreme deviation from Hardy–Weinberg equilibrium. Non-autosomal SNVs were excluded from imputation and association analysis. Within each study, the GWAS genotype scaffold was pre-phased^62,63 and imputed up to the Phase 1 integrated (version 3) multi-ethnic reference panel from the 1000 Genomes Project¹³ using IMPUTEv2^63,64 or minimac^63,65 (Supplementary Table 13). Imputed variants were retained for downstream association analyses if they attained IMPUTEv2 info≥0.4 or minimac r² ≥ 0.3.

Within each study, eGFR was calculated from serum creatinine (mg/dL), accounting for age, sex and ethnicity, using the four-variable MDRD equation^66,67,68. We tested the association of eGFR with each SNV in a linear regression framework, under an additive dosage model, and with adjustment for study-specific covariates to account for confounding due to population structure (Supplementary Table 13). For each SNV, the association Z-score was derived from the allelic effect estimate and corresponding standard error. Z-scores and standard errors were then corrected for residual population structure via genomic control⁶⁹ where necessary (Supplementary Table 13).

CKDGen Consortium: meta-analysis

Full details of the CKDGen Consortium meta-analysis, which incorporated GWAS in 110,517 individuals of European ancestry, have been previously published⁹. Briefly, individuals were assayed with a range of GWAS genotyping products. After quality control, GWAS scaffolds were pre-phased^62,63 and imputed^63,64,65 up to the Phase 1 integrated (version 1 or version 3) multi-ethnic or European-specific reference panels from the 1000 Genomes Project¹³. Imputed variants were retained for downstream association analyses if they attained IMPUTEv2 info≥0.4 or MaCH/minimac r²≥0.4. Within each study, eGFR was calculated from serum creatinine (mg/dL), accounting for age and sex, using the four-variable Modification of Diet in Renal Disease (MDRD) equation^66,67,68. Residuals obtained after regressing ln(eGFR) on age and sex, and study-specific covariates to account for population structure where appropriate, were tested for association with each SNV in a linear regression framework, under an additive dosage model. Association summary statistics within each GWAS were corrected for residual population structure via genomic control⁶⁹ where necessary and were subsequently aggregated across studies, under a fixed-effects model, with inverse-variance weighting of allelic effect sizes, as implemented in METAL¹⁴.

From the available meta-analysis summary statistics for each SNV (downloaded from http://ckdgen.imbi.uni-freiburg.de/), we derived the association Z-score from the ratio of the allelic effect estimate and corresponding standard error. No further correction for population structure was required by genomic control⁶⁹: λ_GC = 0.977.

Biobank Japan Project: study-level analysis

Full details of the Biobank Japan Project GWAS, which incorporated 143,658 individuals of East Asian ancestry, have been previously published¹⁰. Briefly, individuals were assayed with the Illumina HumanOmniExpressExome BeadChip or a combination of the Illumina HumanOmniExpress BeadChip and the Illumina HumanExome BeadChip. After quality control, the GWAS scaffold was pre-phased with MaCH⁷⁰ and imputed up to the Phase 1 integrated (version 3) East Asian-specific reference panel from the 1000 Genomes Project¹³ with minimac^63,65. Imputed variants were retained for downstream association analyses if they attained minimac r² ≥ 0.7. For each individual, eGFR was derived from serum creatinine (mg/dL) using the Japanese coefficient-modified CKD Epidemiology Collaboration (CKD-EPI) equation^71,72,73, and adjusted for age, sex, ten principal components of genetic ancestry, and affection status for 47 diseases. The resulting residuals were inverse-rank normalised and tested for association with each SNV in a linear regression framework, under an additive dosage model.

From the available GWAS summary statistics for each SNV (downloaded from http://jenger.riken.jp/en/result), we derived the association Z-score from the ratio of the allelic effect estimate and corresponding standard error, and subsequently corrected for residual population structure by genomic control⁶⁹: λ_GC = 1.252.

Trans-ethnic meta-analysis

We aggregated eGFR association summary statistics across the three components: COGENT-Kidney Consortium GWAS, the Biobank Japan Project GWAS and the CKDGen Consortium meta-analysis. We performed fixed-effects meta-analysis, with sample size weighting of Z-scores (Stouffer’s method), as implemented in METAL¹⁴, because allelic effect estimates were on different scales in the contributing components. The COGENT-Kidney Consortium included a GWAS of a subset of 23,536 individuals from those contributing to the Biobank Japan Project, which was therefore excluded from the trans-ethnic meta-analysis. Consequently, a combined sample size of 312,468 individuals contributed to the trans-ethnic meta-analysis. SNVs reported in at least 50% of the combined sample size were retained for downstream interrogation. Meta-analysis association summary statistics were corrected for residual population structure via genomic control⁶⁹: λ_GC = 1.113.

Locus definition

We first selected lead SNVs attaining genome-wide significant evidence of association (p < 5 × 10⁻⁸) with eGFR in the trans-ethnic meta-analysis that were separated by at least 500kb. Loci were defined by the flanking genomic interval mapping 500kb up- and down-stream of lead SNVs. Where loci overlapped, they were combined as a single locus, and the lead SNV with minimal p-value from the meta-analysis was retained.

Dissection of association signals

To dissect distinct eGFR association signals at loci attaining genome-wide significance in the trans-ethnic meta-analysis, we used an iterative approximate conditional approach, implemented in GCTA¹⁵. Each COGENT-Kidney Consortium GWAS was first assigned to an ethnic group (Supplementary Table 12) represented in the 1000 Genomes Project reference panel (Phase 3, October 2014 release)⁷⁴. The Biobank Japan Project was assigned to the East Asian ethnic group, and the CKDGen Consortium meta-analysis was assigned to the European ethnic group. Haplotypes in the 1000 Genome Project panel that were specific to the assigned ethnic group were then used as a reference for LD between SNVs across loci for the GWAS in the approximate conditional analysis.

For each locus, we first applied GCTA to the study-level association summary statistics and matched LD reference for each GWAS (or the CKDGen Consortium meta-analysis). We adjusted for the conditional set of variants, which in the first iteration included only the lead SNV at the locus, and aggregated Z-scores across studies with sample size weighting (Stouffer’s method) under a fixed-effects model, as implemented in METAL¹⁴. The conditional meta-analysis summary statistics were corrected for residual population structure using the same genomic control adjustment⁶⁹ as in the unconditional analysis (λ_GC = 1.113). We defined locus-wide significance by p < 10⁻⁵, which is a Bonferroni correction for the approximate number of (independent) SNVs at each locus. If no SNVs attained locus-wide significant evidence of residual association with eGFR, the iterative approximate conditional analysis for the locus was stopped. Otherwise, the SNV with the strongest residual association signal was added to the conditional set. This iterative process continued, at each stage adding the SNV with the strongest residual association from the meta-analysis to the conditional set, until no remaining SNVs attained locus-wide significance. Note, that at each iteration, studies with missing association summary statistics for any SNV in the conditional set were excluded from the meta-analysis.

For each locus including more than one SNV in the conditional set, we then dissected each distinct association signal. We again applied GCTA to the study-level association summary statistics and matched LD reference for each GWAS (or the CKDGen Consortium meta-analysis), but this time by removing each SNV, in turn, from the conditional set of variants, and adjusting for the remainder. The conditional meta-analysis summary statistics were corrected for residual population structure using the same genomic control adjustment⁶⁹ as in the unconditional analysis (λ_GC = 1.113). The SNV with the strongest residual association was defined as the index for the signal.

Estimation of observed scale heritability

We used LD Score regression¹⁶ to assess the contribution of variation to the observed scale heritability of eGFR. LD Score regression accounts for LD between SNVs on the basis of European ancestry individuals from the 1000 Genomes Project⁷⁴. We therefore performed fixed-effects meta-analysis, with sample size weighting of Z-scores (Stouffer’s method), as implemented in METAL¹⁴, across European ancestry studies from the COGENT-Kidney Consortium and CKDGen Consortium (134,070 individuals), and used these association summary statistics in LD Score regression. We first calculated the contribution of genome-wide variation to the observed scale heritability of eGFR. We then partitioned the genome into previously reported and novel loci attaining genome-wide significance in the trans-ethnic meta-analysis (Supplementary Table 1) and calculated the observed scale heritability of eGFR attributable to each.

Estimation of allelic effect sizes at index SNVs

Allelic effect estimates were obtained from a meta-analysis of GWAS from the COGENT-Kidney Consortium, including 81,829 individuals of diverse ancestry (Supplementary Table 12), because the other components applied different transformations to eGFR prior to association analysis. The meta-analysis was performed under a fixed-effects model with inverse-variance weighting of effect sizes, implemented in METAL¹⁴. For loci with multiple signals of association, the allelic effect of an index SNV for each GWAS, prior to meta-analysis, was estimated by application of GCTA¹⁵ to the study-level association summary statistics and ancestry-matched LD reference, and adjusting for the other index SNVs at the locus. The same approach was used to obtain ethnic-specific allelic effect size estimates by implementing fixed-effects meta-analysis of GWAS within each ancestry group.

Assessment of heterogeneity in allelic effect sizes

We considered GWAS from the COGENT-Kidney Consortium, including 81,829 individuals of diverse ancestry (Supplementary Table 12), because the other components applied different transformations to eGFR prior to association analysis. We constructed a distance matrix of mean effect allele frequency differences between each pair of GWAS across a subset of SNVs reported in all studies. We implemented multi-dimensional scaling of the distance matrix to obtain two principal components that define axes of genetic variation to separate GWAS from the four major ancestry groups represented in the trans-ethnic meta-analysis. For each SNV, allelic effects on eGFR across GWAS were modelled in a linear regression framework, incorporating the two axes of genetic variation as covariates, and weighted by the inverse of the variance of the effect estimates, implemented in MR-MEGA¹⁷. Within this modelling framework, heterogeneity in allelic effects on eGFR between GWAS is partitioned into two components. The first component is correlated with ancestry and is accounted for in the meta-regression by the axes of genetic variation, whilst the second is the residual, which is not due to population genetic differences between GWAS.

Enrichment of eGFR associations in genomic annotations

Within each locus, for each distinct signal, we first approximated the Bayes’ factor⁷⁵ in favour of eGFR association of each SNV on the basis of summary statistics from the trans-ethnic meta-analysis. Specifically, the Bayes’ factor for the jth SNV at the ith distinct association signal is approximated by

$$\Lambda _{ij} = {\mathrm{exp}}\left[ {\frac{{Z_{ij}^2 - {\mathrm{ln}}K}}{2}} \right]$$

(1)

where $Z_{ij}^{}$ is the Z-score from the trans-ethnic meta-analysis across K contributing GWAS. The log-odds of association of the SNV is then given by

$${\mathrm{ln}}\left[ {\frac{{\Lambda _{ij}}}{{T_i - \Lambda _{ij}}}} \right]$$

(2)

where $T_i = \mathop {\sum }\limits_j \Lambda _{ij}$ is the total Bayes’ factor for the ith signal across all SNVs at the locus.

We modelled the log-odds of association of each SNV, for each distinct signal, in a logistic regression framework, as a function of binary variables indicating an overlap with a given genomic annotation. Specifically, for the jth SNV at the ith distinct association signal,

$${\mathrm{ln}}\left[ {\frac{{\Lambda _{ij}}}{{T_i - \Lambda _{ij}}}} \right] = \alpha _i + \beta _kz_{ijk}$$

(3)

where $z_{ijk}$ = 1 indicates that the SNV maps to the kth annotation, and $z_{ijk}$ = 0 otherwise. In this expression, α_i is a constant for the ith distinct association signal, and β_k is the log-fold enrichment in the odds to the association for the kth annotation.

We considered three categories of functional and regulatory annotations. First, we considered genic regions, as defined by the GENCODE Project¹⁸, including protein-coding exons, and 3’ and 5’ UTRs as different annotations. Second, we considered the chromatin immuno-precipitation sequence (ChIP-seq) binding sites for 161 transcription factors from the ENCODE Project¹⁹. Third, we considered ten groups of cell-type-specific regulatory annotations for histone modifications (H3K4me1, H3K4me3, H3K9ac, and H3K27ac) obtained from a variety of resources^22,23, which were previously derived for partitioning heritability by annotation by LD Score regression⁷⁶.

Within each category, we first used forward selection to identify annotations that were jointly enriched at nominal significance (p < 0.05). We then included all selected annotations across categories in a final model to obtain joint estimates of the fold-enrichment in eGFR association signals for each.

Trans-ethnic fine-mapping

Within each locus, for each distinct signal, we calculated the posterior probability of driving the eGFR association for each SNV under an annotation-informed prior model, derived from the globally enriched annotations, and the Bayes’ factor approximated from the trans-ethnic meta-analysis. Specifically, for the jth SNV at the ith distinct association signal, the posterior probability $\pi _{ij} \propto \gamma _{ij}\Lambda _{ij}$. In this expression, the relative annotation informed prior is given by

$$\gamma _{ij} = {\mathrm{exp}}\left[ {\mathop {\sum }\limits_k \hat \beta _kz_{ijk}} \right]$$

(4)

where the summation is over the selected enriched annotations, and $\hat \beta _k$ is the estimated log-fold enrichment of the kth annotation from the final joint model.

We derived a 99% credible set⁷⁷ for the ith distinct association signal by: (i) ranking all SNVs according to their posterior probability $\pi _{ij}$; and (ii) including ranked SNVs until their cumulative posterior probability of driving the association attains or exceeds 0.99. Index SNVs accounting for more than 50% posterior probability of driving the eGFR association at a given signal were defined as high-confidence.

SNV associations with measures of kidney function and damage

We evaluated the evidence for association of high-confidence variants with measures of kidney function and damage from published GWAS: (i) eGFR calculated from cystatin C, obtained from up to 24,061 individuals of European ancestry from the CKDGen Consortium⁹; (ii) blood urea nitrogen, obtained from up to 139,818 individuals of East Asian ancestry from the Biobank Japan Project¹⁰; and (iii) urine albumin to creatinine ratio, obtained from up to 46,061 non-diabetic individuals of European ancestry from the CKDGen Consortium²⁵. Effects on these traits were aligned to the eGFR decreasing allele.

Functional annotation of high-confidence missense variants

We assessed the predicted functional impact of high-confidence missense variants across a range of databases including FATHMM (functional analyses through hidden Markov models)⁷⁸ and metaSVM (scores for non-synonymous variants based on SVM model)⁷⁹.

Primary podocyte cell culture and scratch assay

The protocol for isolation and culture of primary podocytes from 129S6 mice has been previously published³⁶. Briefly, under general anaesthesia with isoflurane, mice were perfused through the heart with 6 × 10⁸ magnetic beads/ml (Dynabeads, Invitrogen) diluted in 10 ml of phosphate-buffered saline (PBS). The kidneys were extracted, decapsulated and cut into small pieces, then digested in collagenase A (1mg/ml) at 37 ^oC for 25 min The digest was pressed with a syringe pestle through a 100 uM cell strainer, collected, and washed with 5 mL of PBS, repeated once, and then passed through a 40 uM cell strainer, and washed again with 5 mL of PBS. The isolated glomeruli were then washed off the cell strainer with pre-warmed cell culture media (RPMI 1640 with L-glutamine supplemented with 10% of fetal bovine serum, 100 units/ml of Pen/Strep and 1% of L-glutamine), then further isolated by magnetic particle concentrator (Invitrogen) and washed with a pre-warmed culture media 2–3 times. Glomeruli were then re-suspended in culture media and placed in the 6-well plate coated with rat tail type I collagen and incubated at 37 ^oC. Light microscopy was used to confirm the characteristic migration of podocytes from glomeruli after 3–5 days.

Approximately after 7–9 days, the primary podocytes reached a confluent monolayer, and a rectangular wound was created by scratching the monolayer from the top to the bottom of the well using the base of a sterile 200 µl pipette tip. Images of the scratched area were immediately taken after wound creation and after an additional 18 h of incubation, using EVOS XL Core Cell Imaging System (×10 magnification). We utilised ST-1074, a potent inhibitor that can be used in lower concentrations to avoid cell toxicity, for which the clinical potential of the class of compounds influencing CerS subtypes has been previously published^37,38. The inhibitor was added to the well containing 1 mL of culture media, at a concentration of 1 mM in DMSO, immediately after wound creation at a volume of 3 µL (3 µM). For the control condition, 3 µL of DMSO alone was added. At least 3 independent experiments were performed with 3–5 wells per condition. The images were analysed using ImageJ software (1.48v, NIH, USA), by measuring pixels of the outlined scratched area immediately after wound creation and after 18 h. The results are expressed as % of area that was not covered by migrating podocytes after 18 h of incubation time, compared to the scratched area created immediately after wound creation.

We note that in vitro (HCT116), ST-1074 significantly inhibits both CerS2 and CerS4 (screening conditions 10 µM). ST-1074 disclosed an IC50 value for C18:0 of 21.4 µM (CerS4 inhibition), and an IC50 value for C24:1 of 28.7 µM (CerS2 inhibition), which do not show significant difference. In HeLa cells, ST-1074 inhibited at 5 µM CerS2 and CerS4 significantly (C24:1 ~40%, C24:0 ~50%). However, in the Human Protein Atlas³⁴, CERS4 is only expressed in the tubules and not in podocytes. Although ST-1074 is a CerS2 inhibitor, off-target effects may also influence podocyte function.

Kidney tissue eQTLs: TRANSLATE Study and TGCA

We performed eQTL analysis using data from the TRANSLATE Study^39,40 and TGCA⁴¹. In brief, as a source of kidney tissue, both studies used apparently normal samples from European ancestry individuals undergoing nephrectomy due to kidney cancer (the specimens were collected from the cancer-unaffected pole of the organ). The data from both studies were processed in the same manner using procedures described below.

Gene expression was quantified in transcripts per million (TPM) using Kallisto⁸⁰. The quality control included: removing outlier samples^81,82, checking consistency between declared and biological sex (using XIST and Y-chromosome genes); removing genes on non-autosomal chromosomes; and removing genes with either interquartile range of zero or those not meeting the minimum expression criterion (TPM > 0.1 and read counts ≥6 in at least 30% of samples within each study/sequencing batch). Before cis-eQTL analysis, the log₂-transformed TPM data were normalised using robust quantile normalisation in the R package aroma and then standardised using rank-based inverse normal transformation in GenABEL. To account for technical variation, we used probabilistic estimation of expression residuals (PEER)⁸³: 30 latent factors for the TRANSLATE Study and 15 for TCGA as recommended for different sample sizes in the GTEx Project^84,85.

Kidney DNA samples from individuals from the TRANSLATE Study were genotyped using the Infinium HumanCoreExome-24 BeadChip array, and genotype calls were made using Genome Studio. Individuals from TCGA were genotyped using the Affymetrix Genome-Wide Human SNP Array 6.0, and genotype calls were made using the Birdseed algorithm. Quality control removed variants that: had low genotyping rate (<95%); mapped to Y chromosome/mitochondrial DNA or had an ambiguous chromosomal location; violated Hardy–Weinberg equilibrium (HWE, p < 0.001); or had MAF <5%. Quality control also removed individuals with: genotyping call-rate <95%; heterozygosity above/below 3 standard deviations from the mean; cryptic relatedness to other individuals; non-European genetic ancestry; and discordant sex information (inconsistency between declared and genotyped sex). For both studies, the resulting scaffold was imputed up to the Phase 3 multi-ethnic reference panel from the 1000 Genomes Project⁷⁴ using the Michigan Imputation Server⁸⁶. After imputation, we retained only SNVs, removing those with low imputation coefficient (R² < 0.4), MAF <5%, or violating HWE (p < 10⁻⁶).

A total of 260 individuals (160 from the TRANSLATE Study and 100 from TCGA) were included in the analysis, involving 15,711 genes and 5,498,156 SNVs common to both studies. Normalised gene expression was modelled as a function of alternate allele dosage via linear regression, including sex, three axes of genetic variation (to account for population structure) and PEER latent factors as additional covariates. The regression coefficients of the alternate allele from the two studies were then combined in a fixed-effects meta-analysis under an inverse-variance weighting scheme. For each gene, only those SNVs in cis (within 1 Mb of the transcription start/stop sites) were included in the analysis. A total of 2000 permutations were used to derive the empirical distribution of the smallest p-value for each gene, which then was used to adjust the observed smallest p-value for the gene. The correction for testing multiple genes was based on false discovery rate (FDR) applied to permutation-adjusted p-values (via Storey’s method as implemented in the R package q-value) with a cut-off of 5%. Furthermore, the thresholds for nominal p-values were derived using a global permutation-adjusted p-value closest to FDR of 5% and the empirical distributions determined using permutations.

We identified high-confidence SNVs from the trans-ethnic fine-mapping that were colocalised with lead eQTL variants (i.e. the same SNV or in strong LD, r²>0.8) at a 5% FDR, and reported the corresponding eGene.

Kidney tissue eQTLs: TransplantLines Study

We performed eQTL analysis using data from the TransplantLines Study⁴². The study includes kidneys from donors, donated after brain death or cardiac death. Samples were genotyped on the Illumina CytoSNP 12 v2 array and imputed up to the Phase 1 integrated (version 3) multi-ethnic reference panel from the 1000 Genomes Project¹³ using IMPUTEv2^63,64. Expression and genotype data were available for 236 kidney biopsies obtained from 134 donors, and analyses have been described previously⁵⁹. Briefly, residuals of gene expression for each probe were obtained after adjusting for the first 50 expression principal components to filter out environmental variation⁸⁷. A linear mixed model was used to test the association of residual expression of each probe with the allele dosage of each SNV mapping within 1Mb of the transcription start/stop sites using the R package lme3. Sex, age, donor type, time of biopsy and three axes of genetic variation (to account for population structure) were included in the model as fixed effects. Random effects were then included for donor to account for multiple samples obtained from the same individual.

We identified high-confidence SNVs from the trans-ethnic fine-mapping that were colocalised with lead eQTL variants (i.e. the same SNV or in strong LD, r²>0.8) at a 5% FDR, and reported the corresponding eGene.

Expression of GWAS genes across kidney cell-types

We identified genes for which high-confidence SNVs mapped to introns and untranslated regions. We mapped the genes to cell-types from snRNA-seq data generated by 10x Chromium from a healthy human kidney (62-year old white male, no history of CKD and serum creatinine of 1.03 mg/dl)⁴⁹. The kidney was dissected from the cortex and was bulk homogenized using a dounce homogenizer. The dataset included 4524 cells: 7.8% glomerular cells (including podocytes, endothelial cells and mesangial cells); 86.7% tubular cells (including proximal tubule, loop of Henle, distal convoluted tubule, connecting tubule, proximal tubule and intercalate cells); the remaining 5.5% cells are mostly macrophages and novel cells that do not map to known cell types. An average of 1803 genes were detected per cell. We generated a differential expression gene (DEG) list by performing Wilcoxon rank sum tests on each cell-type from the single nucleus dataset. A gene was defined as mapping to a specific kidney cell type if the expression fulfils all the following criteria: (i) present in the DEG list; (ii) expressed in >25% of the total cells in the specified cell-type; and (iii) log-fold change in expression was >0.25 in the specified cell-type when compared to all other cell-types⁴⁹. Gene expression values for each cell were Z-score normalised. A new gene expression matrix with mean Z-scores for each gene was obtained by averaging the Z-scores from all individual cells in the same cluster. The Z-score normalized gene expression were presented as a heatmap using the heatmap.2 function in the R package gplots.

Two-sample MR analyses

We performed a lookup of association summary statistics for lead SNVs at each of the eGFR loci across a range of clinically-relevant kidney and cardiovascular outcomes from public and proprietary data resources. These included: CKD (12,385 cases and 104,780 controls, published data from the CKDGen Consortium⁸); IgA nephropathy (3211 cases and 8735 controls, unpublished data); glomerular diseases (ICD10 N00-N08, 2,289 cases and 449,975 controls, extracted UK Biobank using GeneATLAS); CKD stage 5 (ICD10 N18, 4905 cases and 447,359 controls, extracted from UK Biobank using GeneATLAS); hypertensive renal disease (ICD10 I12, 1663 cases and 450,601 controls, extracted from UK Biobank using GeneATLAS); calculus of kidney and ureter (ICD10 N20, 5216 cases and 447,048 controls, extracted from UK Biobank using GeneATLAS); DBP (317,756 individuals, automated reading, extracted from UK Biobank using MR-BASE⁸⁸); systolic blood pressure (317,654 individuals, automated reading, extracted from UK Biobank using MR-BASE⁸⁸); essential (primary) hypertension (ICD10 I10, 84,640 cases and 367,624 controls, extracted from UK Biobank using GeneATLAS); coronary heart disease (60,801 cases and 123,504 controls, published data from the CardiogramplusC4D Consortium⁶⁰); myocardial infarction (43,676 cases and 128,199 controls, published data from the CardiogramplusC4D Consortium⁶⁰); and ischemic stroke (10,307 cases and 19,326 controls, published data from the MEGASTROKE Consortium⁶¹).

We performed two-sample MR for each outcome using eGFR as the exposure and the extracted non-palindromic lead SNVs as instrumental variables. The lead SNVs were not in LD with each other, so that their effects on exposure and outcomes were uncorrelated. Analyses were performed separately in each of the three components of the trans-ethnic meta-analysis because allelic effect sizes were measured on different scales in each: COGENT-Kidney Consortium (58,293 individuals after excluding those from the Biobank Japan Project); CDKGen Consortium (110,517 individuals); and Biobank Japan Project (143,658 individuals). For each trait, we first accounted for heterogeneity in causal effects of eGFR via modified Q-statistics⁵⁴, implemented in the R package RadialMR, which identified outlying genetic instruments that may reflect pleiotropic SNVs. For each trait, our primary MR analyses were then performed after excluding outlying SNVs in any component of the trans-ethnic meta-analysis using inverse variance weighted regression⁸⁹, implemented in the R package TwoSampleMR⁸⁸. We also assessed the evidence for causal association between exposure and outcome using two additional approaches that are less sensitive to heterogeneity (although less powerful) and implemented in the R package TwoSampleMR⁸⁸: weighted median regression⁹⁰ and MR-EGGER regression⁹¹.

We performed an additional lookup of association summary statistics for non-outlying lead SNVs at each of the eGFR loci for DBP (150,134 individuals, published data from ICBP⁵¹). We assessed the evidence for a causal association of eGFR on DBP in each component of the trans-ethnic meta-analysis using inverse variance weighted regression⁸⁹, weighted median regression⁹⁰ and MR-EGGER regression⁹¹, as implemented in the R package TwoSampleMR⁸⁸.

Data availability

Association summary statistics will be made available from: (i) the COGENT-Kidney Consortium component of the trans-ethnic meta-analysis; and (ii) the trans-ethnic meta-analysis across the COGENT-Kidney Consortium, CKDGen Consortium and Biobank Japan Project. Fine-mapping data for each distinct eGFR signal will be made available, including the posterior probability of driving the association for each SNV. These data will be made available via: (i) the University of Liverpool Statistical Genetics and Pharmacogenomics Research Group website (https://www.liverpool.ac.uk/translational-medicine/research/statistical-genetics/data-resources); and (ii) the dbGaP CHARGE Summary Results site⁹² with accession number phs000930. The source data underlying Fig. 1 and Supplementary Figure 7 are provided as a Source Data file.

References

GBD 2016 Causes of Death Collaborators. Global, regional and national age-sex specific mortality for 264 causes of death. 1980-2016: a systematic analysis of the Global Burden of Disease Study 2016. Lancet 390, 1151–1210 (2017).
Article Google Scholar
GBD 2016 Diseases and Injury Incidence and Prevalence Collaborators. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet 390, 1211–1259 (2017).
Article Google Scholar
Sarnak, M. J. Cardiovascular complications in chronic kidney disease. Am. J. Kidney Dis. 41, 11–17 (2003).
Article PubMed Google Scholar
Go, A. S. et al. Chronic kidney disease and the risks of death, cardiovascular events, and hospitalization. N. Engl. J. Med. 351, 1296–1305 (2004).
Article ADS CAS PubMed Google Scholar
Keith, D. S. et al. Longitudinal follow-up and outcomes among a population with chronic kidney disease in a large managed care organization. Arch. Intern. Med. 164, 659–663 (2004).
Article PubMed Google Scholar
Collins, A. J. et al. US Renal Data System 2012 Annual Data Report. Am. J. Kidney. Dis. 61, e1-e476 (2013).
Kottgen, A. N. et al. Multiple loci associated with indices of renal function and chronic kidney disease. Nat. Genet. 41, 712–717 (2009).
Article CAS PubMed PubMed Central Google Scholar
Pattaro, C. et al. Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function. Nat. Commun. 7, 10023 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Gorski, M. et al. 1000 Genomes-based meta-analysis identifies 10 novel loci for kidney function. Sci. Rep. 7, 45040 (2017).
Article ADS PubMed PubMed Central Google Scholar
Kanai, M. et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat. Genet. 50, 390–400 (2018).
Article CAS PubMed Google Scholar
Liu, C.-T. et al. Genetic association for renal traits among participants of African ancestry reveals new loci for renal function. PLoS. Genet. 7, e1002264 (2012).
Article CAS Google Scholar
Mahajan, A. et al. Trans-ethnic fine mapping highlights kidney-function genes linked to salt sensitivity. Am. J. Hum. Genet. 99, 636–646 (2016).
Article CAS PubMed PubMed Central Google Scholar
The 1000 Genomes Project Consortium. An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56–65 (2012).
Article PubMed Central CAS Google Scholar
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genome-wide association scans. Bioinformatics 26, 2190–2191 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–375 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bulik-Sullivan, B. et al. (2015). LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Magi, R. et al. Trans-ethnic meta-regression of genome-wide association studies accounting for ancestry increases power for discovery and improves fine-mapping resolution. Hum. Mol. Genet. 26, 3639–3650 (2017).
Article CAS PubMed PubMed Central Google Scholar
Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).
Article CAS PubMed PubMed Central Google Scholar
The ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Article ADS PubMed Central CAS Google Scholar
Chen, S. et al. Histone deacetylase (HDAC) activity is critical for embryonic kidney gene expression, growth and differentiation. J. Biochem. 286, 32775–32789 (2011).
CAS Google Scholar
Vire, E. et al. The Polycomb group protein EZH2 directly controls DNA methylation. Nature 439, 871–874 (2006).
Article ADS CAS PubMed Google Scholar
Trynka, G. et al. Chromatin marks identify critical cell types for fine-mapping complex trait variants. Nat. Genet. 45, 124–130 (2013).
Article CAS PubMed Google Scholar
Hnisz, D. et al. Super-enhancers in the control of cell identity and disease. Cell 155, 934–947 (2013).
Article CAS PubMed Google Scholar
Li, Y. R. & Keating, B. J. Trans-ethnic genome-wide association studies: advantages and challenges of mapping in diverse populations. Genome Med. 6, 91 (2014).
Article PubMed PubMed Central Google Scholar
Teumer, A. et al. Genome-wide association studies identify genetic loci associated with albuminuria in diabetes. Diabetes 65, 803–817 (2016).
Article CAS PubMed Google Scholar
Hayashi, K., Nagahama, T., Oka, K., Epstein, M. & Saruta, T. Disparate effects of calcium antagonists on renal microcirculation. Hypertens. Res. 19, 31–36 (1996).
Article CAS PubMed Google Scholar
Burge, J. A. & Hanna, M. G. Novel insights into the pathomechanisms of skeletal muscle channelopathies. Curr. Neurol. Neurosci. Rep. 12, 62–69 (2012).
Article CAS PubMed Google Scholar
Hanchard, N. A. et al. Exploring the utility of whole-exome sequencing as a diagnostic tool in a child with atypical episodic muscle weakness. Clin. Genet. 83, 457–461 (2013).
Article PubMed Google Scholar
Beam, T. A., Loudermilk, E. F. & Kisor, D. F. Pharmacogenetics and pathophysiology of CACNA1S mutations in malignant hyperthermia. Physiol. Genom. 49, 81–87 (2017).
Article CAS Google Scholar
Hunter, J. M. et al. Novel pathogenic variants and genes for myopathies identified by whole exome sequencing. Mol. Genet. Genom. Med. 3, 283–301 (2015).
Article CAS Google Scholar
Haberle, J. et al. Molecular defects in human carbamoyl phosphate synthetase I: mutational spectrum, diagnostic and protein structure considerations. Hum. Mutat. 32, 579–589 (2011).
Article PubMed PubMed Central CAS Google Scholar
Dupuis, J. et al. New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat. Genet. 42, 105–116 (2010).
Article CAS PubMed PubMed Central Google Scholar
Shiffman, D. et al. A gene variant in CERS2 is associated with rate of increase in albuminuria in patients with diabetes from ONTARGET and TRANSCEND. PLoS. One. 9, e106631 (2014).
Article PubMed PubMed Central CAS Google Scholar
Uhlen, M. et al. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article PubMed CAS Google Scholar
Imgrund, S. et al. Adult ceramide synthase 2 (CERS2)-deficient mice exhibit myelin sheath defects, cerebellar degeneration, and hepatocarcinomas. J. Biol. Chem. 284, 33549–33560 (2009).
Article CAS PubMed PubMed Central Google Scholar
Cechova, S. et al. MYH9 E1841K mutation augments proteinuria and podocyte injury and migration. J. Am. Soc. Nephrol. 29, 155–167 (2018).
Article PubMed Google Scholar
Schiffmann, S. et al. Inhibitors of specific ceramide synthases. Biochimie 94, 558–565 (2012).
Article CAS PubMed Google Scholar
Sofi, M. H. et al. Ceramide synthesis regulates T-cell activity and GVDH development. JCI Insight 2, 91701 (2017).
Article PubMed Google Scholar
Marques, F. Z. et al. Signatures of mir-181a on the renal transcriptome and blood pressure. Mol. Med. 21, 739–748 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tomaszewski, M. et al. Renal mechanisms of association between fibroblast growth factor 1 and blood pressure. J. Am. Soc. Nephrol. 26, 3151–3160 (2015).
Article CAS PubMed PubMed Central Google Scholar
The Cancer Genome Atlas Research Network,. et al. The cancer genome atlas pan-cancer analysis project. Nat. Genet. 45, 1113–1120 (2013).
Article CAS Google Scholar
Damman, J. et al. Hypoxia and complement-and-coagulation pathways in the deceased organ donor as the major target for intervention to improve renal allograft outcome. Transplantation 99, 1293–1300 (2015).
Article CAS PubMed Google Scholar
The GTEx Consortium. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Article Google Scholar
Cancilla, B., Davies, A., Cauchi, J. A., Risbridger, G. P. & Bertram, J. F. Fibroblast growth factor receptors and their ligands in the adult rat kidney. Kidney Int. 60, 147–155 (2001).
Article CAS PubMed Google Scholar
Ehret, G. et al. Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478, 103–109 (2011).
Article ADS CAS PubMed Google Scholar
Cho, G. S., Choi, S. C., Park, E. C. & Han, J. K. Role of Tbx2 in defining the territory of the pronephric nephron. Development 138, 465–474 (2011).
Article CAS PubMed Google Scholar
Trudu, M. et al. Common noncoding UMOD gene variants induce salt-sensitive hypertension and kidney damage by increasing uromodulin expression. Nat. Med. 19, 1655–1660 (2013).
Article CAS PubMed Google Scholar
Park, J. et al. Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease. Science 360, 758–763 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wu, H. et al. Comparative analysis and refinement of human PSC-derived kidney organoid differentiation with single-cell transcriptomics. Cell Stem Cell 23, 869–881 (2018).
Bates, J. M. et al. Tamm-Horsfall protein knockout mice are more prone to urinary tract infection: rapid communication. Kidney Int. 65, 791–797 (2004).
Article CAS PubMed Google Scholar
Ghirotto, S. et al. The uromodulin gene locus shows evidence of pathogen adaptation through human evolution. J. Am. Soc. Nephrol. 27, 2983–2996 (2016).
Article CAS PubMed PubMed Central Google Scholar
Visarius, T. M., Putt, D. A., Schare, J. M., Pegouske, D. M. & Lash, L. H. Pathways of glutathione metabolism and transport in isolated proximal tubular cells from rat kidney. Biochem. Pharmacol. 52, 259–272 (1996).
Article CAS PubMed Google Scholar
Pierce, B. L. & Burgess, S. Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am. J. Epidemiol. 178, 1177–1184 (2013).
Article PubMed PubMed Central Google Scholar
Bowden, J. et al. Improving the visualisation, interpretation and analysis of two-sample summary data Mendelian randomization via the radial plot and radial regression. Int. J. Epidemiol. 47, 1264–1278 (2018).
Gudbjartsson, D. F. et al. Association of variants at UMOD with chronic kidney disease and kidney stones - role of age and comorbid diseases. PLoS. Genet. 6, e1001039 (2010).
Article PubMed PubMed Central CAS Google Scholar
Hess, B., Nakagawa, Y. & Coe, F. L. Inhibition of calcium oxalate monohydrate crystal aggregation by urine proteins. Am. J. Physiol. 257, F99–F106 (1989).
CAS PubMed Google Scholar
Tedla, F. M., Brar, A., Browne, R. & Brown, C. Hypertension in chronic kidney disease: navigating the evidence. Int. J. Hypertens. 2011, 132405 (2011).
Article CAS PubMed PubMed Central Google Scholar
Vaaraniemi, K. et al. Lower glomerular filtration rate is associated with higher systemic vascular resistance in patients without prevalent kidney disease. J. Clin. Hypertens. (Greenwich) 16, 722–728 (2014).
Article CAS Google Scholar
Wain, L. et al. Novel blood pressure locus and gene discovery using genome-wide association study and expression data sets from blood and the kidney. Hypertension 70, e4–e19 (2017).
Article CAS Google Scholar
Nikpay, M. et al. A comprehensive 1000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat. Genet. 47, 1121–1130 (2015).
Article CAS PubMed PubMed Central Google Scholar
Malik, R. et al. Low-frequency and common genetic variation in ischemic stroke: the MEGASTROKE collaboration. Neurology 86, 1217–1226 (2016).
Article CAS PubMed PubMed Central Google Scholar
Delaneau, O., Marchini, J. & Zagury, J. F. A linear complexity phasing method for thousands of genomes. Nat. Methods 9, 179–181 (2011).
Article PubMed CAS Google Scholar
Howie, B., Fuchsberger, C., Stephens, M., Marchini, J. & Abecasis, G. R. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat. Genet. 44, 955–959 (2012).
Article CAS PubMed PubMed Central Google Scholar
Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS. Genet. 5, e1000529 (2009).
Article PubMed PubMed Central CAS Google Scholar
Fuchsberger, C., Abecasis, G. R. & Hinds, D. A. Minimac2: faster genotype imputation. Bioinformatics 31, 782–784 (2015).
Article CAS PubMed Google Scholar
Levey, A. S. et al. A more accurate method to estimate glomerular filtration rate from serum creatinine: a new prediction equation. Ann. Intern. Med. 130, 461–470 (1999).
Article CAS PubMed Google Scholar
National Kidney Foundation. K/DOQI clinical practice guidelines for chronic kidney disease: evaluation, classification, and stratification. Am. J. Kidney Dis. 39, S1–S266 (2002).
Google Scholar
Levey, A. S. et al. Using standardized serum creatinine values in the modification of diet in renal disease study equation for estimating glomerular filtration rate. Ann. Intern. Med. 145, 247–254 (2006).
Article CAS PubMed Google Scholar
Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).
Article CAS PubMed MATH Google Scholar
Li, Y., Willer, C. J., Ding, J., Scheet, P. & Abecasis, G. R. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet. Epidemiol. 34, (816–834 (2010).
Google Scholar
Levey, A. S. et al. A new equation to estimate glomerular filtration rate. Ann. Intern. Med. 150, 604–612 (2009).
Article PubMed PubMed Central Google Scholar
Levey, A. S. & Stevens, L. A. Estimating GFR using the CKD epidemiology collaboration (CKD-EPI) creatinine equation: more accurate GFR estimates, lower CKD prevalence estimates, and better risk predictions. Am. J. Kidney Dis. 55, 622–627 (2010).
Article PubMed PubMed Central Google Scholar
Horio, M., Inai, E., Yasuda, Y., Watanabe, T. & Matsuo, S. Modification of the CKD epidemiology collaboration (CKD-EPI) equation for Japanese: accuracy and use for population estimates. Am. J. Kidney Dis. 56, 32–38 (2010).
Article PubMed Google Scholar
The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Article CAS Google Scholar
Schwarz, G. Estimating the dimension of a model. Ann. Stat. 6, 461–464 (1978).
Article MathSciNet MATH Google Scholar
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 47, 1228–1235 (2015).
Article CAS PubMed PubMed Central Google Scholar
Maller, J. B. et al. Bayesian refinement of association signals for 14 loci in 3 common diseases. Nat. Genet. 44, 1294–1301 (2012).
Article CAS PubMed PubMed Central Google Scholar
Shihab, H. A. et al. Predicting the functional, molecular and phenotypic consequences of amino acid substitutions using hidden Markov models. Hum. Mutat. 34, 57–65 (2013).
Article CAS PubMed Google Scholar
Dong, C. et al. Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies. Hum. Mol. Genet. 24, 2125–2137 (2015).
Article CAS PubMed Google Scholar
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
Article CAS PubMed Google Scholar
Wright, F. A. et al. Heritability and genomics of gene expression in peripheral blood. Nat. Genet. 46, 430–437 (2014).
Article CAS PubMed PubMed Central Google Scholar
‘t Hoen, P. A. C. et al. Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories. Nat. Biotechnol. 31, 1015–1022 (2013).
Article PubMed CAS Google Scholar
Stegle, O., Parts, L., Piipari, M., Winn, J. & Durbin, R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat. Protoc. 7, 500–507 (2012).
Article CAS PubMed PubMed Central Google Scholar
The GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article PubMed Central CAS Google Scholar
The GTEx Consortium. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Article CAS Google Scholar
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Article CAS PubMed PubMed Central Google Scholar
Fehrmann, R. S. et al. Trans-eqtls reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the HLA. PLoS. Genet. 7, e1002197 (2011).
Article CAS PubMed PubMed Central Google Scholar
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Article PubMed PubMed Central Google Scholar
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 37, 658–665 (2013).
Article PubMed PubMed Central Google Scholar
Bowden, J., Davey Smith, G., Haycock, P. C. & Burgess, S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 40, 304–314 (2016).
Article PubMed PubMed Central Google Scholar
Bowden, J., Davey Smith, G. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 44, 512–525 (2015).
Article PubMed PubMed Central Google Scholar
Rich, S. S. et al. Rapid evaluation of phenotypes, SNPand results through the dbGaP CHARGE Summary Results site. Nat. Genet. 48, 702–703 (2016).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

T.H.L. is supported by the NIH (R01-DK-113632). G.H. is supported by the Wellcome Trust (208806/Z/17/Z). G.H. and G.D.S. work in the Medical Research Council Integrative Epidemiology Unit at the University of Bristol (MC_UU_00011/1). C.M.L. is supported by the Li Ka Shing Foundation, WT-SSI/John Fell funds, the Oxford NIHR Biomedical Research Centre, Widenlife and the NIH (5P50-HD-028138–27). R.L.S. and T.R. are supported by the NIH (R37-NS-029993 and U54-TR-002736) and the Evelyn F McKnight Brain Institute. H.S. and A.Z. are supported by the DFG (INST 208/664–1). M.T. is supported the British Heart Foundation (PG/17/35/33001) and Kidney Research UK (RP_017_20180302). N.F. is supported by the NIH (R01-MD-012765, R56-DK-104806, and R01-DK-117445-01A1). Additional funding and acknowledgements can be found in Supplementary Note 1.

Author information

Authors and Affiliations

Department of Biostatistics, University of Liverpool, Liverpool, L69 3GL, UK
Andrew P. Morris & James P. Cook
Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
Andrew P. Morris, Anubha Mahajan & Cecilia M. Lindgren
Department of Medicine, Division of Nephrology, University of Virginia, Charlottesville, VA, 22908, USA
Thu H. Le & Sylvia Cechova
Division of Nephrology, Washington University School of Medicine, St Louis, MO, 63110, USA
Haojia Wu & Benjamin D. Humphreys
Division of Cardiovascular Sciences, Faculty of Medicine, Biology and Health, University of Manchester, Manchester, M13 9PT, UK
Artur Akbarov, James Eales & Maciej Tomaszewski
Department of Epidemiology, University of Groningen, University Medical Center Groningen, P.O. Box 30.001, 9700 RB, Groningen, Netherlands
Peter J. van der Most & Harold Snieder
MRC Integrative Epidemiology Unit, Population Health Sciences, University of Bristol, Bristol, BS8 1TH, UK
Gibran Hemani & George Davey Smith
Department of Pediatrics, University of California, San Diego, San Diego, CA, 92161, USA
Kyle J. Gaulton
Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Girish N. Nadkarni, Erwin P. Bottinger, Yingchang Lu & Ruth J. F. Loos
Division of Nephrology and Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Girish N. Nadkarni
Unidad de Investigación Médica en Bioquímica, Hospital de Especialidades, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, 06720, Mexico
Adan Valladares-Salgado & Miguel Cruz
Unidad de Investigación Médica en Epidemiologia Clinica, Hospital de Especialidades, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, 06720, Mexico
Niels Wacher-Rodarte
Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA, 22908, USA
Josyf C. Mychaleckyj & Stephen S. Rich
John P Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, 33124, USA
Nicole D. Dueker & Susan H. Blanton
Institute for Translational Genomics and Population Sciences, Departments of Pediatrics and Medicine, Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
Xiuqing Guo, Yang Hai, Kent D. Taylor, Yii-Der Ida Chen & Jerome I. Rotter
Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109-1024, USA
Jeffrey Haessler & Charles Kooperberg
Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
Yoichiro Kamatani & Yukinori Okada
Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
Adrienne M. Stilp & Cathy C. Laurie
Genetic Epidemiology Laboratory, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Gu Zhu, Nicholas G. Martin & John B. Whitfield
Department of Neurobiology, Care Sciences and Society, Division of Family Medicine and Primary Care, Karolinska Institutet, Huddinge, 141 83, Sweden
Johan Ärnlöv
School of Health and Social Studies, Dalarna University, Falun, 791 88, Sweden
Johan Ärnlöv
Dr John T Macdonald Department of Human Genetics, University of Miami, Miami, FL, 33124, USA
Susan H. Blanton
Department of Internal Medicine, Division of Nephrology, University of Groningen, University Medical Center Groningen, P.O. Box 30.001, 9700 RB, Groningen, Netherlands
Martin H. de Borst
Department of Medicine, Division of Endocrinology and Diabetes, Keck School of Medicine of USC, Los Angeles, CA, 90033, USA
Thomas A. Buchanan
School of Health and Life Sciences, Federation University Australia, Ballarat, VIC, 3350, Australia
Fadi J. Charchar
Department of Cardiovascular Sciences, University of Leicester, Leicester, LE1 7RH, UK
Fadi J. Charchar
Department of Physiology, University of Melbourne, Parkville, VIC, 3010, Australia
Fadi J. Charchar
Department of Internal Medicine, Fu Jen Catholic University Hospital, School of Medicine, Fu Jen Catholic University, New Taipei City, 242, Taiwan
Pei-Lun Chu
Department of Pathology, Erasmus Medical Center Rotterdam, P.O. Box 2040, 3000 CA, Rotterdam, Netherlands
Jeffrey Damman
Department of Medicine, Division of Nephrology, College of Physicians and Surgeons, Columbia University, New York, NY, 10032, USA
Ali G. Gharavi, Krzysztof Kiryluk & Elena Sanchez
Department of Public Health and Caring Sciences, Molecular Geriatrics, Uppsala University, Uppsala, 751 85, Sweden
Vilmantas Giedraitis
Department of Psychiatry, Washington University in St Louis, St Louis, MO, 63110, USA
Andrew C. Heath & Pamela A. F. Madden
David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, 90024, USA
Eli Ipp
Los Angeles Biomedical Research Institute at Harbor UCLA Medical Center, Torrance, CA, 90502, USA
Eli Ipp
Department of Medicine and Nephrology, Loyola University Medical Center, Maywood, IL, 60153, USA
Holly J. Kramer
Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan
Michiaki Kubo
Department of Medical Sciences, Clinical Epidemiology, Uppsala University, Uppsala, 751 85, Sweden
Anders Larsson, Johan Sundstrom & Lars Lind
Li Ka Shing Centre for Health Information and Discovery, Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7FZ, UK
Cecilia M. Lindgren
Broad Institute of Harvard and MIT, Boston, MA, 02142, USA
Cecilia M. Lindgren
Brisbane Institute for Molecular Bioscience, University of Queensland, St. Lucia, QLD 4072, Australia
Grant W. Montgomery
Epidemiology Branch, Division of Cardiovascular Sciences, National Heart, Lung and Blood Institute, Bethesda, MD, 20892, USA
George J. Papanicolaou
Department of Pediatrics, Division of Genetic and Genomic Medicine, University of California, Irvine Orange, CA, 92868, USA
Leslie J. Raffel
Departments of Neurology and Public Health Sciences, Miller School of Medicine, University of Miami, Miami, FL, 33136, USA
Ralph L. Sacco & Tatjana Rundek
Evelyn F McKnight Brain Institute, Miller School of Medicine, University of Miami, Miami, FL, 33136, USA
Ralph L. Sacco & Tatjana Rundek
Jackson Memorial Hospital, University of Miami, Miami, FL, 33136-1096, USA
Ralph L. Sacco
Institute of Pharmaceutical and Medicinal Chemistry, Heinrich Heine University Düsseldorf, Düsseldorf, 40225, Germany
Holger Stark & Aleksandra Zivkovic
Department of Research and Education, Kaiser Permanente Southern California, Pasadena, CA, 91101, USA
Anny H. Xiang
Department of Medicine, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA, 94309, USA
Erik Ingelsson
Stanford Cardiovascular Institute, Stanford University, Stanford, CA, 94309, USA
Erik Ingelsson
Stanford Diabetes Research Center, Stanford University, Stanford, CA, 94305, USA
Erik Ingelsson
Department of Medical Sciences, Molecular Epidemiology and Science for Life Laboratory, Uppsala University, Uppsala, 751 85, Sweden
Erik Ingelsson
Collaborative Studies Coordinating Center, Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-7420, USA
Jianwen Cai
Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, Suita, 565-0871, Japan
Yukinori Okada
Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo, 108-8639, Japan
Koichi Matsuda
Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
Ruth J. F. Loos
Department of Anthropology, University of Toronto at Mississauga, Mississauga, ON, L5L 1C6, Canada
Esteban J. Parra
Division of Medicine, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, M13 9WL, UK
Maciej Tomaszewski
Department of Epidemiology, University of North Carolina, Chapel Hill, NC, 27516-8050, USA
Nora Franceschini

Authors

Andrew P. Morris
View author publications
You can also search for this author in PubMed Google Scholar
Thu H. Le
View author publications
You can also search for this author in PubMed Google Scholar
Haojia Wu
View author publications
You can also search for this author in PubMed Google Scholar
Artur Akbarov
View author publications
You can also search for this author in PubMed Google Scholar
Peter J. van der Most
View author publications
You can also search for this author in PubMed Google Scholar
Gibran Hemani
View author publications
You can also search for this author in PubMed Google Scholar
George Davey Smith
View author publications
You can also search for this author in PubMed Google Scholar
Anubha Mahajan
View author publications
You can also search for this author in PubMed Google Scholar
Kyle J. Gaulton
View author publications
You can also search for this author in PubMed Google Scholar
Girish N. Nadkarni
View author publications
You can also search for this author in PubMed Google Scholar
Adan Valladares-Salgado
View author publications
You can also search for this author in PubMed Google Scholar
Niels Wacher-Rodarte
View author publications
You can also search for this author in PubMed Google Scholar
Josyf C. Mychaleckyj
View author publications
You can also search for this author in PubMed Google Scholar
Nicole D. Dueker
View author publications
You can also search for this author in PubMed Google Scholar
Xiuqing Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yang Hai
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Haessler
View author publications
You can also search for this author in PubMed Google Scholar
Yoichiro Kamatani
View author publications
You can also search for this author in PubMed Google Scholar
Adrienne M. Stilp
View author publications
You can also search for this author in PubMed Google Scholar
Gu Zhu
View author publications
You can also search for this author in PubMed Google Scholar
James P. Cook
View author publications
You can also search for this author in PubMed Google Scholar
Johan Ärnlöv
View author publications
You can also search for this author in PubMed Google Scholar
Susan H. Blanton
View author publications
You can also search for this author in PubMed Google Scholar
Martin H. de Borst
View author publications
You can also search for this author in PubMed Google Scholar
Erwin P. Bottinger
View author publications
You can also search for this author in PubMed Google Scholar
Thomas A. Buchanan
View author publications
You can also search for this author in PubMed Google Scholar
Sylvia Cechova
View author publications
You can also search for this author in PubMed Google Scholar
Fadi J. Charchar
View author publications
You can also search for this author in PubMed Google Scholar
Pei-Lun Chu
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Damman
View author publications
You can also search for this author in PubMed Google Scholar
James Eales
View author publications
You can also search for this author in PubMed Google Scholar
Ali G. Gharavi
View author publications
You can also search for this author in PubMed Google Scholar
Vilmantas Giedraitis
View author publications
You can also search for this author in PubMed Google Scholar
Andrew C. Heath
View author publications
You can also search for this author in PubMed Google Scholar
Eli Ipp
View author publications
You can also search for this author in PubMed Google Scholar
Krzysztof Kiryluk
View author publications
You can also search for this author in PubMed Google Scholar
Holly J. Kramer
View author publications
You can also search for this author in PubMed Google Scholar
Michiaki Kubo
View author publications
You can also search for this author in PubMed Google Scholar
Anders Larsson
View author publications
You can also search for this author in PubMed Google Scholar
Cecilia M. Lindgren
View author publications
You can also search for this author in PubMed Google Scholar
Yingchang Lu
View author publications
You can also search for this author in PubMed Google Scholar
Pamela A. F. Madden
View author publications
You can also search for this author in PubMed Google Scholar
Grant W. Montgomery
View author publications
You can also search for this author in PubMed Google Scholar
George J. Papanicolaou
View author publications
You can also search for this author in PubMed Google Scholar
Leslie J. Raffel
View author publications
You can also search for this author in PubMed Google Scholar
Ralph L. Sacco
View author publications
You can also search for this author in PubMed Google Scholar
Elena Sanchez
View author publications
You can also search for this author in PubMed Google Scholar
Holger Stark
View author publications
You can also search for this author in PubMed Google Scholar
Johan Sundstrom
View author publications
You can also search for this author in PubMed Google Scholar
Kent D. Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Anny H. Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Aleksandra Zivkovic
View author publications
You can also search for this author in PubMed Google Scholar
Lars Lind
View author publications
You can also search for this author in PubMed Google Scholar
Erik Ingelsson
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas G. Martin
View author publications
You can also search for this author in PubMed Google Scholar
John B. Whitfield
View author publications
You can also search for this author in PubMed Google Scholar
Jianwen Cai
View author publications
You can also search for this author in PubMed Google Scholar
Cathy C. Laurie
View author publications
You can also search for this author in PubMed Google Scholar
Yukinori Okada
View author publications
You can also search for this author in PubMed Google Scholar
Koichi Matsuda
View author publications
You can also search for this author in PubMed Google Scholar
Charles Kooperberg
View author publications
You can also search for this author in PubMed Google Scholar
Yii-Der Ida Chen
View author publications
You can also search for this author in PubMed Google Scholar
Tatjana Rundek
View author publications
You can also search for this author in PubMed Google Scholar
Stephen S. Rich
View author publications
You can also search for this author in PubMed Google Scholar
Ruth J. F. Loos
View author publications
You can also search for this author in PubMed Google Scholar
Esteban J. Parra
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Cruz
View author publications
You can also search for this author in PubMed Google Scholar
Jerome I. Rotter
View author publications
You can also search for this author in PubMed Google Scholar
Harold Snieder
View author publications
You can also search for this author in PubMed Google Scholar
Maciej Tomaszewski
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin D. Humphreys
View author publications
You can also search for this author in PubMed Google Scholar
Nora Franceschini
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Central analysis: A.P.M., A.M., K.J.G. and N.F. COGENT-Kidney Consortium GWAS analysis: A.P.M., A.M., G.N.N., A.V.-S., N.W.-R., J.C.M., N.D.D., X.G., Y.H., J.H., Y.K., A.M.S., G.Z., and J.P.C. COGENT-Kidney Consortium genotyping and phenotyping: J.A., S.H.B., E.P.B., T.A.B., V.G., A.C.H., E.Ipp, H.J.K., M.K., A.L., C.M.L., Y.L., P.A.F.M., G.W.M., G.J.P., L.J.R., R.L.S., J.S., K.D.T. and A.H.X. COGENT-Kidney Consortium principal investigator: A.P.M., L.L., E.Ingelsson, N.G.M., J.B.W., J.C., C.C.L., Y.O., K.M., C.K., Y.-D.I.C., T.R., S.S.R., R.J.F.L, E.J.P., M.C., J.I.R. and N.F. CERS2 functional experimentation: T.H.L., S.C., P.-L.C., H.Stark and A.Z. Kidney single-cell expression data generation and analysis: H.W. and B.D.H. TRANSLATE/TCGA data generation and eQTL analyses: A.A., J.E., F.J.C and M.T. TransplantLines data generation and eQTL analyses: P.v.d.M., M.H.d.B., J.D. and H.Snieder. Mendelian randomisation analyses: A.P.M., G.H. and G.D.S. IgA Nephropathy GWAS: A.G.G., K.K. and E.S. Manuscript preparation: A.P.M., T.H.L., H.W., A.A., M.T., B.D.H. and N.F. COGENT-Kidney Consortium co-ordination: A.P.M. and N.F. All authors approved the final manuscript.

Corresponding authors

Correspondence to Andrew P. Morris or Nora Franceschini.

Ethics declarations

Competing interests

G.N.N. has received operational funding from Goldfinch Bio. The remaining authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Morris, A.P., Le, T.H., Wu, H. et al. Trans-ethnic kidney function association study reveals putative causal genes and effects on kidney-specific disease aetiologies. Nat Commun 10, 29 (2019). https://doi.org/10.1038/s41467-018-07867-7

Download citation

Received: 16 August 2018
Accepted: 03 December 2018
Published: 03 January 2019
DOI: https://doi.org/10.1038/s41467-018-07867-7

This article is cited by

Genetic drivers of heterogeneity in type 2 diabetes pathophysiology
- Ken Suzuki
- Konstantinos Hatzikotoulas
- Eleftheria Zeggini
Nature (2024)
Can polygenic risk scores help explain disease prevalence differences around the world? A worldwide investigation
- Pritesh R. Jain
- Myson Burch
- Peristera Paschou
BMC Genomic Data (2023)
Ancestry-driven metabolite variation provides insights into disease states in admixed populations
- Kaylia M. Reynolds
- Andrea R. V. R. Horimoto
- Nora Franceschini
Genome Medicine (2023)
Mapping genomic regulation of kidney disease and traits through high-resolution and interpretable eQTLs
- Seong Kyu Han
- Michelle T. McNulty
- Matthew G. Sampson
Nature Communications (2023)
Imputation-powered whole-exome analysis identifies genes associated with kidney function and disease in the UK Biobank
- Matthias Wuttke
- Eva König
- Christian Fuchsberger
Nature Communications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Study overview

Trans-ethnic meta-analysis

Trans-ethnic heterogeneity in eGFR association signals

Enrichment of eGFR associations for genomic annotations

Annotation-informed trans-ethnic fine-mapping

Putative causal genes at eGFR association signals

Mapping genes to kidney cells

Causal effects of eGFR on clinically-relevant outcomes

Discussion

Methods

Ethics statement

COGENT-Kidney Consortium: study-level analyses

CKDGen Consortium: meta-analysis

Biobank Japan Project: study-level analysis

Trans-ethnic meta-analysis

Locus definition

Dissection of association signals

Estimation of observed scale heritability

Estimation of allelic effect sizes at index SNVs

Assessment of heterogeneity in allelic effect sizes

Enrichment of eGFR associations in genomic annotations

Trans-ethnic fine-mapping

SNV associations with measures of kidney function and damage

Functional annotation of high-confidence missense variants

Primary podocyte cell culture and scratch assay

Kidney tissue eQTLs: TRANSLATE Study and TGCA

Kidney tissue eQTLs: TransplantLines Study

Expression of GWAS genes across kidney cell-types

Two-sample MR analyses

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links