Figures
Abstract
Background
Genome-wide association studies have identified multiple genomic loci associated with coronary artery disease, but most are common variants in non-coding regions that provide limited information on causal genes and etiology of the disease. To overcome the limited scope that common variants provide, we focused our investigation on low-frequency and rare sequence variations primarily residing in coding regions of the genome.
Methods and results
Using samples of individuals of European ancestry from ten cohorts within the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium, both cross-sectional and prospective analyses were conducted to examine associations between genetic variants and myocardial infarction (MI), coronary heart disease (CHD), and all-cause mortality following these events. For prevalent events, a total of 27,349 participants of European ancestry, including 1831 prevalent MI cases and 2518 prevalent CHD cases were used. For incident cases, a total of 55,736 participants of European ancestry were included (3,031 incident MI cases and 5,425 incident CHD cases). There were 1,860 all-cause deaths among the 3,751 MI and CHD cases from six cohorts that contributed to the analysis of all-cause mortality. Single variant and gene-based analyses were performed separately in each cohort and then meta-analyzed for each outcome. A low-frequency intronic variant (rs988583) in PLCL1 was significantly associated with prevalent MI (OR = 1.80, 95% confidence interval: 1.43, 2.27; P = 7.12 × 10−7). We conducted gene-based burden tests for genes with a cumulative minor allele count (cMAC) ≥ 5 and variants with minor allele frequency (MAF) < 5%. TMPRSS5 and LDLRAD1 were significantly associated with prevalent MI and CHD, respectively, and RC3H2 and ANGPTL4 were significantly associated with incident MI and CHD, respectively. No loci were significantly associated with all-cause mortality following a MI or CHD event.
Citation: Hahn J, Fu Y-P, Brown MR, Bis JC, de Vries PS, Feitosa MF, et al. (2020) Genetic loci associated with prevalent and incident myocardial infarction and coronary heart disease in the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium. PLoS ONE 15(11): e0230035. https://doi.org/10.1371/journal.pone.0230035
Editor: Marie-Pierre Dubé, Universite de Montreal, CANADA
Received: February 18, 2020; Accepted: October 26, 2020; Published: November 13, 2020
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Data Availability: Summary association statistics from the meta-analyses are available from dbGaP at accession number phs000930. The authors confirm that, for approved reasons, some access restrictions apply to the data underlying the findings. This study makes use of the data of 10 cohorts: AGES, ARIC, CHS, FamHS, FHS, GeneSTAR, MESA, RS, SHIP, and WGHS. The genotype and phenotype data of the ARIC (dbGaP Study Accession: phs000280.v3.p1) and CHS (dbGaP Study Accession: phs000287.v5.p1) are available via dbGaP. Data for FHS are also available on dbGAP, at accession number phs000007. Data for the NHLBI Family Heart Study (FamHS-Visit1 and FamHS-Visit2) are available on dbGaP Study Accession: phs000221.v1.p1. Due to restrictions based on privacy regulations and/or informed consent of the participants, data cannot be made freely available in a public repository for GeneSTAR, RS, SHIP, WGHS, MESA, and AGES. Data of these studies can be obtained upon request. GeneSTAR data are available through application and approval from the GeneSTAR Study Steering Committee: https://www.hopkinsmedicine.org/gim/research/GeneSTAR/for_researchers. Requests for Rotterdam Study data should be directed towards the management team of the Rotterdam Study (secretariat.epi@erasmusmc.nl). The data of the SHIP study can be accessed through a data application form available at https://fvcm.med.uni-greifswald.de/ for researchers who meet the criteria for access to confidential data. WGHS data are available to researchers who are approved for analysis on site (contact person: dchasman@research.bwh.harvard.edu or https://whs.bwh.harvard.edu). To obtain data on MESA participants please contact Craig Johnson (wcraigj@uw.edu). Some participant data is also available on dbGaP (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000209.v13.p3). AGES data can be made available to interested researchers upon request following approval by the relevant institutional review boards. Requests can be directed to the Icelandic Heart Association (AGES_data_request@hjarta.is). Data sharing is in accordance with the informed consent after an approval by the Icelandic National Bioethics committee and the Data Protection Authority of Iceland.
Funding: No funding sources had a role in the design of the study or the analysis or interpretation of the data. Infrastructure for the CHARGE Consortium is supported in part by the National Heart, Lung and Blood Institute (NHLBI) grant R01HL105756. JH, ACM and PSdeV were supported by NIH NHLBI R01HL141291. PSdV was additionally supported by American Heart Association grant number 18CDA34110116. The views expressed in this manuscript are those of the authors and do not necessarily represent the views of the NHLBI; the National Institutes of Health; or the U.S. Department of Health and Human Services. The Age, Gene, Environment, Susceptibility Study (AGES) study has been funded by NIH contracts N01-AG-1-2100 and HHSN271201200022C, the NIA Intramural Research Program, Hjartavernd (the Icelandic Heart Association), and the Althingi (the Icelandic Parliament). The Atherosclerosis Risk in Communities study has been funded in whole or in part with Federal funds from the National Heart, Lung, and Blood Institute, National Institutes of Health, Department of Health and Human Services (contract numbers HHSN268201700001I, HHSN268201700002I, HHSN268201700003I, HHSN268201700004I and HHSN268201700005I). Funding support for “Building on GWAS for NHLBI-diseases: the U.S. CHARGE consortium” was provided by the NIH through the American Recovery and Reinvestment Act of 2009 (ARRA) (5RC2HL102419). Cardiovascular Health Study (CHS) research was supported by NHLBI contracts HHSN268201200036C, HHSN268200800007C, HHSN268201800001C, N01HC55222, N01HC85079, N01HC85080, N01HC85081, N01HC85082, N01HC85083, N01HC85086; and NHLBI grants U01HL080295, R01HL087652, R01HL105756, R01HL103612, R01HL120393, and U01HL130114 with additional contribution from the National Institute of Neurological Disorders and Stroke (NINDS). Additional support was provided through R01AG023629 from the National Institute on Aging (NIA). A full list of principal CHS investigators and institutions can be found at https://chsnhlbi.org/. The provision of genotyping data was supported in part by the National Center for Advancing Translational Sciences, CTSI grant UL1TR001881, and the National Institute of Diabetes and Digestive and Kidney Disease Diabetes Research Center (DRC) grant DK063491 to the Southern California Diabetes Endocrinology Research Center. The Family Heart Study (FamHS) was supported by the grant R01-HL-117078 from the National Heart, Lung, and Blood Institute, and grant R01-DK-089256 from the National Institute of Diabetes and Digestive and Kidney Diseases. The Framingham Heart Study (FHS) The National Heart, Lung and Blood Institute’s Framingham Heart Study is supported by contract N01-HC-25195. GeneSTAR was supported by grants from the National Institutes of Health/National Heart, Lung and Blood Institute (HL49762, HL59684, HL071025, HL58625, U01 HL72518, HL089474, HL092165, HL099747, K23HL105897, K23HL094747, HL11006, and HL112064), National Institute of Nursing Research (NR0224103, NR008153), National Institute of Neurological Disorders and Stroke (NS062059), and by a grant from the National Center for Research Resources (M01-RR000052) to the Johns Hopkins General Clinical Research Center. Genotyping services were provided through the RS&G Service by the Northwest Genomics Center at the University of Washington, Department of Genome Sciences, under U.S. Federal Government contract number HHSN268201100037C from the National Heart, Lung, and Blood Institute. MESA and the MESA SHARe projects are conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with MESA investigators. Support for MESA is provided by contracts 75N92020D00001, HHSN268201500003I, N01-HC-95159, 75N92020D00005, N01-HC-95160, 75N92020D00002, N01-HC-95161, 75N92020D00003, N01-HC-95162, 75N92020D00006, N01-HC-95163, 75N92020D00004, N01-HC-95164, 75N92020D00007, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, N01-HC-95169, UL1-TR-000040, UL1-TR-001079, UL1-TR-001420. Also supported in part by the National Center for Advancing Translational Sciences, CTSI grant UL1TR001881, and the National Institute of Diabetes and Digestive and Kidney Disease Diabetes Research Center (DRC) grant DK063491 to the Southern California Diabetes Endocrinology Research Center. For the Rotterdam Study, the work was supported by the Erasmus Medical Center and Erasmus University, Rotterdam; The Netherlands Organisation for the Health Research and Development (ZonMw); the Research Institute for Diseases in the Elderly (014-93-015, RIDE2); the Ministry of Education, Culture and Science; the Ministry for Health, Welfare and Sports; the European Commission (DG XII); the Municipality of Rotterdam; The Netherlands Organisation of Scientific Research (NWO) (175.010.2005.011, 911-03-012); the Netherlands Genomics Initiative (NGI) (NWO 050-060-810), the Netherlands Organisation for Scientific Research (NWO) (veni 916.12.154). SHIP is supported by the German Federal Ministry of Education and Research (Bundesministerium für Bildung und Forschung (BMBF); grants 01ZZ9603, 01ZZ0103, and 01ZZ0403) and the German Research Foundation (Deutsche Forschungsgemeinschaft (DFG); grant GR 1912/5-1). The Study of Health in Pomerania (SHIP) and SHIP-TREND are part of the Community Medicine Research net (CMR) of the Ernst-Moritz-Arndt University Greifswald (EMAU) which is funded by the BMBF as well as the Ministry for Education, Science and Culture and the Ministry of Labor, Equal Opportunities, and Social Affairs of the Federal State of Mecklenburg-West Pomerania. The CMR encompasses several research projects that share data from SHIP. The EMAU is a member of the Center of Knowledge Interchange (CKI) program of the Siemens AG. SNP typing of SHIP and SHIP-TREND using the Illumina Infinium HumanExome BeadChip (version v1.0) was supported by the BMBF (grant 03Z1CN22). The Women’s Genome Health Study (WGHS) is supported by the National Heart, Lung, and Blood Institute (HL043851, HL080467, HL099355) and the National Cancer Institute (CA047988 and UM1CA182913), with collaborative scientific support and funding for genotyping provided by Amgen. There was no additional external funding received for this study.
Competing interests: BMP reports serving service on the DSMB of a clinical trial funded by the manufacturer (Zoll LifeCor) and on the Steering Committee of the Yale Open Data Access Project. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Introduction
Coronary heart disease (CHD) is a leading cause of morbidity and mortality worldwide, accounting for one of every seven deaths in the United States in 2016 [1]. In addition to major modifiable risk factors such as dyslipidemia, hypertension, diabetes, and cigarette smoking [2], genetic susceptibility to CHD has also been investigated extensively through family-based studies, candidate gene studies, and more recently genome-wide association studies (GWAS) [3–9]. With progressively expanded sample sizes in recent GWAS, at least 160 loci have been associated with the risk of coronary artery disease [10–13]. Most of these loci are represented by common variants located in noncoding regions, resulting in limited implications for causal genes and etiological pathways. Further, while most available data are derived from genome-wide analysis of prevalent CHD, data are sparse from prospective studies of incident cardiovascular events in populations free of baseline cardiovascular disease.
Low-frequency and rare coding sequence variations across the genome have been investigated in studies of cardiovascular disease risk factors [14–18], with the goal of better understanding the etiology of these risk factors and to advance the discovery of the treatment and prevention of diseases [19]. We previously published the results from a prospective analysis of CHD among individuals of European ancestry from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium, and identified low-frequency and common variants associated with incident CHD [20].
In this current study of individuals of European ancestry, we implemented both a cross-sectional and prospective study design in the setting of the CHARGE Consortium to examine the association between low-frequency and rare genetic variants and the risk of prevalent and incident myocardial infarction (MI) and CHD. Study of incident cardiovascular events is enabled by the rigorous prospective design of population cohorts contributing to the CHARGE Consortium. We also investigated whether these genetic variants are associated with all-cause mortality after incident MI and CHD.
Materials and methods
Study design and participants
Ten cohorts within the CHARGE Consortium Subclinical Working Group were included in this study: Age, Gene, Environment, Susceptibility Study (AGES), Atherosclerosis Risk in Communities (ARIC) Study, Cardiovascular Health Study (CHS), Family Heart Study (FamHS), Framingham Heart Study (FHS), the GeneSTAR Study (GeneSTAR), Multi-Ethnic Study of Atherosclerosis (MESA), Rotterdam Study (RS), Study of Health in Pomerania (SHIP), and the Women’s Genome Health Study (WGHS). Detailed characteristics of the participating cohorts and study participant are shown in the S1 Document. All study participants provided written informed consent to participate in genetic studies, and all study sites received approval to conduct this research from their local Institutional Review Boards (IRB) respectively. AGES was approved by the National Bioethics Committee in Iceland that acts as the institutional review board for the Icelandic Heart Association and by the National Institute on Aging Intramural Institutional Review Board. ARIC was approved by the University of Mississippi Medical Center IRB, Wake Forest University Health Sciences IRB, University of Minnesota IRB, and John Hopkins University IRB. CHS was approved by the Wake Forest University Health Sciences IRB, University of California, Davis IRB, John Hopkins University IRB, and University of Pittsburgh IRB, and University of Washington IRB. FamHS was approved by the Washington University School of Medicine IRB. FHS was approved by the Boston University IRB. GeneSTAR was approved by the Johns Hopkins Medicine IRB. MESA was approved by the institutional review boards of the six field centers have approved the study protocol (Wake Forest University School of Medicine, University of Minnesota, Northwestern University, Columbia University, Johns Hopkins University, University of California, Los Angeles). RS was approved by the Medical Ethics Committee of the Erasmus MC and the Dutch Ministry of Health, Welfare and Sport. SHIP was approved by the Medical Ethics Committee of the University of Greifswald. WGHS was approved by the Brigham and Women’s Hospital IRB.
Genotype calling and quality control
Participants from WGHS were genotyped by the HumanHap300 Duo+ (Illumina, Inc., San Diego, CA), and all other study participants were genotyped by the HumanExome BeadChip (v1.0–1.2, Illumina, Inc., San Diego, CA) which contains more than 240,000 variants including those discovered through exome sequencing in ~12,000 individuals and other non-coding common variants such as previously-reported GWAS signals and ancestry-informative markers. Data for AGES, ARIC, CHS, FamHS, FHS, MESA, and RS were jointly called at the University of Texas Health Science Center at Houston [21]; SHIP was called in Illumina GenomeStudio using the CHARGE Consortium joint calling cluster file; GeneSTAR used the Illumina GenomeStudio and zCall software [22]; and WGHS data was called using the Illumina BeadStudio v.3.3. Variant quality control (QC) was performed centrally [21] and by the individual studies, including checking concordance with previous GWAS data, and excluding participants with missing >5% genotypes, population clustering outliers, individuals with high inbreeding coefficients or heterozygote rates, gender mismatches, duplicated pairs, and unexpectedly high proportion of identity-by-descent sharing for family studies. Joint calling of the measured exome chip genotypes allowed for the ability to accurately genotype rare variation using array technology.
Cardiovascular outcome definition
Two cardiovascular outcomes were examined for association in this study: 1) MI: fatal or non-fatal MI; and 2) CHD: fatal or non-fatal MI, fatal CHD, sudden death within one hour of onset of symptoms, or revascularization (percutaneous coronary artery intervention such as stent or balloon angioplasty, or coronary artery bypass grafting). No exclusions were applied for the cross-sectional analysis of prevalent MI and prevalent CHD. For analysis of incident events, participants with a history of MI, CHD or revascularization at the baseline examination were excluded. Follow-up time was defined as the time from the baseline exam to MI or CHD diagnosis, the time of death, last date of contact, or at the end of follow-up, whichever came first. All-cause mortality after MI or CHD was also investigated with follow-up time from first MI or CHD incident events until death, loss to follow-up, or the end of study.
Statistical analysis
Single variant and gene-based analyses were conducted in each participating cohort respectively, followed by meta-analysis performed for each cardiovascular outcome to summarize results. All autosomal variants were coded to the minor allele observed in the CHARGE jointly called data [21] and assumed log-additive genetic effect in the analyses. The minor allele frequency (MAF) thresholds were defined using the European allele frequencies derived from the CHARGE jointly called data [21]. Variant annotation was performed centrally within CHARGE using dbNSFP [23, 24]. Low-frequency variants (MAF ≥ 1% and less than 5%) were included in single variant tests for prevalent MI and CHD and for incident MI. Single variant results for incident CHD followed the same analytic approach and are reported in Morrison et al. [20] and are not reported in detail here. Gene-based tests were evaluated for MI and CHD outcomes: the Sequence Kernel Association Test (SKAT) [25] was used for incident events and post-MI mortality and a burden test [26] was performed for prevalent and incident events and for post-MI mortality. Only functional coding variants (missense, stop-gain, stop-loss, or splice-site changes) with MAF < 5% were aggregated by gene, and we only analyzed genes with a cumulative minor allele count (cMAC) ≥ 5.
For both single variant and gene-based burden tests of prevalent events, we performed Firth’s logistic regression model to test the association between each variant and cardiovascular outcome using the “logistf” package in R [27–29] to account for the possible inflated type one error in the rare variant association analysis in a case-cohort study design [30]. Meta-analysis for prevalent events was conducted with METAL [31] and applied the genomic control correction. For the single variant and two gene-based tests of incident events, a Cox proportional hazards regression model implemented in the seqMeta package in R was used to test the association between each variant and the incident event or post-event all-cause mortality. SeqMeta was used both at the study-specific analysis and meta-analysis levels [32]. All study-specific analyses (single variant and gene-based tests) were adjusted for cohort-specific design variables (e.g. study sites, family structure) and for population substructure using principal components as needed. We applied a Bonferroni corrected threshold to determine statistical significance in each analysis as described below.
Results
Prevalent MI and CHD association
A total of 27,349 participants of European ancestry from seven cohorts including 1831 prevalent MI cases (6.7%) and 2518 prevalent CHD cases (9.2%) were used in the meta-analyses of prevalent events (S1 Table). We individually examined a total of 6,221 low-frequency variants, across all autosomal chromosomes corresponding to a Bonferroni corrected significance threshold of P = 8.04 × 10−6. A low-frequency (MAF = 1.64%) intronic variant (rs988583) in the phospholipase C like 1 gene (PLCL1) was significantly associated with prevalent MI (P = 7.12 × 10−7; OR = 1.80, 95% confidence interval = 1.43 to 2.27; Table 1). No low-frequency variants were significantly associated with prevalent CHD. Cohort specific summary statistics for this association is shown in S2 Table.
In the gene-based burden tests, we analyzed 16,628 autosomal genes that contained functional low-frequency or rare variants with MAF < 5% and with a cumulative minor allele count (cMAC) ≥ 5; therefore, the Bonferroni corrected p-value threshold was P = 3.01 × 10−6. The transmembrane serine protease 5 gene (TMPRSS5) on chromosome 11, containing nine nonsynonymous rare variants (S3 Table), was significantly associated with prevalent MI (P = 2.59 × 10−6, OR = 3.00, 95% confidence interval: 1.90, 4.73; Table 2). The low-density lipoprotein receptor class A domain containing 1 gene (LDLRAD1) on chromosome 1 contained seven rare variants (S3 Table) and was significantly associated with prevalent CHD (P = 1.30 × 10−6, OR = 4.48, 95% confidence interval: 2.44, 8.23; Table 2).
Incident MI and CHD association
Nine cohorts contributed a total of 55,736 participants of European ancestry to the analyses of incident events, where 3,031 incident MI cases (5.4%) were reported during an average of 15.0 years of follow-up and 5,425 incident CHD cases (9.73%) were reported during an average of 15.6 years of follow-up (S4 Table). A total of 9,852 low-frequency autosomal variants were individually tested for association with incident MI, with adjustment of age, sex, and population substructure. The Bonferroni corrected p-value threshold for single variant analysis of incident MI was P = 5.08 × 10−6. No low-frequency variants were significantly associated with incident MI. As previously stated, single variant results for incident CHD are reported in Morrison et al. and are not reported here, but include a significant association between a low-frequency variant in ANGPTL4 and a decreased risk of incident CHD [20].
For the gene-based analyses, we examined 17,574 genes across all autosomal chromosomes for association with incident MI, and the Bonferroni corrected significance level was P = 2.85 × 10−6. The ring finger and CCCH-Type domains 2 gene (RC3H2) on chromosome 9 was significantly associated with incident MI in the burden test (P = 2.99 × 10−6, OR = 0.35, 95% confidence interval = 0.23, 0.55; Table 3) and contained 12 nonsynonymous and one splice-site rare variants (S5 Table). No genes were significantly associated with incident MI using SKAT. For the gene-based analyses of incident CHD, 16,620 genes were evaluated and the Bonferroni significance levels was P = 3.01 × 10−6. Angiopoietin-like 4 (ANGPTL4) on chromosome 19 was significantly associated with incident CHD using SKAT (P = 1.29 × 10−6; Table 3) and contained 10 variants (S5 Table), and no gene was significantly associated using the burden test.
Post MI and CHD mortality analysis
Among the 3,751 MI and CHD cases from six cohorts that contributed to the analysis of all-cause mortality, there were 1,860 all-cause deaths over a mean 10.9 years of follow-up (S6 Table). We examined 9,943 low-frequency autosomal variants in the single variant analysis (Bonferroni corrected significant level of P = 5.03 × 10−6) and 17,574 genes in the gene-based analysis (Bonferroni corrected significant level of P = 2.85 × 10−6). No single variant or gene reached the significance threshold in the analysis of all-cause mortality among survivors of MI or CHD. We examined the significant variants and genes reported in Tables 1–3 for their relationship with mortality following a MI or CHD event, and none of them were significantly associated with all-cause mortality (S7 Table).
Discussion
Our study evaluated genetic susceptibility to MI and CHD in cross-sectional and prospective settings among individuals of European ancestry. We identified one new locus associated with prevalent MI, and also investigated disease risk in the context of gene-based analyses.
Single variant analysis of prevalent cardiovascular outcomes revealed a low-frequency (MAF = 1.64%) intronic variant, rs988583, in PLCL1 significantly associated with increased risk of MI (P = 7.12 × 10−7). In silico replication was conducted by a look up of rs988583 and its association with prevalent MI in the Myocardial Infarction Genetics and CARDIoGRAM exome chip meta-analysis public release [33], and there was no significant association with MI (P = 0.34). A GWAS of MI and coronary artery disease (CAD) in a Saudi Arab population identified an intergenic variant, rs7421388, near PLCL1 associated with CAD (P = 4.31 × 10−6) and replicated in an independent sample of Saudi Arabs (P = 5.37 × 10−7) [34]. In another study of an ethnic Arab population, rs1147169 in PLCL1 was protective against a low level of high density lipoprotein-cholesterol levels (P = 2.87 × 10−7) [35]. In individuals of European ancestry, rs988583 and rs1147169 are in linkage equilibrium (R2 = 0.0043). In addition to these studies, PLCL1 has been implicated in coronary artery aneurysm in Kawasaki disease and PLCL1 might play a role in the regulation of vascular endothelial cell inflammation via interference with proinflammatory cytokine expression [36].
A burden test aggregating low-frequency and rare coding variants in genes showed a significant positive association between TMPRSS5 and prevalent MI (P = 2.59 × 10−6) and LDLRAD1 and prevalent CHD (P = 1.30 × 10−6), and a significant protective association between RC3H2 and incident MI (P = 2.99 × 10−6). A significant association between ANGPTL4 and incident CHD was identified using SKAT (P = 1.29 × 10−6). The relationship between ANGPTL4 and CHD has been previously reported, with the rs116843064 missense variant playing a major role in reducing lipid levels and risk of CHD [33, 37]. Serine proteases, such as TMPRSS5, are known to be involved in many physiological and pathological processes, and TMPRSS5 has been implicated in impaired hearing function [38]. Little is known about LDLRAD1, with most marked gene expression in lung and fallopian tube [39], and a rare variant in this gene has been associated with breast cancer [40]. Roquin-2 is encoded by RC3H2 and has been shown to play a key role in posttranscriptional regulation of autoimmunity and inflammatory response [41]. Each of these genes associated with prevalent or incident cardiovascular outcomes has rare and low-frequency variants underlying the gene burden tests (S3 and S5 Tables). We identified 11 putative driving variants of these gene-based associations (i.e. those with p<0.05 in S3 and S5 Tables; rs201233178, rs200417674, and rs116913282 in TMPRSS5; rs150560713, rs202234131, rs142900519, and rs76122098 in LDLRAD1; rs201920127, rs144714368, and rs199901510 in RC3H2; and rs116843064 in ANGPTL4). An in silico replication was not possible due to the rare frequency of these coding variants and their absence in the public release of the Myocardial Infarction Genetics and CARDIoGRAM exome chip meta-analysis or the analysis of CAD in the UK Biobank and the UK Biobank and CARDIoGRAMplusC4D meta-analysis [10, 33]. However, it is important to note that rs116843064 of ANGPTL4 is the same variant found in the single variant analysis conducted for incident CHD by Morrison et al., and this gene is likely to be driving the significant association found in the SKAT analysis of incident CHD [20]. It is of interest that the effect size of the gene-based tests (Tables 2 and 3) are larger than the single variant test effect size (Table 1). This shows that aggregate tests of rare variants indeed have a larger effect on disease outcomes, although there remains some scientific debate regarding the utility of conducting aggregate tests of rare variants.
Although there was no statistically significant result found for all-cause mortality after MI or CHD, after accounting for multiple testing, the protective direction of effect for the mortality results suggests that genetic variants might contribute differently in various stages of disease manifestation. Given the limited statistical power of our findings for post-event survival, our study supports the need for substantially larger well-phenotyped cohorts to differentiate effects of variants associated with CHD from post-event mortality. Another limitation that may reside within mortality analysis is that there may have been a presence of index event bias, which arises from selecting a population on the basis of a prior, or an “index” event [42]. It is possible that due to this selection bias, individuals with MI or CHD presented modestly lower rate of mortality compared to those without disease endpoints. This is a major challenge affecting prognosis research. Several approaches are under development that aim to mitigate this type of bias, such as the “Slope-Hunter” method proposed by Mahmoud et al [43]. This method utilizes clustering technique to ultimately identify variants that only affect prognosis, and also to find estimated adjustment factors by identifying variants that affect incidence. With more implementation in the future, this may be suitable to be applied in future investigation. Also, generally the loci identified for prevalent disease were not the same as those identified for incident disease (S7 Table), as has been observed in previous studies [9]. A possible explanation for these observed differences is that genetic studies of cardiovascular diseases are usually conducted with the cross-sectional study design, which has the potential to oversample participants with longer post-event survival [44] and the results do not always replicate in the prospective studies for disease onset and vice versa [9].
An advantage of this study is that within the setting of the CHARGE Consortium we are able to evaluate and make comparisons between cross-sectional and prospective study designs, and to investigate all-cause mortality following cardiovascular events. There are differing, but overlapping, sample sizes across the various study designs: 27,349 participants from seven cohorts for prevalent outcomes, 55,736 participants from nine cohorts for incident outcomes, and 3,751 MI and CHD cases from six cohorts that contributed to the analysis of all-cause mortality. These differing sample sizes influence our power to detect associations, and inferences about similarities and differences across study designs could be due to biological differences or differences in sample sizes. This investigation of low-frequency and rare variants was limited to the variants included on the genotyping platforms (HumanHap300 Duo+ and HumanExome BeadChip, v1.0–1.2, Illumina, Inc., San Diego, CA) and was also limited to individuals of European ancestry. Additionally, although the variants on the genotyping platform and included in our gene-based tests were enriched for coding variants predicted to be causal, we cannot attribute causality to the variants or genes with novel associations. A strength of this study is that the quality of rare variant genotype calling was maximized by the joint clustering performed within CHARGE on thousands of samples [21].
In conclusion, this study comprehensively evaluated the relationship between autosomal genetic variation and prevalent and incident cardiovascular outcomes in participants of European ancestry in the context of the CHARGE consortium. We identified one known locus (ANGPTL4) and four new loci (PLCL1, RC3H2, TMPRSS5, and LDLRAD1) associated with cardiovascular disease risk that warrant further investigation.
Supporting information
S1 Document. Characteristics of the participating cohorts.
https://doi.org/10.1371/journal.pone.0230035.s001
(DOCX)
S1 Table. Study participants’ characteristics for prevalent MI and CHD analysis.
https://doi.org/10.1371/journal.pone.0230035.s002
(XLSX)
S2 Table. Cohort specific summary statistics for low-frequency variant associated with prevalent MI.
https://doi.org/10.1371/journal.pone.0230035.s003
(XLSX)
S3 Table. Low-frequency and rare variants underlying top signals from gene-based analysis for prevalent events.
https://doi.org/10.1371/journal.pone.0230035.s004
(XLSX)
S4 Table. Study participants’ characteristics for incident MI and CHD analysis.
https://doi.org/10.1371/journal.pone.0230035.s005
(XLSX)
S5 Table. Low-frequency and rare variants underlying top signals from gene-based analysis for incident events.
https://doi.org/10.1371/journal.pone.0230035.s006
(XLSX)
S6 Table. Study participants’ characteristics for post MI and CHD mortality analysis.
https://doi.org/10.1371/journal.pone.0230035.s007
(XLSX)
S7 Table. Prevalent and incident findings in relation to corresponding outcomes.
https://doi.org/10.1371/journal.pone.0230035.s008
(XLSX)
Acknowledgments
The authors are grateful to the study participants, the researchers, and the staff at each study site for their contributions.
References
- 1. Benjamin EJ, Muntner P, Alonso A, Bittencourt MS, Callaway CW, Carson AP, et al. Heart Disease and Stroke Statistics—2019 Update: A Report From the American Heart Association. Circulation. 2019;139(10):e56–e528. pmid:30700139
- 2. Anderson KM, Odell PM, Wilson PWF, Kannel WB. Cardiovascular disease risk profiles. American Heart Journal. 1991;121(1, Part 2):293–8. pmid:1985385
- 3. Swerdlow DI, Holmes MV, Harrison S, Humphries SE. The genetics of coronary heart disease. British Medical Bulletin. 2012;102(1):59–77.
- 4. O’Donnell CJ, Nabel EG. Genomics of Cardiovascular Disease. New England Journal of Medicine. 2011;365(22):2098–109. pmid:22129254
- 5. Lloyd-Jones DM, Nam BH, D’Agostino RB, Levy D, Murabito JM, Wang TJ, et al. Parental cardiovascular disease as a risk factor for cardiovascular disease in middle-aged adults: a prospective study of parents and offspring. JAMA. 2004;291(18):2204–11. pmid:15138242
- 6. Marenberg ME, Risch N, Berkman LF, Floderus B, de Faire U. Genetic Susceptibility to Death from Coronary Heart Disease in a Study of Twins. New England Journal of Medicine. 1994;330(15):1041–6. pmid:8127331
- 7. Murabito JM, Pencina MJ, Nam B-H, D’Agostino RB, Wang TJ, Lloyd-Jones D, et al. Sibling Cardiovascular Disease as a Risk Factor for Cardiovascular Disease in Middle-aged Adults. JAMA. 2005;294(24):3117–23. pmid:16380592
- 8. Dai X, Wiernek S, Evans JP, Runge MS. Genetics of coronary artery disease and myocardial infarction. World journal of cardiology. 2016;8(1):1–23. pmid:26839654
- 9. Dehghan A, Bis JC, White CC, Smith AV, Morrison AC, Cupples LA, et al. Genome-Wide Association Study for Incident Myocardial Infarction and Coronary Heart Disease in Prospective Cohort Studies: The CHARGE Consortium. PloS one. 2016;11(3):e0144997–e. pmid:26950853
- 10. van der Harst P, Verweij N. Identification of 64 Novel Genetic Loci Provides an Expanded View on the Genetic Architecture of Coronary Artery Disease. Circulation research. 2018;122(3):433–43. pmid:29212778
- 11. Klarin D, Zhu QM, Emdin CA, Chaffin M, Horner S, McMillan BJ, et al. Genetic analysis in UK Biobank links insulin resistance and transendothelial migration pathways to coronary artery disease. Nature genetics. 2017;49(9):1392–7. pmid:28714974
- 12. Nikpay M, Goel A, Won H-H, Hall LM, Willenborg C, Kanoni S, et al. A comprehensive 1000 Genomes–based genome-wide association meta-analysis of coronary artery disease. Nature Genetics. 2015;47:1121. pmid:26343387
- 13. Nelson CP, Goel A, Butterworth AS, Kanoni S, Webb TR, Marouli E, et al. Association analyses based on false discovery rate implicate new loci for coronary artery disease. Nature Genetics. 2017;49:1385. pmid:28714975
- 14. Natarajan P, Bis JC, Bielak LF, Cox AJ, Dörr M, Feitosa MF, et al. Multiethnic Exome-Wide Association Study of Subclinical Atherosclerosis. Circulation: Cardiovascular Genetics. 2016;9(6):511–20. pmid:27872105
- 15. Yu B, Pulit Sara L, Hwang S-J, Brody Jennifer A, Amin N, Auer Paul L, et al. Rare Exome Sequence Variants in CLCN6 Reduce Blood Pressure Levels and Hypertension Risk. Circulation: Cardiovascular Genetics. 2016;9(1):64–70. pmid:26658788
- 16. Flannick J, Mercader JM, Fuchsberger C, Udler MS, Mahajan A, Wessel J, et al. Exome sequencing of 20,791 cases of type 2 diabetes and 24,440 controls. Nature. 2019;570(7759):71–6. pmid:31118516
- 17. Liu DJ, Peloso GM, Yu H, Butterworth AS, Wang X, Mahajan A, et al. Exome-wide association study of plasma lipids in >300,000 individuals. Nature Genetics. 2017;49:1758. pmid:29083408
- 18. Warren HR, Evangelou E, Cabrera CP, Gao H, Ren M, Mifsud B, et al. Genome-wide association analysis identifies novel blood pressure loci and offers biological insights into cardiovascular risk. Nature genetics. 2017;49(3):403–15. pmid:28135244
- 19. Stein EA, Mellis S, Yancopoulos GD, Stahl N, Logan D, Smith WB, et al. Effect of a Monoclonal Antibody to PCSK9 on LDL Cholesterol. New England Journal of Medicine. 2012;366(12):1108–18. pmid:22435370
- 20. Morrison AC, Fu Y, O’Donnell C. Variants in ANGPTL4 and the Risk of Coronary Artery Disease. New England Journal of Medicine. 2016;375(23):2303–6. pmid:27959691
- 21. Grove ML, Yu B, Cochran BJ, Haritunians T, Bis JC, Taylor KD, et al. Best Practices and Joint Calling of the HumanExome BeadChip: The CHARGE Consortium. PLOS ONE. 2013;8(7):e68095. pmid:23874508
- 22. Goldstein JI, Crenshaw A, Carey J, Grant GB, Maguire J, Fromer M, et al. zCall: a rare variant caller for array-based genotyping: genetics and population analysis. Bioinformatics (Oxford, England). 2012;28(19):2543–5. pmid:22843986
- 23. Liu X, Jian X, Boerwinkle E. dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions. Human mutation. 2011;32(8):894–9. pmid:21520341
- 24. Liu X, Jian X, Boerwinkle E. dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations. Human mutation. 2013;34(9):E2393–E402. pmid:23843252
- 25. Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X. Rare-variant association testing for sequencing data with the sequence kernel association test. American journal of human genetics. 2011;89(1):82–93. pmid:21737059
- 26. Li B, Leal SM. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. American journal of human genetics. 2008;83(3):311–21.
- 27. Conti E, Izaurralde E. Nonsense-mediated mRNA decay: molecular insights and mechanistic variations across species. Current Opinion in Cell Biology. 2005;17(3):316–25. pmid:15901503
- 28. Firth D. Bias Reduction of Maximum Likelihood Estimates. Biometrika. 1993;80(1):27–38.
- 29. Heinze G, Schemper M. A solution to the problem of separation in logistic regression. Statistics in Medicine. 2002;21(16):2409–19. pmid:12210625
- 30. Wang X. Firth logistic regression for rare variant association tests. Frontiers in Genetics. 2014;5(187).
- 31. Willer CJ, Li Y, Abecasis GR. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics (Oxford, England). 2010;26(17):2190–1. pmid:20616382
- 32.
Voorman A, Brody JB, Chen H, Lumley T, Davis B. seqMeta: Meta-Analysis of Region-Based Tests of Rare DNA Variants. R package version 1.6.7. 2017 https://CRAN.R-project.org/package=seqMeta.
- 33. Stitziel NO, Stirrups KE, Masca NGD, Erdmann J, Ferrario PG, König IR, et al. Coding Variation in ANGPTL4, LPL, and SVEP1 and the Risk of Coronary Disease. The New England journal of medicine. 2016;374(12):1134–44. pmid:26934567
- 34. Wakil SM, Ram R, Muiya NP, Mehta M, Andres E, Mazhar N, et al. A genome-wide association study reveals susceptibility loci for myocardial infarction/coronary artery disease in Saudi Arabs. Atherosclerosis. 2016;245:62–70. pmid:26708285
- 35. Wakil SM, Ram R, Muiya NP, Andres E, Mazhar N, Hagos S, et al. A common variant association study reveals novel susceptibility loci for low HDL-cholesterol levels in ethnic Arabs. Clinical Genetics. 2016;90(6):518–25. pmid:26879886
- 36. Lin Y-J, Chang J-S, Liu X, Tsang H, Chien W-K, Chen J-H, et al. Genetic variants in PLCB4/PLCB1 as susceptibility loci for coronary artery aneurysm formation in Kawasaki disease in Han Chinese in Taiwan. Scientific reports. 2015;5:14762-. pmid:26434682
- 37. Folsom AR, Peacock JM, Demerath E, Boerwinkle E. Variation in ANGPTL4 and risk of coronary heart disease: the Atherosclerosis Risk in Communities Study. Metabolism: clinical and experimental. 2008;57(11):1591–6. pmid:18940399
- 38. Guipponi M, Toh M-Y, Tan J, Park D, Hanson K, Ballana E, et al. An integrated genetic and functional analysis of the role of type II transmembrane serine proteases (TMPRSSs) in hearing loss. Human Mutation. 2008;29(1):130–41. pmid:17918732
- 39.
Gene expression for LDLRAD1 (ENSG00000203985.10) [Internet]. https://gtexportal.org/home/gene/LDLRAD1.
- 40. Haiman CA, Han Y, Feng Y, Xia L, Hsu C, Sheng X, et al. Genome-wide testing of putative functional exonic variants in relationship with breast and prostate cancer risk in a multiethnic population. PLoS genetics. 2013;9(3):e1003419–e. pmid:23555315
- 41. Pratama A, Ramiscal Roybel R, Silva Diego G, Das Souvik K, Athanasopoulos V, Fitch J, et al. Roquin-2 Shares Functions with Its Paralog Roquin-1 in the Repression of mRNAs Controlling T Follicular Helper Cells and Systemic Inflammation. Immunity. 2013;38(4):669–80. pmid:23583642
- 42. Dahabreh IJ, Kent DM. Index event bias as an explanation for the paradoxes of recurrence risk research. JAMA. 2011;305(8):822–3. pmid:21343582
- 43.
Mahmoud O, Dudbridge F, Smith GD, Munafo M, Tilling K. Slope-Hunter: A robust method for index-event bias correction in genome-wide association studies of subsequent traits. bioRxiv. 2020:2020.01.31.928077.
- 44. Anderson CD, Nalls MA, Biffi A, Rost NS, Greenberg SM, Singleton AB, et al. The effect of survival bias on case-control genetic association studies of highly lethal diseases. Circulation Cardiovascular genetics. 2011;4(2):188–96.