Introduction

Autoimmune diseases affect up to 5% of the population and are among the 10 most common causes of death in women.1 They are characterized by the loss of self-tolerance causing immune-mediated tissue destruction.2 Autoimmune Addison's disease (AD) occurs due to autoimmune destruction of the adrenal cortex. Approximately half of the patients have additional autoimmune components.3 AD with concomitant thyroid autoimmunity and/or type I diabetes is the most common syndrome referred to as autoimmune polyendocrine syndrome type II (APS II).4 It has long been suspected that several of the genetic determinants may be identical in different autoimmune diseases. This hypothesis is based upon observations of clustering of different autoimmune diseases in families,5 as well as overlapping linkage peaks in meta-analyses of families with different autoimmune diseases.6 There is also molecular genetic evidence to support this; for example, human leukocyte antigen (HLA) complex genes7 and cytotoxic T-lymphocyte-associated protein 4 are associated with numerous autoimmune conditions.8, 9

Tyrosine-protein phosphatase non-receptor type 22 encoded by the PTPN22 gene has recently been claimed to be the second most important genetic susceptibility factor in autoimmune diseases, after the HLA gene complex. PTPN22 encodes a tyrosine phosphatase, also called Lyp, which modulates T-cell activation and signaling events in a way that may trigger development of autoimmune disease.10 A C to T substitution in nucleotide position 1858, which results in the substitution of arginine (R) with tryptophan (W) (R620W), has been suggested to be a gain-of-function variant.10, 11 The T-allele of this single-nucleotide polymorphism (SNP) has been demonstrated to moderately increase the susceptibility to several autoimmune disorders, for example, type I diabetes,12, 13, 14 rheumatoid arthritis,15 Graves’ disease,14, 16 systemic lupus erythematosus,17 and juvenile idiopathic arthritis.18 These associations were confirmed in a meta-analysis by Lee et al.19 Two smaller studies on PTPN22 1858CT and AD have, on the other hand, shown diverging results. Velaga et al16 reported association (odds ratio (OR) for the T-allele equal to 1.69; 95% confidence interval (CI): 1.04–2.73, P=0.031) based on genotyping of 104 patients from northeast England,whereas Kahles et al20 found no association (OR 1.03; 95% CI: 0.63–1.67, P=0.913) in 121 German patients. Sample size is crucial in studies of genetic factors with moderate risk, and with the small studies performed on PTPN22 in AD; it is not surprising that the results are inconsistent.

For this reason, we decided to undertake a larger and statistically more powerful study encompassing a total of 368 AD patients and 1128 healthy controls from Norway and UK to determine whether the PTPN22 1858CT variant is associated with AD. Three hundred and two of the Norwegian patients had a verified autoimmune etiology, either by the detection of autoantibodies in their blood or by a secondary autoimmune diagnosis, and were, therefore, of special interest. A meta-analysis of Caucasian cohorts was performed to elucidate the risk of this SNP across populations. Furthermore, in light of the extreme proneness to autoimmunity observed in AD patients, we hypothesized that there may also be other, as yet unidentified, rare variants in this gene, which may predispose to AD. The involvement of infrequent mutations would also be in line with AD, being a rare autoimmune disorder compared to the other diseases previously analyzed for PTPN22 polymorphisms. Being a rare disorder, the common variant–common disease hypothesis,21 which postulates that most genetic susceptibility factors in common complex disorders are common variants, is less likely to be applicable. Previous screenings for mutations throughout the PTPN22 gene are, to our knowledge, limited to 94 type I diabetes patients,22 64 parents of type I diabetes patients,23 20 psoriasis patients,24 48 rheumatoid arthritis patients,25 and 35 healthy controls.26 To this end we resequenced the coding part of the gene in 332 Norwegian patients and 112 healthy controls to clarify if there were additional coding variants in PTPN22 that may predispose for AD. To our knowledge, this is the most comprehensive sequence analysis of PTPN22 performed in a study of any autoimmune disease.

Materials and methods

Patients and controls

Three hundred and thirty-two Caucasian patients with AD (206 women, 126 men) were recruited from the Norwegian registry of organ-specific autoimmune diseases (Supplementary Table 1). The mean age of the patients was 53 years (range 15–92 years), the mean age at diagnosis was 36 years (range 1–86 years) and the mean duration of AD was 18 years (range 0–54 years). Either low basal serum cortisol together with high adrenocorticotropic hormone (ACTH) or a pathological ACTH stimulation test was used to diagnose adrenal insufficiency. In the total cohort of men and women, 128 patients (39%) had isolated AD and 177 (53%) had APS II (all having AD concomitant with thyroid disease and/or type I diabetes). Autoantibodies against 21-hydroxylase (21OH) were present in 266/328 (80%) of the patients (four had unknown 21OH autoantibody status), 89 of them having isolated AD. Of the remaining 66 patients, 9 had antibodies against side-chain cleavage enzyme (SCC) or glutamic acid decarboxylase (GAD), which indicate an autoimmune etiology. Another 27 (8%) had other autoimmune manifestations (eg vitiligo, pernicious anemia, alopeci and premature gonadal failure) also indicating autoimmune etiology of AD. Only 30 patients, all with isolated AD, lacked autoantibodies, had unknown autoantibody status, no associated autoimmune features or other known causes of adrenal failure. Thus, the vast majority of patients (302 of 332) were interpreted to have autoimmune AD. Patients with AD as a component of autoimmune polyendocrine syndrome type I were excluded. A total of 990 anonymous healthy Norwegian blood donors, none of whom had AD were used as controls. The 36 British AD cases and 138 healthy controls were recruited by University of Sheffield as described elsewhere.27 Informed consent has been obtained by all participants.

Measurement of autoantibodies

Antibodies against 21OH, GAD and SCC were assayed by a radioimmunoassay against 35S-labeled recombinant human autoantigens expressed by in vitro transcription and translation, as described by Ekwall et al.28 Positivity for the assays was governed by whether the test absorbance was greater than 3 SD above the negative control value.

Single-nucleotide polymorphism genotyping

The 1858CT (rs2476601) SNP was genotyped by standard conditions using a Custom TaqMan® SNP Genotyping Assay, and analyzed on ABI PRISM® 7900HT Sequence Detection System (Applied Biosystems, Foster City, CA, USA). Secondary to the mutation screening, the same technique was used to analyze five of the missense variants in an extended number of controls to define their frequency in the general population. Direct sequencing, as described below, was performed for the same purpose for the other two missense variants.

Mutation screening

The BigDye® terminator v3.1 chemistry (Applied Biosystems) was used for bidirectional sequencing of all 21 exons of the PTPN22 gene, in addition to the 5′UTR, 3′UTR and intron–exon boundaries in all 332 Norwegian AD patients and 112 anonymous Norwegian blood donors (randomly chosen from the 990 blood donors analyzed for the SNP 1858CT). Primers were designed using Primer3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi),29 and optimized with 60°C annealing temperature to enable the polymerase chain reactions (PCRs) to be run simultaneously in 5 μl reactions in 384-well plates. PCR clean-up was performed with Exo-SAP (USB Corporation, Cleveland, OH, USA), while the sequence reactions were purified using Montage™ SEQ384 filter plates on a vacuum manifold (Millipore, Bedford, MA, USA) according to recommendations from the manufacturers. Sequences were read by an ABI3730 DNA analyzer and analyzed using the SeqScape v2.1.1 software (Applied Biosystems). Unsuccessful sequences were redone obtaining 100% coverage. All identified novel coding variants were assessed for functional importance using PolyPhen (http://genetics.bwh.harvard.edu/pph/),30 a bioinformatic tool for prediction of possible impact of an amino acid substitution of a protein, based on both phylogenetic and structural information. Any changes in splicing-site motifs due to nucleotide substitutions close to exon–intron junctions were predicted by the use of ESEfinder v3.0 (http://rulai.cshl.edu/cgi-bin/tools/ESE3/esefinder.cgi).31, 32

Statistical analysis

Power calculation (http://pngu.mgh.harvard.edu/~purcell/gpc/) for the association study of SNP 1858CT in the Norwegian AD patients was based on the OR of 1.69 reported by Velaga et al,16 a risk allele frequency of 0.116,18 cases (n=332), controls (n=990) and significance level 0.05. This gave a statistical power of 88.8%. Comparisons of allele, genotype and haplotype frequencies in cases and controls were performed by Pearson χ2 analysis using SPSS v14.0, the same program that was used for calculation of OR. Haplotype predictions were done by the software PHASE v2.1,33 which has been reported to be the most accurate algorithm for haplotype construction with a 5.2–5.9% error rate in phasing the genotypes of unrelated individuals.34 Hardy–Weinberg equilibrium was checked by the web-based tool at http://ihg.gsf.de/cgi-bin/hw/hwa1.pl.

Meta-analysis

PubMed (http://www.ncbi.nlm.nih.gov/) was searched using the search terms ‘PTPN22’, ‘Lyp’ and ‘Addison's’. All studies found, who gave genotype data for 1858CT, were included in a meta-analysis together with our own data from the present study. The two English cohorts were recruited from different geographical areas. Breslow–Day homogeneity of OR analysis and Mantel–Haenszel common OR estimation were performed using SPSS v.14.0.

Results

Association analyses of the PTPN22 SNP 1858CT

In the Norwegian population, we found a significant association between the PTPN22 1858T-allele and AD patients with known autoimmune etiology (n=302) with an OR 1.39 (95% CI: 1.06–1.82, P=0.016). Including all Norwegian AD patients (n=332) lowered the risk to an OR of 1.29 (95% CI: 0.99–1.68, P=0.055). Genotype frequencies of both cases and controls were in Hardy–Weinberg equilibrium. Genotype and allele frequency distributions are listed in Table 1. To confirm this association across populations, we performed meta-analysis with the other available genotyping data. Our meta-analysis included our Norwegian autoimmune AD patients and the Norwegian controls, a British population (recruited from the Sheffield district) and genotype results from two populations published earlier, a German20 and a British (recruited from the Newcastle upon Tyne district).16 Breslow–Day analysis did not detect any significant heterogeneity of OR (P=0.554) between the populations. The meta-analysis confirmed the association of the 1858T risk allele with AD with a common OR of 1.36 (95% CI: 1.11–1.66, P=0.003) (Table 2).

Table 1 Genotype and allele frequency distributions of the PTPN22 SNP 1858CT in the Norwegian and UK datasets
Table 2 Meta-analysis of the PTPN22 SNP 1858CT

Mutation screening of PTPN22

The mutation screening of the PTPN22 gene revealed eight missense variants, of which five have not been reported previously (Table 3). Moreover, ten novel non-coding variants were found (Supplementary Table 2). An additional novel variant was detected in the 3′UTR. Three intronic variants were positioned within four bases away from an exon–intron junction, and might, therefore, affect splicing. We only had access to leukocyte RNA from one patient with a nucleotide change close to an exon-intron junction; the c.197−4 in intron 2 adjacent to exon 3. Analysis to elucidate if this variant caused skipping of the exon 3 did, however, not show such a consequence in this patient's leukocytes (data not shown). All novel base changes were rare variants with a minor allele frequency less than 1% in the normal population. Segregation of the variants in the patients’ families could not be investigated since samples from family members were unavailable.

Table 3 PTPN22 missense variants detected in Norwegian AD patients and Norwegian healthy control individuals

Eight known non-coding or 3′UTR SNPs were located inside the sequenced regions thereby providing genotypes in the sequenced individuals. These were rs3789612, rs1217419, rs1217418, rs1599971, rs3761935, rs34209542, rs1217412 and rs3811021. They were all in Hardy–Weinberg Equilibrium. Neither of these SNPs showed significant difference in genotype or allele distributions between the patient group and control group using standard χ2 tests. Haplotypes of all SNPs with minor allele frequency (MAF) >5%, that is, excluding rs3789612 and rs34209542, were constructed for both cases and controls (Supplementary Table 3). As noted earlier, the 1858T allele was observed to occur on a single common haplotype.22, 25 These data show that the 112 controls, randomly selected for sequencing, have an unusually high frequency of the 1858T variant (14.3%) compared to the wider population (10.8%). Neither of the haplotypes displayed any association with AD in the dataset where we had haplotype data.

Discussion

Our data comparing Norwegian AD patients and healthy controls demonstrated a significant association of autoimmune AD to the PTPN22 1858T allele, underpinning the coupling of autoimmunity to this SNP. The meta-analysis comprising European populations provides further support that this is a real association. An earlier performed meta-analysis may have suffered from an inadequate sample size to detect this moderate contribution of risk.19 Moreover, these results let autoimmune AD fall into the line of autoimmune diseases previously shown to be associated with this SNP.

We screened the PTPN22 gene to clarify if there are any common unknown variants or any rare mutations in PTPN22 that may predispose for AD. A total of 28 variations were found, 16 of them were novel, including 5 missense, 1 in the 3′UTR and 10 located in introns. This represents the largest screening for PTPN22 mutations in any disease, and also included 112 healthy controls. Four of the seven missense variants listed in Table 3 were predicted to be damaging to the protein function by PolyPhen.30 Interestingly, the frequency of missense variants predicted to be damaging were slightly higher in the patient group compared to the controls. Conversely, there was an opposite weak tendency for variants predicted to be benign to be more frequent in the controls. The main contribution came from R263Q that was detected by the heterozygosity rates of 2.4 and 3.9% in Norwegian AD cases and controls, respectively.

The three non-coding variants located closest to an exon–intron junction, that is, c.197−4, c.828+4 and c.2882−2 were subjected to in silico analysis for prediction of splice sites and binding of serine/arginine-residue proteins (SR proteins) using the program ESEfinder v3.0.31 Compared to the respective wild-type sequence, the nucleotide substitutions were predicted to have the following motif changes: the mutated allele c.197−4G had two additional predicted SR protein-binding sites (SC35 and SRp55), and the mutant variant c.2282−2C lacked a 3′U2 splicing site. These predicted differences of binding of splicing factors in the two PTPN22 variants may suggest that they can affect normal splicing. In vitro analysis of c.197−4 did, however, not prove any exon skipping in leukocytes, but we cannot rule out that the variant may influence gene splicing in other tissues. However, any pathogenic effect of these variants is, therefore, at present speculative, and needs to be confirmed functionally.

PTPN22 is a downregulator of the immune system by turning off the T-cell receptor (TCR) signaling when c-src tyrosine kinase (CSK) is bound to the SH3-domain of PTPN22. The PTPN22 1858T allele has been suggested to be a gain-of-function variant by increasing the enzyme activity,11 and also by disrupting the binding of CSK to PTPN22.12, 15 Two explanations of the increased risk of autoimmunity in carriers of this allele have been proposed: they may have a higher TCR-signaling threshold and require stronger TCR signals for activation, thereby introducing a relative resistance to thymic deletion of autoreactive T cells,11 or they may have reduced TCR signaling, leading to less effective T-regulatory cell activity.35 The effect of the rare variants detected in AD patients in the present study on the function of PTPN22 is difficult to predict. Some of the variants may modulate catalytic function, others the binding to CSK or other potential binding partners,36 with a potential result of affected TCR signaling. We examined family history of individuals with rare PTPN22 variants; however, no firm conclusions can be drawn, largely owing to the small number of individuals involved (data not shown).

Focusing on the spectrum of variations (Supplementary Figure 1) detected in PTPN22 in this study 10 of the novel variants (63%) were singletons (ie heterozygotes observed in a single individual) and the rest of the novel variants were all rare (ie MAF <5%), demonstrating the fact that the number of rare SNPs increases with depth of sequencing, whereas the number of common variants will plateau as most of them are already identified.37 The sequencing of PTPN22 in a total of 888 chromosomes revealed a two-fold density of variations in non-coding sequence of PTPN22 compared to the coding sequence of the gene, with a density of one variation per 166 and 333 bp, respectively. Even though our study and the HapMap ENCODE project diverge on the basis of selection of sample population and selection of sequence to be screened, it is noteworthy that our frequency variation spectrum and SNP density (compared to their overall SNP-density of one per 279 bp) are very similar. Anyway, our data comprise a large sample size for mutation screening of a single locus and may, therefore, contribute to our understanding of SNP density in genes.

To conclude, this study adds autoimmune AD to the list of autoimmune diseases, where PTPN22 1858T is a genetic susceptibility factor. The comprehensive resequencing of PTPN22 did not demonstrate any novel common coding SNP predisposing to AD in Norwegian patients, but our findings show that a number of novel infrequent or private mutations in PTPN22 do occur, and bioinformatic analyses suggest that they may affect PTPN22 function and possibly susceptibility to AD. However, their contribution to the AD phenotype needs to be further elucidated by functional studies.