Search tips
Search criteria 


Logo of annrheumdAnnals of the Rheumatic DiseasesVisit this articleSubmit a manuscriptReceive email alertsContact usBMJ
Ann Rheum Dis. 2007 May; 66(5): 683–686.
Published online 2006 December 14. doi:  10.1136/ard.2006.060459
PMCID: PMC1954621

Investigation of genetic variation across the protein tyrosine phosphatase gene in patients with rheumatoid arthritis in the UK



To investigate single‐nucleotide polymorphisms (SNPs) across the PTPN22 gene region in a UK cohort of patients with rheumatoid arthritis (RA), to look for evidence of disease associations independent of the well‐characterised R620W variant (rs2476601).


951 RA cases in the UK satisfying American Rheumatism Association (ARA) criteria and 448 population controls were genotyped for 11 SNPs across the PTPN22 gene region using the Sequenom MassArray MassEXTEND technology. Allele, genotype and estimated haplotype frequencies of cases and controls were compared.


In addition to the R620W (rs2476601) SNP, three SNPs were associated with RA in this study. The sole haplotype on which the associated T allele of R620W occurred was associated with RA; no other haplotypes showed a significant difference in frequencies between RA cases and controls.


In contrast with a study of American patients with RA no evidence of association with PTPN22 independent of the well‐characterised R620W variant was found, suggesting that in these patients this variant alone explains the association with the PTPN22 gene.

The importance of both genetic and environmental risk factors for common human diseases is well accepted and different approaches are being used to identify and characterise how such factors influence the disease process. For diseases with an autoimmune pathology, such as rheumatoid arthritis (RA), the association with genes in the human leucocyte antigen (HLA) region have been established for many years. Although this region seems to harbour the major susceptibility locus for many of these diseases, it is also recognised that other non‐HLA genes may influence many aspects of the disease process from susceptibility to severity and outcome to treatment responsiveness.

A large number of candidate genes have been investigated for autoimmune diseases and many associations reported; however, the vast majority of these have turned out to be false‐positive results that have not been replicated in subsequent investigations. The finding that emerged recently of an autoimmune susceptibility polymorphism in the protein tyrosine phosphatase gene (PTPN22) is, however, quite different. A functional polymorphism (R620W) in PTPN22, a negative regulator of T‐cell activation, was first associated with type 1 diabetes (T1D) by researchers investigating a number of tyrosine phosphatases.1 At much the same time, a pioneering large‐scale candidate gene investigation in RA screened large numbers of non‐synonymous single‐nucleotide polymorphisms (SNPs), and revealed the strongest association with the same R620W PTPN22 variant as that described in the study by Bottini et al.2 This study contained evidence of replication, as the same association was observed in two large and independent cohorts of RA cases. Subsequently the R620W association has been replicated many times over in cohorts of patients with RA from different countries around the world (reviewed in Hinks et al3). The only failures to replicate have come from Asian populations, in which this particular variant is not seen.4

The significance of the PTPN22 R620W variant extends beyond RA and T1D to a number of other autoimmune and rheumatological conditions, including systemic lupus erythematosus (SLE), juvenile idiopathic arthritis (JIA), autoimmune thyroid disease and Addison's disease. No association has been observed with multiple sclerosis, psoriasis, psoriatic arthritis or Crohn's disease.5 What is perhaps surprising is that, with just two exceptions, all these studies have restricted their investigation of the PTPN22 gene to a single polymorphism (R620W), even those that failed to detect an association with this variant. This approach has been justified by the fact that the R620W appears to have functional significance, in that, the variant form results in reduced binding to c‐src tyrosine kinase (Csk), as shown in both the early RA and T1D studies. It was initially suggested that this would result in loss of negative regulation of T‐cells and lead to a hyper‐responsiveness; however, the most recent experiments seem to suggest quite the opposite; that the 620W form results in a gain of function resulting in enhanced suppression of T‐cell activation.6

The possibility that other polymorphisms in PTPN22 may be important in RA, either independently or in combination with R620W, has been investigated in only one study to date. Carlton et al7 identified 15 novel SNPs in PTPN22 and genotyped these together with an additional 22 SNPs in two cohorts of RA cases and controls. The 620W variant was found to reside on a single haplotype, but this haplotype did not explain fully the difference between RA cases and controls. Two SNPs on a different common haplotype were found to be independently associated with RA, suggesting that the association between PTPN22 and RA may be more complex than was initially realised. However, the novel associations did not reach the levels of statistical significance seen for R620W and require validation in more cohorts. Thus, in this study we have investigated 11 SNPs from across the PTPN22 gene region, including those associated with RA by Carlton et al, in a large cohort of RA cases in the UK, in which we have previously reported association with the PTPN22 R620W variant.8

Materials and methods


DNA was available for 951 RA cases obtained from two main sources: the Arthritis and Rheumatism Council (ARC) National Repository for families with RA, and clinics within the Greater Manchester area of northern England. For patients obtained through the national repository, only one affected case per family was selected at random for investigation. All RA cases had a disease that satisfied the 1987 American College of Rheumatology (ACR) criteria9 modified for genetic studies,10 and 73% were female. Rheumatoid factor (RF) status was ascertained using a particle agglutination test, and a positive result was classified as a titre of 1 in >40. In all, 75% of the cases were RF positive and 67% had erosive joint disease. HLA‐DRB1 genotypes were determined using a commercially available semiautomated PCR‐sequence specific oligonucleotide probe typing technique (INNO‐LiPA; Abbott Laboratories, Maidenhead, UK). It was found that 47% and 31% carried one or two copies of the shared epitope, respectively. This frequency reflects those observed in a hospital‐based series of RA cases.

DNA was available for 448 control subjects, who were recruited from blood donors and from general practitioners (GP) registers. All patients and controls were of UK Caucasoid ethnic origin, were recruited with ethical committee approval and provided informed consent.

SNPs selected for genotyping

As there was some redundancy due to strong linkage disequilibrium (LD) between the 37 SNPs genotyped by Carlton et al,7 we chose to investigate the seven haplotype‐tagging SNPs (SNPs 1, 18, 20, 22, 27, 35 and 36), three SNPs (SNP 2, 32 and 37) that were potentially functional, and one SNP (SNP 21) that showed the strongest allelic association in the Carlton sample set 1. In total, 11 SNPs from the Carlton paper were selected for genotyping.

Genotyping protocol

SNPs were genotyped using the Sequenom MassArray genotyping technology, according to the manufacturer's instructions (‐genotyping.html).

Statistical analysis

All SNPs were tested for Hardy–Weinberg equilibrium in cases and controls. Association of the PTPN22 SNPs was tested using the χ2 test implemented in STATASTATASE V.9.

Pairwise LD measures of D′ and LD correlation coefficient r2 were calculated and plotted on a graph. Haplotypes were inferred using the expectation–maximisation algorithm which was implemented in HelixTree V 4.1(Golden Helix, Bozeman, Montana, USA). Haplotype frequencies were compared between cases and controls using the χ2 test implemented in STATA. Each haplotype was compared with all the others combined.


Allele and genotype frequencies for all the PTPN22 SNPs were in Hardy–Weinberg equilibrium in the RA cohort and in the control population.

As we have shown previously, the rs2476601 T allele was associated with RA (allelewise p = 5×10−8). Three additional SNPs showed evidence of association: rs2488458 (SNP2), rs1217388 (SNP32) and rs1217413 (SNP35) (table 11);); however all these SNPs are in LD with rs2476601 (pairwise r2>0.5) (fig 11),), suggesting that these are not independent associations. This result contrasts with the data presented by Carlton et al, in which the six SNPs in addition to R620W associated with RA (SNPs 2, 18, 21, 27, 32, 35) were not strongly correlated with the R620W variant (r2<0.5).

figure ar60459.f1
Figure 1 Linkage disequilibrium plot across the PTPN22 gene in controls.
Table thumbnail
Table 1 PTPN22 SNP allele frequencies and allelic association with RA

Haplotype analysis

Out of the 10 common haplotypes reported by Carlton et al, eight were identified in the UK RA and control cohorts at frequencies >1% (table 22),), and the most common haplotype in UK cases and controls was haplotype 5, as in the American data. The 620W risk T allele is carried only on haplotype 2, and this was the only haplotype to occur at a different frequency in cases and controls in this study (p = 6.6×10−6). Haplotype 1, which differs from haplotype 2 only at the R620W variant, carrying the non‐associated C allele, is not associated with RA (p = 0.08). Furthermore, unlike the Carlton study, we found no significant differences between the haplotype frequencies of the RA cases and controls other than haplotype 2. In addition, when all chromosomes carrying the risk T allele (W620) were removed and the haplotype frequencies calculated, there were no haplotypes showing a significantly different allele frequency in RA cases compared with controls (data not shown). These observations, together with the fact that the only significantly associated SNPs were those in LD with the R620W variant, suggest that, in the UK population, the association between RA and PTPN22 is fully explained by association with the 620W (T allele) of rs2476601.

Table thumbnail
Table 2 PTPN22 haplotype frequencies for RA cases and controls


The well established association of the R620W variant of PTPN22 with RA and other autoimmune diseases has demonstrated an important role for this gene in determining disease susceptibility. Almost all investigations to date have focused on the R620W SNP (rs2476601) and studies are currently underway to elucidate the mechanism by which this amino acid substitution influences the function of the PTPN22 gene and its role in the disease process. It is, however, clear from many examples that once an association between disease and an allelic variant has been identified, further studies of the gene reveal additional variants, often with a more minor but nonetheless significant role (eg, HLA‐DRB1 and RA).11 The study by Carlton et al suggests this to be the case for PTPN22 and RA; screening of 37 SNPs in the PTPN22 gene region identified two SNPs on a single common haplotype that was associated with RA independently of rs2476601. The effect was, however, weaker than that observed for the R620W SNP and requires validation in independent cohorts.

We investigated 11 SNPs from across the PTPN22 gene region in order to analyse all haplotypes defined by Carlton et al, and attempted to replicate their findings with RA in a cohort in which we have previously observed a strong association with rs2476601. We observed associations with rs2476601 and with three additional SNPS (rs2488458, rs1217388, rs1217413); however, all these SNPs are correlated and the associations are unlikely to be independent. The two independent associations identified by Carlton et al (SNP 27 (rs1310182) and the highly correlated SNP 36 (rs3811021) and SNP 37 (rs3789604)) were not associated with RA in our cohorts (p = 0.59, p = 0.33 and p = 0.36, respectively).

Patterns of LD across the PTPN22 gene region were similar to those previously described, although individual correlation values were higher in the UK cohort. The haplotype frequency distributions were the same in the US and UK cohorts; however, in the UK data, only haplotype 2, characterised by the associated T allele of R620W variant, was associated with RA. Thus in contrast with Carlton et al, in this study, we detected no PTPN22 SNP associations with RA independent of the well‐characterised R620W.

There are no obvious differences between the RA cases used in this study and that of Carlton et al, although no direct comparison is possible based on published data. In both investigations, the cohorts were derived from multicase families and sporadic cases. Our study had 80% power to detect an OR = 1.45 at p<0.05; therefore, we had adequate power to detect an effect similar to that observed by Carlton et al. However, it is possible that a more modest association with the SNPs investigated could have been missed.

SNP screening across the PTPN22 gene region has also been carried out in T1D and psoriasis. In patients with psoriasis, a disease not associated with the 620W variant, association was reported with two haplotypes spanning the two genes adjacent to PTPN22: round sperm basic protein 1 (RSBP1) and the neighbouring gene putative homeodomain transcription factor 1 (PHTF1).12 Interestingly rs3811021 (SNP36) and rs3789604 (SNP37) lie in the 3′untranslated region (3′UTR) of PTPN22 and 5′ of the neighbouring gene, RSBP1, respectively. A recent systematic screen of SNPs across the PTPN22 gene region in Asian patients with T1D, who did not have the R620W polymorphism, revealed association with a promoter SNP (rs2488457). This SNP is in LD with the R620W variant in Caucasians and more strongly associated with diabetes, leading the authors to speculate that this might be the causal variant.4

Although we found no association with PTPN22 SNPs in UK patients with RA independent of the R620W variant, it is intriguing that the region around PTPN22 may harbour additional disease‐associated alleles, particularly in light of the conclusion that the R620W variant does not fully explain the linkage peak on chromosome 1p13 in the North American Rheumatoid Arthritis Consortium (NARAC) RA‐affected sibling pair families.13


HLA - human leucocyte antigen

LD - linkage disequilibrium

PTPN22 - protein tyrosine phosphatase gene

RA - rheumatoid arthritis

RF - rheumatoid factor

SNP - single‐nucleotide polymorphism

T1D - type 1 diabetes


Funding: This work was funded by the Arthritis Research Campaign.


1. Bottini N, Musumeci L, Alonso A, Rahmouni S, Nika K, Rostamkhani M. et al A functional variant of lymphoid tyrosine phosphatase is associated with type I diabetes. Nat Genet 2004. 36337–338.338 [PubMed]
2. Begovich A B, Carlton V E, Honigberg L A, Schrodi S J, Chokkalingam A P, Alexander H C. et al A missense single‐nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis. Am J Hum Genet 2004. 75330–337.337 [PubMed]
3. Hinks A, Worthington J, Thomson W. The association of PTPN22 with rheumatoid arthritis and juvenile idiopathic arthritis. Rheumatology (Oxford) 2006. 45365–368.368 [PubMed]
4. Kawasaki E, Awata T, Ikegami H, Kobayashi T, Maruyama T, Nakanishi K. et al Systematic search for single nucleotide polymorphisms in a lymphoid tyrosine phosphatase gene (PTPN22): association between a promoter polymorphism and type 1 diabetes in Asian populations. Am J Med Genet A 2006. 140586–593.593 [PubMed]
5. Bottini N, Vang T, Cucca F, Mustelin T. Role of PTPN22 in type 1 diabetes and other autoimmune diseases. Semin Immunol 2006. 18207–213.213 [PubMed]
6. Vang T, Congia M, Macis M D, Musumeci L, Orru V, Zavattari P. et al Autoimmune‐associated lymphoid tyrosine phosphatase is a gain‐of‐function variant. Nat Genet 2005. 371317–1319.1319 [PubMed]
7. Carlton V E, Hu X, Chokkalingam A P, Schrodi S J, Brandon R, Alexander H C. et al PTPN22 Genetic Variation: Evidence for Multiple Variants Associated with Rheumatoid Arthritis. Am J Hum Genet 2005. 77567–581.581 [PubMed]
8. Hinks A, Barton A, John S, Bruce I, Hawkins C, Griffiths C E. et al Association between the PTPN22 gene and rheumatoid arthritis and juvenile idiopathic arthritis in a UK population: Further support that PTPN22 is an autoimmunity gene. Arthritis Rheum 2 521694–1699.1699
9. Arnett F C, Edworthy S M, Bloch D A, McShane D J, Fries J F, Cooper N S. et al The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis. Arthritis Rheum 1988. 31315–324.324 [PubMed]
10. MacGregor A J, Bamber S, Silman A J. A comparison of the performance of different methods of disease classification for rheumatoid arthritis. Results of an analysis from a nationwide twin study. J Rheumatol 1994. 211420–1426.1426 [PubMed]
11. Wordsworth P, Pile K D, Buckely J D, Lanchbury J S, Ollier B, Lathrop M. et al HLA heterozygosity contributes to susceptibility to rheumatoid arthritis. Am J Hum Genet 1992. 51585–591.591 [PubMed]
12. Huffmeier U, Steffens M, Burkhardt H, Lascorz J, Schurmeier‐Horst F, Stander M. et al Evidence for susceptibility determinant(s) to psoriasis vulgaris in or near PTPN22 in German patients. J Med Genet 2006. 43517–522.522 [PMC free article] [PubMed]
13. Kawasaki E, Awata T, Ikegami H, Kobayashi T, Maruyama T, Nakanishi K. et al Systematic search for single nucleotide polymorphisms in a lymphoid tyrosine phosphatase gene (PTPN22): association between a promoter polymorphism and type 1 diabetes in Asian populations. Am J Med Genet A 2006. 140586–593.593 [PubMed]
14. Amos C I, Chen W V, Lee A, Li W, Kern M, Lundsten R. et al High‐density SNP analysis of 642 Caucasian families with rheumatoid arthritis identifies two new linkage regions on 11p12 and 2q33. Genes Immun 2006. 7277–286.286 [PubMed]

Articles from Annals of the Rheumatic Diseases are provided here courtesy of BMJ Publishing Group