In humans, beta-defensins are small, secreted, antimicrobial peptides, which are encoded by
DEFB genes in three main gene clusters: two on chromosome 20 and one on 8p23.1
1. Of the eight beta-defensin genes at 8p23.1,
DEFB1 (encoding the protein hBD-1) and
DEFB103 (encoding the protein hBD-3) are expressed constitutively in skin and
DEFB4 (encoding the protein hBD-2) can be induced in cultured keratinocytes by cytokines or bacterial lipopolysaccharides
2. The beta-defensin genes on 8p23.1, with the exception of
DEFB1, but including
DEFB4,
SPAG11,
DEFB103,
DEFB104,
DEFB105,
DEFB106 and
DEFB107, are on a large repeat unit (
Supplementary Figure 1) that is variable in copy number
3. Individuals have between 2 and 12 copies per diploid genome, with a modal copy number of four in the UK. The antimicrobial and pro-inflammatory nature of these beta-defensins suggests that quantitative variation in gene dosage might contribute to susceptibility to infectious and inflammatory disease
4.
Psoriasis is a common inflammatory skin disease with a prevalence of about 2% in the populations of developed countries. It has both environmental and genetic parts to its etiology, and linkage analysis has been used to identify multiple loci and alleles that confer risk of the disease, with the strongest genetic effect at 6p21.3, where haplotypes carrying the HLA-
Cw6 allele are associated with an increase in risk
5. Psoriasis is characterized by red scaling elevated plaques commonly on elbows, knees and trunk. Histological examination of psoriatic lesions shows inflammation and disturbed epidermal differentiation. hBD-2 is induced as a part of the inflammatory response in skin, which, in psoriasis, is part of the regenerative maturation process involving hyperproliferation and induction of marker genes such as elafin and the cytokeratins 6, 16 and 17. In addition to their antimicrobial activity it has been shown that hBD-2 and other skin beta-defensins have cytokine-like properties
6. The central role of these proteins in the innate immune system of the skin suggested that beta-defensin genes could be candidate genes for psoriasis susceptibility.
Copy number variation at the 8p23.1 beta-defensin cluster, commonly over the 2-7 copy range, poses greater challenges for accurate genotyping than at lower copy numbers. To investigate the relationship between beta-defensin copy number and susceptibility to psoriasis, we initially used a multiplex amplifiable probe hybridisation assay (MAPH,
Supplementary Table 1) to determine copy number of the beta-defensin repeat per diploid genome
3,7,8. Comparison of (unrounded) MAPH data from 190 Dutch cases and 303 controls suggested an association between increased defensin copy number and psoriasis (p = 1.65× 10
−6, t-test). As an alternative assay for beta-defensin copy number we used the higher-throughput Paralogue Ratio Test (PRT) on the same set of samples; PRT typing of
DEFB4 copy number uses specific co-amplification of a heat-shock protein pseudogene upstream of
DEFB4 together with a single-copy paralogue on chromosome 5, and has been described previously
9. Comparison of PRT results from 179 Dutch cases and 272 controls also suggested an association with psoriasis, but at a lower significance (p = 0.01, t-test).
Although capable of high throughput, a single PRT assay has an error rate
9 for
DEFB4 copy number estimated at about 8%. True copy number values are likely to be integers. We therefore sought to improve the overall accuracy of copy number determination in this cohort by combining information into a consensus integer copy number for 179 cases and 272 controls, using data from MAPH, PRT, and ratios of multisite variants (MSVs)
7,10 mapping around the
DEFB4 gene (dbSNP reference numbers rs2740091, rs2737532 and rs3762040) (“REDVR”, see
Supplementary Methods). Integer copy number agreed between MAPH/REDVR and first-pass PRT for 78% of samples, and for a further 11% on repeat PRT typing of discordant samples. This consensus integer copy number was significantly higher among cases than controls (, p = 7.8 × 10
−5, t-test).
As an independent test of this association we typed 305 controls and 319 patients from Germany using PRT alone. Analysis of PRT results showed a significant association with both unrounded (p = 9.02 × 10
−6, t-test) and integer-rounded analyses (, p = 2.95 × 10
−5, t-test). We do not believe that our findings can be attributed to differential genotyping bias
11; in addition to the independent replication, important details of genotyping, including sample preparation, missing data and integer clustering are closely matched between cases and controls for both cohorts (see
Supplementary Methods, sections C1 and C2). We have also made detailed comparisons between independent typing platforms for the Dutch samples, and believe that our copy number typing attains a high standard of accuracy (see
Supplementary Methods, section C3). The full set of data for all samples typed is available as
Supplementary Table 2. There was no significant difference between the Dutch and German cohorts for controls (p=0.165, t-test) or patients (p=0.95, t-test). Assuming a population prevalence of 2%, the relative risk of psoriasis for each copy number class can be inferred, with 1.00 representing the mean population risk (). Combining both cohorts, there is significant support (p=0.005, linear regression ANOVA, weighted by the square root of sample size) for a specifically linear model where each additional copy above two copies increases the relative risk, and the linear regression equation suggests that each copy adds 34 percentage points (95% CI 25%-43%) to the relative risk.
| Table 1Influence of beta-defensin copy number on relative risk of psoriasis. |
We cannot exclude the possibility that the nucleotide state of an MSV actually causes susceptibility to psoriasis, and that copy number is only indirectly associated, as a proxy for this sequence variant. For the variants we have studied, in an extended Dutch cohort, psoriasis shows no association with rs3762040 (p=0.958) and only weak association (p=0.043) with rs2740091. However, we note that more than 200 substitutional variants are reported in dbSNP for this region and were not tested here.
Because of the cytokine-like properties of beta-defensins, either high resting levels or high induced levels may be a precipitating factor after minor skin injury, infection or some other environmental trigger. This could lead to an inappropriate inflammatory response generating the clinical symptoms typical of psoriasis. It has been shown that the level of hBD-2 in keratinocytes after induction by cytokines is correlated with its basal expression level
12, and hBD-2, hBD-3 and hBD-4 have all been found to stimulate keratinocytes to release IL-8, IL-18 and IL-20, which are all proinflammatory cytokines that have an established role in the etiology of psoriasis
13. The genes encoding hBD-2, hBD-3 and hBD-4 (
DEFB4,
DEFB103 and
DEFB104 respectively) are all on the beta-defensin repeat and show the same copy number, so we cannot distinguish whether one gene or a combination of all three genes is responsible for the gene dosage association with psoriasis.
We have identified a psoriasis susceptibility locus not previously identified by linkage analysis. We therefore considered whether the strength of effect observed in our data should give rise to a detectable linkage signal. We simulated pedigrees with the same beta-defensin haplotype population frequencies and subject to the same dependence of risk on beta-defensin copy number genotype (, and
Supplementary Methods). Only a minority of simulated studies involving 500 affected sib pairs detect linkage at even “suggestive” levels of significance (p<0.05). Similarly, simulation of sibling recurrence risk (λ
s), suggested that the beta-defensin locus accounts for a locus-specific λ
s of about 1.08. This low value largely reflects the common occurrence of the variation, and the fact that within pedigrees, total diploid copy number may not have a simple relationship with inheritance of parental alleles. At this level of effect, loci are undetectable by even large linkage studies, and may only be discovered by candidate gene association studies.
Copy number polymorphism of the cytokine
CCL3L1 is reflected in expression levels and has been shown to influence susceptibility to HIV-1 infection
14, and low copy number of complement C4 genes was associated with SLE in a recently published study
15. We adopted PRT as a high-throughput method to genotype loci like these, with copy numbers of 2-12, to determine discrete copy numbers while maintaining an accuracy comparable to MAPH. Given the number of loci that vary in copy number across the genome, and the large number of these loci that are candidates for susceptibility to different diseases, large-scale copy number typing of case-control cohorts will be a priority.