|Home | About | Journals | Submit | Contact Us | Français|
Copy number studies have led to an explosion in the discovery of new segmental duplication-mediated deletions and duplications. We have analyzed copy number changes in 2419 patients referred for clinical array comparative genomic hybridization studies. Twenty-three percent of the abnormal copy number changes we found are immediately flanked by segmental duplications ≥10 kb in size and ≥95% identical in direct orientation, consistent with deletions and duplications generated by non-allelic homologous recombination. Here, we describe copy number changes in five previously unreported loci with genomic organization characteristic of NAHR-mediated gains and losses; namely, 2q11.2, 7q36.1, 17q23, 2q13 and 7q11.21. Deletions and duplications of 2q11.2, deletions of 7q36.1 and deletions of 17q23 are interpreted as pathogenic based on their genomic size, gene content, de novo inheritance and absence from control populations. The clinical significance of 2q13 deletions and duplications is still emerging, as these imbalances are also found in phenotypically normal family members and control individuals. Deletion of 7q11.21 is a benign copy number change well represented in control populations and copy number variation databases. Here, we discuss the genetic factors that can modify the phenotypic expression of such gains and losses, which likely play a role in these and other recurrent genomic disorders.
Segmental duplications are substrates of genomic instability that mediate deletions and duplications by unequal crossing-over between paralogous segments. This mechanism is responsible for the genomic changes underlying many classic genetic syndromes, including 22q11.2 deletion syndrome, Charcot-Marie-Tooth syndrome, hereditary neuropathy with liability to pressure palsies, Williams-Beuren syndrome, Prader-Willi syndrome, Angelman syndrome and Smith-Magenis syndrome (1). Recent whole-genome array comparative genomic hybridization (CGH) studies have led to the identification of new segmental duplication-mediated chromosome rearrangements that are emerging as distinct syndromes (2–11). Given that segmental duplications make up approximately 5% of the human genome (12), many more rearrangements await discovery. Copy number changes mediated by segmental duplications may be either pathogenic or exist as benign variants in the human population. Here, we describe five novel loci subject to copy number change flanked by highly homologous segmental duplications. As with other recurrent rearrangements generated by NAHR, we find reciprocal deletions and duplications.
We analyzed 2419 samples from patients referred to our clinical cytogenetics laboratory for array CGH testing. Clinical indications for testing were diverse, and included developmental delay, autism and birth defects among the most common features reported to us. Our array combines targeted and whole-genome coverage on a 44 000-oligonucleotide platform with a mean spacing of 75 kb between genomic probes (13). We identified 457 array cases representing clinically significant genomic abnormalities (19% abnormality rate) and compared these copy number changes to 130 rearrangement hotspots described by Sharp et al. (5). Rearrangement hotspots were defined as 50 kb to 10 Mb genomic regions flanked by segmental duplications that are ≥10 kb in size and ≥95% identical (5). We identified 98 patients with previously described genomic disorders mediated by segmental duplications (Supplementary Material, Table S1). Nine patients had deletions and/or duplications in four previously unreported loci flanked by segmental duplications (Fig. 1, Table 1, Supplementary Material, Table S2). Thus, 23% (107/457) of the clinically significant copy number changes identified by our array are flanked by segmental duplications. We also detected a benign deletion of chromosome 7q11.21 flanked by segmental duplications in one patient with no clinically significant copy number changes. All deletions and duplications were heterozygous (loss or gain of one copy) and were confirmed by fluorescence in situ hybridization (FISH), as described previously (13). To evaluate the inheritance of copy number changes, we performed FISH analysis on parental samples with probes specific to the regions of gain or loss. We then verified the de novo status of copy number changes not detected in parents by microsatellite analysis (15 loci evaluated, AmpFlSTR Identifiler PCR Amplification Kit no. 4322288, Applied Biosystems, Inc.), which confirmed familial relationships in all cases.
Next we analyzed the genomic architecture of the five regions relative to the reference genome assembly (Build 36.1, hg18). Segmental duplications were identified using the duplication track on the UCSC browser (14,15) and confirmed using the cross_match alignment program (http://www.phrap.org/). We identified six highly homologous segmental duplication pairs in direct orientation flanking the deletion and duplication regions (Table 2). Such conformations are capable of mediating reciprocal gains and losses via non-allelic homologous recombination (NAHR) (1).
We interpreted the clinical significance of deletions and duplications based on the size of gain/loss, gene content, inheritance pattern and frequency in control populations (Fig. 1, Table 1). Given the relatively few individuals carrying any particular copy number change, comprehensive genotype–phenotype correlations were not possible at this time. The major phenotypic features of the patients in our study may be found in the supplementary data (Supplementary Material, Table S3).
To determine the frequency of copy number changes at our five loci in the general population, we searched previously reported copy number variation (CNV) data sets ascertained from apparently normal individuals (16–25) and consulted CNVs reported in online databases (http://www.genome.ucsc.edu/; http://projects.tcag.ca/variation/). We also screened a control population of 876 individuals for gains and losses of the five loci (Affymetrix Genome-Wide Human SNP Array 6.0: n = 347 unaffected parents of schizophrenia probands, plus n = 529 Ashkenazi Jewish patients with Crohn's disease, dystonia or Parkinson's disease; J.G.M., unpublished data).
We identified two patients with overlapping 1.5-Mb copy number changes in 2q11.2 flanked by clusters of segmental duplications. Patient 1's duplication is slightly larger than Patient 2's deletion, corresponding to neighboring segmental duplication pairs A/A′ and B/B′ (Fig. 1). Segmental duplications A and B are not homologous, rather they are part of large duplication clusters flanking the rearrangements. The deletion and duplication span a minimum of 17 genes, none of which have been associated with human disease. Parental studies demonstrated that Patient 1's duplication was de novo; however, Patient 2's parents were unavailable for study. Neither the duplication nor the deletion was found in our control population or in CNV databases, consistent with the interpretation of these imbalances as pathogenic copy number changes.
The 2.35 Mb deletion of 7q36.1 spans 35 genes, including two genes involved in heart function, PRKAG2 and KCNH2 (Fig. 1). Heterozygous mutations in PRKAG2 cause hypertrophic cardiomyopathy with Wolff–Parkinson–White syndrome (MIM 602743) (26–28). KCNH2 encodes a voltage-gated potassium channel, and mutations in this gene cause long QT syndrome type 2 (MIM 152427) (29,30) and short QT syndrome type 1 (MIM 609620) (31). Mutations in both PRKAG2 and KCNH2 are inherited in an autosomal dominant manner, exhibiting variable expressivity and incomplete penetrance (26,32–34). Patient 9 had a normal EKG at 19 months of age, suggesting incomplete penetrance of the cardiac defects associated with loss of function of PRKAG2 and KCNH2. However, heterozygous mutations in KCNH2 have been found in patients with epilepsy (35,36), and Patient 9 has a history of seizures. The 7q36.1 deletion occurred de novo and was not present in either our control population or CNV databases.
The 2.14-Mb deletion of 17q23 removes 11 genes, including TBX4. Notably, heterozygous loss-of-function mutations in TBX4 have been identified in families with autosomal dominant small patella syndrome (MIM 147891) (37); nevertheless, Patient 14 shows no patellar, pelvic or foot anomalies typical of the syndrome. The 17q23 deletion occurred de novo and is not found in our control population or other CNV databases. The absence of a skeletal phenotype in Patient 14 may be owing to incomplete penetrance of small patella syndrome.
The most frequent site of deletion/duplication in our study is a 1.62-Mb region of 2q13 that includes 10 genes. There is an assembly gap in the interval of gain/loss; thus, the region may be larger or smaller than 1.62 Mb (Fig. 1). We identified two duplications (both paternally inherited) and three reciprocal deletions (one paternally inherited, two of unknown inheritance) (Table 1). Patient 7 also carries an unbalanced translocation, complicating our interpretation of how the 2q13 deletion contributes to his phenotype. Thus, we compared the phenotypes of Patients 3–6 to discern common features of the deletion and duplication. Most phenotypic findings were non-specific (Supplementary Material, Table S3); however, tooth abnormalities were noted in three patients. Patients 3 and 4 (duplication carriers) have dental crowding, and Patient 6 (deletion carrier) has widely spaced teeth. Interestingly, the FBLN7 gene is located in the 2q13 region of gain/loss. FBLN7 is a cell adhesion molecule that plays a critical role in the differentiation and maintenance of odontoblasts and in dentin formation (38). To date, there are no known loss-of-function mutations in FBLN7; however, it is tempting to speculate that loss and gain of the gene may affect tooth development.
An overlapping 2q13 deletion has been reported previously, though the flanking segmental duplications and mechanism of rearrangement were not described (39). Brothers with the deletion had developmental delay and dysmorphic features not present in their unaffected mother who transmitted the deletion. The more affected proband had hypotonia and epilepsy, features found in a subset of our patients with 2q13 deletions (Supplementary Material, Table S3). Tooth abnormalities were not described in this family.
We detected one individual with the same 2q13 duplication in our control set. The reciprocal deletion, on the other hand, was not detected in our controls, but the deletion was transmitted from an apparently normal parent in at least one family in our study and another in the literature (39). Approximately 850 kb of the 1.62-Mb 2q13 region has been reported in CNV databases; however, many genes lie outside of this region, including FBLN7 (Fig. 1). Thus, there may be phenotypic differences between those carrying gains and/or losses of the common 850 kb CNV and the 1.6 Mb region we describe. Like several other genomic changes (7,9,11,40–42), deletions or duplications of this region could be pathogenic, with variable expressivity and/or incomplete penetrance in some individuals. Conversely, this could be a copy number change that affects tooth development but does not contribute to cognitive disabilities.
One deletion in our study falls clearly into the benign variant category. The 461-kb loss of 7q11.21 deletes a single zinc finger gene, ZNF92 (Fig. 1), a member of a large and copy number variable gene family (43). The inheritance of Patient 8's deletion is unknown; however, we found three losses and one gain of this region in our control population. Further, gains and losses overlapping this entire region have been reported in multiple population-wide CNV studies with a frequency similar to that in our control population (16,24). It is important to note that the same recombination mechanism that generates recurrent pathogenic deletions and duplications can also give rise to normal CNV in the human genome.
We found that 23% of the clinically significant copy number changes detected by array CGH testing are flanked by highly homologous segmental duplications. Owing to the genome-wide coverage of our array CGH design, we were able to detect previously described as well as novel NAHR events. We identified five regions of copy number change in affected patients. Similar to the genomic architecture that mediates deletions and duplications underlying other NAHR-driven genomic disorders (1,5), the segmental duplication pairs in our study are ≥10 kb in length and ≥95% identical (Table 2). Within the regions of gain or loss, we find genes in which heterozygous loss-of-function is disease-causing, as well as genes with putative roles in patient phenotypes.
In three out of five loci, we detected more than one individual with the deletion and/or reciprocal duplication, consistent with a recurrent mechanism of chromosome rearrangement. Deletions of 7q36.1 and 17q23.2 were singleton events, rare in our patient population but likely to occur again given the organization of segmental duplications. We expect that further study of other affected populations will reveal more individuals with the same genomic changes, leading to a better understanding of the phenotypic spectrum associated with particular recurrent deletions and duplications. The discovery of recurrent genomic imbalances is critical to the burgeoning field of CNV. Deletions and duplications associated with the 130 segmental duplication hotspots identified by Sharp et al. (5) may be embryonic lethal, pathogenic in liveborn individuals, or benign copy number changes (25). In this study we have characterized three loci associated with pathogenic copy number changes and one benign copy number change.
We have also identified copy number changes of uncertain clinical significance. We found deletions and duplications of 2q13 in some apparently normal individuals; however, copy number changes in these regions include several genes and are not common copy number variants in control populations (25). This type of inheritance has been described in other recurrent genomic disorders (7,9,11,40–42), and complicates any interpretation of the clinical significance of such findings.
Our data highlight the importance of other genetic factors in modifying the expression of genomic imbalances. Loss-of-function mutations in PRKAG2, KCNH2 and TBX4 cause autosomal dominant syndromes, but do not produce the associated phenotypes in some patients with heterozygous deletions encompassing those genes. Further, large deletions and duplications may be inherited from a phenotypically normal parent and exist as a rare copy number change (<1%) in cohorts of normal individuals. Genetic effects of other loci modify phenotypic expression and penetrance of copy number changes. In the case of deletions, loss of one gene copy may unmask a recessive allele on the intact chromosome. Both deletions and duplications could be subject to genomic imprinting, whereby the parental origin of the transmitted copy number change determines phenotype. All these factors are critical when considering newly described deletions and duplications in a relatively small group of patients. Genomic regions identified in this study are subject to recurrent deletion and duplication; thus we expect more individuals with the same copy number changes to emerge, which will lead to a more refined phenotype, as has been the case with other genomic disorders.
Patient samples were collected as previously described (13). This study was approved by the Emory University Institutional Review Board. Informed consent was obtained as per the study protocol.
Our oligonucleotide microarray combines targeted and genome-wide coverage in a 4 X 44K format (Agilent Technologies, Santa Clara, CA, USA) (13). Mean backbone spacing is approximately 75 kb between genomic probes. Array CGH was performed following the manufacturer's protocol as described previously (13). Genomic DNA was extracted from peripheral blood from patients and control individuals. Patient DNA was co-hybridized with a pool of five sex-mismatched control DNA samples. Arrays were scanned using a GenePix 4000B scanner (Molecular Devices, Sunnyvale, CA, USA) and signal intensities were evaluated using Feature Extraction Version 126.96.36.199 software (Agilent Technologies). We used DNA Analytics Version 4.0 software (Agilent Technologies) to analyze the array data and determine copy number gains and losses. Deletions and duplications were confirmed by FISH, as described previously (13).
This work was supported by the National Institutes of Health (R01 MH074090 to D.H.L. and C.L.M.).
We thank Daniel Moreno De Luca for helpful discussions and critical reading of the manuscript. Andres Moreno De Luca made the initial observation of the recurrent 2q11.2 copy number change. We also thank Cheryl Strauss for editorial assistance. The Emory Cytogenetics Laboratory performed the array CGH experiments.
Conflict of Interest statement. None declared.