|Home | About | Journals | Submit | Contact Us | Français|
Chromosomal DNA is a dynamic structure that can adopt a variety of non-canonical (i.e. non-B) conformations. In this regard, at least ten different forms of non-B DNA conformations have been identified, and many of them have been found to be mutagenic, and associated with human disease development. Despite the importance of non-B DNA structures in genetic instability and DNA metabolic processes, mechanisms remain largely undefined. The purpose of this review is to summarize current methodologies that are used to address questions in the field of non-B DNA structure-induced genetic instability. Advantages and disadvantages of each method will be discussed. A focused effort to further elucidate the mechanisms of non-B DNA-induced genetic instability will lead to a better understanding of how these structure-forming sequences contribute to the development of human disease.
DNA is not a uniform molecule and can adopt more than 10 types of non-B DNA conformations (DNA structures that are different from the classic Watson and Crick B-DNA helix). Although most regions of genomic DNA are in the right-handed double helix B-form, alternative DNA structures can exist transiently in the genome. The factors that influence secondary DNA structure formation include the nucleotide sequence, DNA metabolic activities, and the intracellular environment (1). Sequencing of the genomic DNA from various species (including human) has revealed an abundance of repetitive sequences, which have the capacity to adopt a variety of non-canonical DNA structures (2-4). For example, long simple repeat sequences can form slippage structures initiated by misalignment of the two strands, which can then result in looped-out repeat units (5); if the nucleotides in the loop regions are self-complementary and can form intra-strand Watson and Crick base pairs, such as in CNG (“N” can be any one of the 4 nucleotides), then a hairpin structure containing mismatches in the stem can form (6). Because purine bases can exist in a syn conformation, alternating purine/pyrimidine sequences, such as GT and GC repeats, are readily converted into a left-handed Z-DNA conformation, which is named because of the zig-zag shape of the phosphate backbone (7, 8). Polypurine-polypyrimidine regions with mirror repeat symmetry support the formation of H-DNA, where one of the single strands from half of the tract winds back and pairs with the purine-rich strand in the duplex through the major groove to form a three-stranded helix, with the complement of the third strand remaining unpaired (9, 10). Certain guanine-rich regions of the genome can form G4 DNA, which contains four guanines aligned to each other via Hoogsteen hydrogen bonding (11). Some specific sequences have the potential to form more than one non-canonical DNA structure, e.g., CG repeats can form both Z-DNA and hairpin/cruciform structures (12), and GAA triplet repeat can form H-DNA, or sticky DNA (a structure formed by two interacting triplex-like structures), in addition to the slippage structure due to its simple repeat feature (13, 14).
Recent studies have revealed that many types of non-B DNA structures play important roles in recombination, DNA replication/transcription, and particularly, genetic instability, and have been related with more than 20 human diseases (1, 15). Therefore, it is important to develop reliable experimental systems to study the roles of non-B DNA structures in human disease development and the mechanisms involved. In this review, we summarize a number of methods used to study non-B DNA structures in vitro and in vivo, with a focus on strategies to determine non-B DNA-induced genetic instability in different model systems, particularly on those methods established and used in this laboratory over the past several years.
Non-B DNA structures differ from canonical B-form DNA in many aspects, and some of these differences can be used to detect the presence of non-B DNA structures within a background of canonical B-DNA. As discussed above, most non-B DNA structures contain single-stranded regions that can serve as substrates for single-strand specific chemicals and enzymes. For example, thymines in unpaired regions of the DNA are hyper-reactive to osmium tetroxide and potassium permangenate (16-21); cytosines in single-stranded DNA are sensitive to hydroxylamine (22); adenines and cytosines in single-stranded regions can be modified by bromoacetaldehyde and chloroacetaldehyde. The reaction of chloroacetaldehyde with DNA is less sensitive and slower than the reaction of bromoacetaldehyde with DNA. However, chloroacetaldehyde is more stable in solution than bromoacetaldehyde (23). Haloacetaldehyde reacts with adenines in single-stranded regions (24), and carbodiimide reacts with unpaired thymines and guanines (25). Dimethylsulfate methylates guanines in single-stranded DNA, but it is not able to modify guanines in duplexes or folded structures (e.g., tetraplex DNA) (26, 27). Diethyl pyrocarbonate reacts preferentially with purines at B-Z junctions (28, 29) or unpaired purines in H-DNA structures (30). Chemical probing is usually more sensitive to small distortions within regions of non-B DNA than are enzymatic reactions, and the reactions can be performed under a wider range of conditions (such as pH, temperature, and salt concentrations; see below). However, the modifications per se are not generally directly detectable and require a second step such as piperidine cleavage of the modified base to convert the modification into a break, followed by end-labeling or primer extension to visualize the break. Single-strand specific enzymes such as P1 and S1 nucleases directly cleave single-stranded DNA and can be used to detect non-B DNA structures in one step, but this type of reaction usually requires more stringent conditions. For example, the activity of S1 nuclease is very specific to single-stranded DNA and has been widely used to detect non-B DNA structures including cruciforms and hairpins (31, 32), H-DNA (27, 33, 34), Z-DNA (35, 36), and tetraplex DNA (37), but it works under acidic conditions (pH 4-5) and requires Zn2+ for activity. P1 nuclease is less sensitive than S1 nuclease, but it works under neutral conditions (38). Mung bean nuclease cleaves the single-stranded regions at the tips of hairpin structures, at the junctions of the stem of a cruciform structure (39, 40), and at H-DNA structures (41). T4 endonuclease VII selectively cleaves cruciform structures (42, 43).
Most types of non-B DNA structures require negative supercoiling to provide energy for transition, and a specific number of negative supercoil turns are relaxed once the non-B DNA structure is formed. This topological alteration has been used to detect the presence non-B DNA structures (44, 45) by two-dimensional gel electrophoresis. Briefly, this protocol includes: 1) treatment of DNA with topoisomerase I and increasing concentrations of ethidium bromide (EtBr) to generate various families of topoisomers, which are then combined (~1 μg of each topoisomer); 2) separate the topoisomer mixture on the 1st dimension gel in TBE buffer; 3) cut the DNA strip from the gel after the 1st dimension electrophoresis and turn it 90° and place it across the top of the gel for the 2nd dimension, which contains chloroquine; 4) run the gel in TBE buffer containing chloroquine at the same voltage as used in the 1st dimension electrophoresis; and finally 5) visualize the migration of the topoisomers after soaking the gel in EtBr solution. Because topoisomers migrate according to the number of supercoil turns, the 1st dimension generates a ladder of increasingly negative supercoiled species, with the closed circular (no supercoiled turns) molecules migrating the slowest, at the top of the gel. Chloroquine adds a certain number of positive supercoils to all species, and reverts non-B DNA structure back to the canonical B-DNA form when the reduced negative supercoiling is not enough to support the non-B DNA formation. Topoisomers migrate in the 2nd dimension according to the net sum of supercoil turns present prior to exposure to chloroquine + the effect of chloroquine on supercoil turns + any additional negative turns acquired from the non-B to B-DNA transition. Without a B-DNA to non-B DNA transition, the overall topoisomer distribution adopts a smooth arc-like shape in the 2nd dimension electrophoresis, with the slowest species in the 2nd dimension corresponding to the topoisomer with a net sum of supercoil turns equal to zero. Spots located above the smooth arc-shaped path indicate the presence of non-B DNA conformations at those levels of negative supercoiling (15, 46) since the number of supercoils lost during the structural transition caused these topoisomers to run slower in the 1st dimension.
Structure-, rather than sequence-specific antibodies are powerful tools for detecting non-B DNA structures. Using poly[d(Tm5dC).poly[d(GdA)] to immunize mice, a monoclonal antibody was developed to detect H-DNA. The antibody bound specifically to triplex DNA, particularly to those triplex structures formed at GA repeats, with no detectable binding to single-stranded DNA or to duplex DNA (47). H-DNA structures formed in plasmids or in mammalian chromosomes have been detected in assays using this H-DNA antibody (48-50). An anti-Z-DNA antibody was generated by immunizing rabbits with brominated poly[d(GdC).poly[d(G-dC)], and was used for detecting Z-DNA in chromosomes of Drosophila melanogaster by immunofluorescent staining (51) and Z-DNA formed at a GT(32) sequence in a negatively supercoiled plasmid in vitro (52). Monoclonal antibodies were also developed to recognize cruciform and T-shaped DNA structures and were able to protect the cruciforms against cleavage by mung bean nuclease or T7 endonuclease III (39).
Other methods have been employed for detecting non-B DNA structures. For example, structures that induce bending on DNA strands, such as cruciforms, H-DNA, and G-loops (large structures containing G4 DNA on the G-rich strand and a RNA/DNA hybrid on the other) have been visualized directly by electron microscopy (53-57). DNA modifying enzymes have also been used to detect the presence of non-B DNA structures. For example, the formation of Z-DNA protects that region of DNA from methylation modification and restriction digestion (58).
Expansion of trinucleotide repeats has been associated with more than 20 neurological disorders, and numerous studies have been performed to detect the expansion and contraction of repeat units in different model systems and in patient samples. The most commonly used method is direct measurement of the length of the repeat sequences by PCR amplification of DNA purified from tissues or cultured cells (59). If the repeats of different lengths are amplified at the same efficiency, then this method can provide a global view of non-B DNA-induced genetic instability of both expansion and contraction events. The limitation of the PCR-based repeat length measurement is its sensitivity; rare mutation events (<5%) will not be detected as a distinguishable band in the gel.
To detect lower frequency events occurring at simple repeats, selectable or screenable reporter systems have been developed based on length-dependent gain or loss of function of the reporter gene. Insertion of repeats within the reporter (e.g., AAAG(16) or CA(16) repeats in the green fluorescent protein (GFP) gene (60), or tk-neo gene (61)) in the coding region can disrupt the reading frame of the reporter gene, and some expansion or contraction events can restore the reading frame and reactivate reporter expression. Similarly, one can design a forward mutation-screening assay by inserting the repeats in-frame and then screen for frame-shift mutations that inactivate the reporter gene. For the greatest sensitivity, the reporter gene containing the repeats should be integrated as a single copy.
Although expansion or deletion within trinucleotide repeats (TNRs) does not change the reading frame, long triplet repeats can impact transcription or RNA splicing processes and inactivate reporter genes. For example, CAG triplet repeats were inserted at the single-copy chromosomally integrated hypoxanthine phosphoribosyl transferase (HPRT) mini-gene in its intron. The long CAG tract is spliced into the HPRT mRNA and inactivates the HPRT gene. Contraction to <39 CAG repeats reactivates the HPRT gene and permits cells to survive HPRT+ selection (62). Various sequences, located at different positions in reporter genes may have different length thresholds of inactivation or reactivation, and using a sequence length near the threshold should increase the sensitivity of the assay.
The threshold of TNRs for contraction or expansion has been investigated thoroughly in yeast, and this information has been applied to the design of reporter systems (63-65). For example, Lahue’s group developed a selectable yeast system to monitor TNR instability in a chromosomal context, with a detection level of ~2×10-8 per cell generation (66). In this system, a number of trinucleotide repeats were inserted between the TATA box and the URA3 gene, under the Schizosaccharomyces pombe adh1 promoter. The permissive distance between the TATA box and the transcription initiation site for transcription is between 55-125 bp in Saccharomyces cerevisiae (67), which is near the threshold of TNR expansions observed in human patients (~30-40 repeats, (68, 69)). To measure contractions, the distance from the TATA box to the URA3 gene in the initial construct, which was separated by TNR insertion, was longer than the permissive distance so that the transcription machinery was forced to use the upstream ATG start site, which generates an out-of-frame URA3 transcript. Contraction of TNRs brought the URA3 ORF near the TATA box, generating the correct transcript and a functional URA3 gene product. These cells can be selected for by growth on synthetic complete medium in the absence of uracil (64). Similarly, this system can be used to detect TNR expansions that separate the TATA box and the ATG start site over the permissive distance, inactivating the URA3 gene, rendering cells viable in the presence of medium containing 5-fluoroorotic acid (FOA) (70).
Recently, a vector derived from this yeast system was established for monitoring TNR instability in human cells (71). This vector contains the URA3 reporter to monitor for contraction and expansion events (as described above), an SV40 replication origin, and a large T antigen gene to insure replication in the human cells. The vector was incubated in human cells, collected and transformed into yeast for analysis. This in vitro system displayed high sensitivity (~10-6, limited by the background instability in yeast) in detecting contraction or expansion events. This reporter system has been used for studying several aspects of CTG/CAG repeat-induced expansion and contraction (e.g., length threshold, effect of TNR sequence purity, replication orientation), and the results suggested that this system can reflect the process of TNRs in human cells (68, 72-74).
Expansion and contraction of repeat units are specific mutation types most common to simple repeat sequences. However, DNA strand breaks have been related with most types of non-B DNA structures (1). Many breakage hotspots that are associated with human disease (75) are mapped in or near sequences that have the potential to adopt non-B DNA conformations (76-81). For example, segments of the human c-MYC gene capable of adopting non-B structures (e.g., H-DNA) are found in the promoters P1, P2, and upstream of the P1 promoter (82), overlapping with the major breakage hotspots found in c-MYC-induced lymphomas and leukemias (83-87). Multiple Z-DNA motifs have been found near breakpoints in the c-MYC P1 promoter and the 3’ downstream region (88, 89), and the 5’ breakage hotspot in the BCL-2 gene is surrounded by sequences capable of adopting Z-DNA structures (80, 81). Breakpoints in lymphoid tumor-specific translocations also map to Z-DNA-forming sequences (90). Many chromosomal breakpoints associated with t(11;22) translocations in human cells are localized at the center of AT-rich palindromic sequences on 11q23 and 22q11 (91-93).
In experimental systems, these non-B DNA structures have been found to cause large-scale deletion and rearrangements, resulting from DNA double strand breaks (DSBs). Model Z-DNA-forming sequences (CG)14 inserted into a mutation reporter plasmid pUCNIM (described in detail below) induced a mutation frequency ~12-fold greater than the background control in mammalian COS-7 cells, and >95% of the (CG)14-induced mutants were large deletions and/or rearrangements resulting from DSBs located upstream, downstream, and within the (CG)14 repeats in mammalian cells (94). We have also discovered that a native H-DNA-forming sequences from the human c-MYC promoter, and model H-DNA-forming sequences are intrinsically mutagenic in mammalian cells and lead to DSBs (95). Palindromic sequences capable of adopting cruciform structures can induce DSBs (55, 96) and stimulate deletion, amplification, and recombination in bacteria (97), yeast (98, 99), and mammalian cells (100-102).
Although the model systems of bacteria, yeast, and mice share many important features of DNA metabolism with humans, it is important to note that the fate of non-B DNA structures in different species is not always the same. For example, in bacteria, the majority of the Z-DNA-forming CG repeat-induced mutations are small expansions and contractions of CG units within the repeat, likely generated by misalignment during replication (103, 104). However, we have shown that CG repeats in mammalian cells induce a very different kind of mutation event, i.e., DSBs surrounding the Z-DNA structure that can lead to large-scale deletions and rearrangements, and the occurrence of these mutations does not require replication (94). Interestingly, H-DNA-forming sequences, which cause DSBs and large-scale deletions in mammalian COS-7 cells, are stable in bacterial DH5α cells (95). Long CNG triplet repeats tend to cause expansions in human cells, but the majority of mutants are deletions in bacteria, yeast, and mice (104-107). Therefore, we have developed several reporter systems for studying non-B DNA-induced genetic instability in bacteria, yeast, transgenic mice, and cultured mammalian cells (see below).
Shuttle vectors can replicate in both bacterial and mammalian cells, and are powerful tools for studying non-B DNA-induced genetic instability. Mutation reporter genes in the plasmids can be used to reflect the processing of non-B DNA sequences in bacteria or mammalian cells, where a considerable portion of transiently transfected plasmids are capable of being properly chromatinized (108), replicated, damaged, and repaired, thus mimicking the mammalian genomic DNA. The function of the reporter gene recovered from mammalian cells can be determined in an appropriate bacterial host cell. The advantages of this method are that it is facile, sensitive (capable of detecting mutation frequency as low as 10-5), and individual mutants can be characterized.
The supF mutation-reporter system is a forward mutation assay that allows detection of point mutations and a 6variety of deletion and rearrangement events. The pSP189 plasmid (Figure 1) is an SV40-based shuttle vector that can replicate in human cells due to the SV40 origin of replication and T antigen gene, and also in bacterial cells by virtue of the pBR327 origin. The supF gene encodes a suppressor tRNA that suppresses amber mutations in the β-galactosidase gene in an indicator bacterial strain, such as MBM7070, so that blue colonies can be seen resulting from wild-type supF genes and white colonies from mutant supF genes when grown on plates containing 5-bromo-4-chloro-3-indolyl-β-D-galactoside (X-Gal) and isopropyl β-D-thiogalactoside (IPTG). The small size of the supF reporter gene (85-nucleotide mature tRNA, after processing) reduces the spontaneous mutation frequency. Non-B DNA-forming sequences are inserted at the XhoI restriction site 4-bp upstream of the mature tRNA sequence. The non-B DNA-containing plasmids are transfected into human cell lines, using appropriate transfection protocols according to the cell type, such as GenePorter transfection reagent (Gene Therapy Systems, Inc., San Diego, CA) or electroporation (Amaxa Nucleofector System, Cologne, Germany), and allowed to replicate for at least 48 hours prior to analysis. Plasmid DNA is then isolated from the cells (see below), analyzed by blue/white screening, and the mutation frequency of the supF gene calculated by dividing the number of white mutant colonies by the total number of colonies (white+blue). Mutants generated can be analyzed at the molecular level by DNA sequencing. Data obtained can be analyzed for statistical significance using Student’s T test.
We have also constructed a mutation-reporter shuttle vector, pUCNIM (Figure 2A), containing the lacZ gene expressing the amino-terminal fragment of the lacZ gene product (β-galactosidase), and the SV40 and the pBR327 origins of replication. Non-B DNA-forming sequences can be inserted at the multiple cloning site located within the lacZ gene (replacing codons 6-7 of the original lacZ gene). If the insertion is in-frame and does not introduce a stop codon, then sequences of interest can be as long as 80-100 bp and still maintain the function of the lacZ gene. The E. coli lac operon containing a CAP protein binding site, Plac promoter, and a lac repressor binding site provides a system of controllable transcription through the inserted non-B DNA-forming sequences in bacteria, and in mammalian cells that express the LacI protein (109). In addition, we have included a dexamethasone-inducible MMTV-LTR promoter restricted from pMAMneo in reporter plasmid pUCNIM. Upon adding dexamethasone at a final concentration of 10 μM, transcription through the non-B DNA-forming sequences and the lacZ gene was enhanced at least 10-fold over the levels of transcription in the absence of dexamethasone (Figure 2B). The controllable transcription system provides an opportunity to study the effect of transcription on non-B DNA metabolism (94). DNA structure-induced instability, including DSBs, illegitimate recombination, or frame-shift mutations due to small-scale deletion/insertion within the insert, which are not detectable via the supF system, result in dysfunction of the lacZ gene and can be detected by blue/white screening in indicator bacteria (e.g., DH5α) grown on plates containing X-Gal and IPTG. Since the supF and lacZ reporter systems are appropriate for detecting different types of mutations, using both systems enables one to screen a broad range of mutations in cell-based systems with high sensitivity.
Plasmid DNA recovery from mammalian cells is usually performed according the methods described by Hirt (110) with slight modification. Briefly, approximately 0.2-2×107 mammalian cells are washed twice with PBS, and lysed by adding 1 ml Hirt’s lysis buffer (10 mM Tris-HCl, pH 7.5, 10 mM EDTA, 0.6% SDS, and 0.7 μg/ml of proteinase K) at 37°C for 30 minutes. The viscous cell lysate is then transferred into a 1.5 ml Eppendorf tube and mixed gently but thoroughly with 5% volume (0.25 ml) of 5 M NaCl, and incubated at 4°C overnight. The supernatant collected after centrifugation for 30 minutes at maximum speed on a microfuge at 4°C is purified by phenol/chloroform extraction and ethanol precipitation. Alternatively, plasmid DNA can be recovered through an alkaline lysis method, such as using a Qiagen Miniprep Kit (Qiagen Science, Maryland). In our experience, Hirt’s method can recover more nicked or linear plasmid DNA from repair/replication intermediates than the alkaline lysis method. Thus, it is appropriate for directly determining the damage and breakage events that occurred on the recovered plasmid DNA, while the alkaline lysis method recovers more supercoiled plasmids, and therefore is a better option for screening mutations. Purified plasmid DNA is digested with DpnI which cleaves methylated GATC sites as in dam+ bacterial strains, and has no activity on plasmids that have been replicated in mammalian cells, to ensure that all the plasmids transformed into indicator bacteria for mutation screening have been replicated in mammalian cells.
Investigation of non-B DNA-induced DSB formation in yeast has been performed on modified chromosomes (111) or on constructed yeast artificial chromosomes (YACs) (112). In both systems, one end of the chromosome was modified to contain only the selection marker (CAN1 and ADE2, or URA3) and no other essential genes. The non-B DNA-induced DSBs promote the complete loss of that end of the chromosome, including the selection marker. Thus, the chromosomal breakage event can be detected at a frequency as low as 10-10 (CAN1 and ADE2) or 10-8 (URA3).
We use the YAC system to monitor non-B DNA-induced breakage in a chromosomal context (Figure 3) as described by Freudenreich’s group (112). Non-B DNA-forming sequences are placed on the right arm of a YAC, upstream of a URA3 gene. Just proximal to the non-B DNA tract is a “backup telomere”, a 108-bp stretch of C4A4/T4G4 telomeric sequence from Oxytricha, used for telomere addition by yeast telomerase (113). The left arm of the YAC contains the LEU2 gene, for maintaining the YAC, a yeast origin of replication (ARS1), and a centromere (CEN4). Non-B DNA-induced breakage should result in loss of the right arm of the YAC, including the URA3 gene, giving rise to FOA resistance (FOAR). The exposed C4A4 sequence acts as a seed for addition of a yeast telomere for efficient YAC recovery. Thus, measurement of the rate of generation of FOAR cells can be used as a monitor of the rate of breakage.
Since the non-B DNA-forming sequences are inserted ~770 bp away from the 3’-end of the URA3 gene, FOA selection would include only the complete loss of the right arm or large-scale deletion of the YAC. Small deletions, such as mutations from Z-DNA detected by using the lacZ system in bacteria (94), cannot be detected using this YAC system. Furthermore, while the human disease-related GAA/TCC and CAG/CTG repeats have been shown to cause chromosome loss in yeast (111, 112), expansions or recombination events arethe predominant form of genetic instability seen in human patients, suggesting a difference between yeast and human cells. However, it is possible that loss of one end of a chromosome is a costly and deadly event for human cells, and those cells might not survive long enough to be detected.
Although shuttle-vector reporter systems and yeast systems are sensitive and convenient, and can mimic the mammalian genomic DNA in many aspects, in some studies it is important to assess the effect of non-B DNA structures in a chromosomal context in mammalian cells. Chromosomally integrated selectable markers such as the hypoxanthine phosphoribosyl transferase (HPRT) have been used to determine the non-B DNA-induced instability in cultured mammalian cells (62), however, this strategy is not applicable in a living animal.
To develop an in vivo system to screen for genetic instability induced by non-B DNA-forming sequences, we constructed a recoverable shuttle-vector mutation reporter, p2RT (Figure 4). Recoverable plasmid-based transgenic mice or bacteriophage lambda shuttle vectors are established models for facile detection of mutations generated in living animals. We chose a plasmid-based reporter system because it allows for the detection of a variety of mutation types (e.g., translocations, deletions, DSBs), and unlike lambda vectors, the recovery of plasmid reporter DNA is not size-dependent, so mutants with deletions >50 bp can be recovered. This is important because these constitute the majority of mutants caused by non-B DNA, such as H-DNA and Z-DNA (94, 95). The plasmid, p2RT, contains a neomycin resistant (Neo) gene, which provides a selection marker for establishing a stably transected cell line, and only replicates in cells that express the SV40 large T antigen. To integrate this vector in the chromosome of SV40-transformed cell lines, the SV40 Ori should be removed prior to transfection. In both transgenic cell lines and mouse models, the vector containing the non-B DNA-forming sequences are randomly integrated into the mammalian genome (see detail below). To avoid possible position effects due to random integration of the reporter plasmid, two mutation reporter genes were included in the same vector; a lacZ reporter gene, same as in pUCNIM as described previously, and a supF reporter gene located ~2,000 bp away from the lacZ gene. Both reporter genes contain unique restriction sites between the reporter (supF tRNA or lacZ coding region) and the promoter (XhoI and NheI sites in the supF gene, and EcoRI, SacI, SalI, AccI, and SbfI sites in the lacZ gene). Non-B DNA-forming sequences to be studied can be inserted at one of two mutation reporter genes, and the other reporter gene can serve as an internal control for the background mutation frequencies (which might differ among different integration sites). The LacI binding site in the lacZ promoter (Plac) and an additional LacI binding site located ~600 bp away serve as tags during recovery of the plasmid DNA (as described below). The additional LacI site allows for recovery of deletion mutants that may have lost the Plac site. The cut & ligation site (C&L) containing SpeI-EcoRV-SpeI restriction sites can be used to recover the integrated plasmid DNA in single units. This recoverable lacZ-supF dual reporter system allows rapid analysis of a large number of mutants generated in cells and in mouse tissues.
To develop mutation-reporter human cell lines to measure non-B structure-induced mutation frequencies in a chromosomal context, the recoverable dual mutation-reporter construct, p2RT containing non-B DNA sequences or control inserts, following linearization by EcoRV digestion at the cut & ligation site, were transfected into mammalian cells using appropriate transfection methods. Cells were selected in G418 (250-500 μg/ml) and maintained in medium containing 100 μg/ml G418. The copy numbers of incorporated reporter plasmids in individual clones were determined by Southern blot analysis or quantitative Real Time PCR (Wang et al., JNCI, 2008, in press).
The construct for generating transgenic mice was described in our recent publication (Wang et al., JNCI, 2008, in press). Briefly, dual-reporter plasmids containing the structure-forming or control sequences were linearized at the unique EcoRV site, and were microinjected into fertilized mouse oocytes to develop the transgenic mice. The integrated fragments in the genome are therefore flanked by two SpeI sites, which are used for plasmid DNA recovery from the genomic DNA. Transgenic founders were identified by PCR analysis of genomic DNA isolated from tail clips using primers specific to the plasmid sequence. Characterized strains with the maximum copy number of the reporter plasmid were further utilized for the mutation screening studies.
The cell lines or transgenic mice described above can be used to measure the non-B DNA-induced mutation frequencies in a chromosomal context. Genomic DNA is isolated from the cells or from the tissues. Purification of the mutation-reporter containing DNA has been described by Vijg and co-workers (114-117) as outlined below (Figure 5). Briefly, genomic DNA is digested with SpeI to release the mutation reporter plasmid DNA (containing SpeI sites on both ends). Plasmid DNA is then separated from the genomic DNA using the LacI-LacZ fusion protein coupled with magnetic beads, which can bind to either one of two LacI binding sequences in the plasmid, to pull down the DNA fragments containing the reporter gene. The recovered linear plasmid sequences are circularized using T4 DNA ligase, and electroporated into DH5α cells to screen for mutations in the lacZ gene, and into MBM7070 cell to screen for mutations in the supF gene. Non-B DNA-induced mutation frequencies on the reporter gene should be normalized to the background mutation frequency measured from the other reporter gene. Statistical analysis can be performed as described (116). Nucleotide sequences that have the capacity to adopt non-B DNA structures have been found to cause genetic instability in the form of deletions, expansions, and DSBs, which can be detected by appropriate methods described above. Further studies in this field are warranted and will lead to a better understanding of the mechanisms of human diseases and genetic evolution, and may provide new strategies to reduce genetic instability in human cells (118). To facilitate these studies, we list some experimental procedures that we have developed or modified in detail below. Other detailed protocols can be found in the references listed throughout the text.
This method was originally developed in Vijg’s group (114-117). Preparation of the LacI-LacZ fusion protein and its conjugation to magnetic beads coated with sheep anti-mouse IgG through the anti-β-galactosidase monoclonal antibody has been described in detail by Garcia et al (119). Below is a modified protocol for recovery of the p2RT plasmid from mammalian genomic DNA.
Screened or selected mutants induced by non-B DNA structures are the final products of processing and repair. Initial characterization of the mutant plasmids can be performed by restriction digestion analysis and visualized on agarose gels, or by direct DNA sequencing, (e.g., large deletion areas and the existence of microhomologies at the junctions of the deletions would suggest an error-prone end-joining at a DSB (94, 95)) However, it may be useful to determine if strand breaks were in fact generated at the non-B DNA structures. To directly map strand breaks, we have developed a modified linker-mediated PCR (LM-PCR) assay to explore non-B DNA-induced DSB hotspots in plasmids recovered from mammalian cells (94, 95).
We thank Dr. Albino Bacolla for useful comments on the manuscript and Sarah Henninger for manuscript preparation. Support was provided by an NIH/NCI grant to K.M.V. (CA093729) and an NIH/NIEHS grant to K.M.V. (ES015707).
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.