|Home | About | Journals | Submit | Contact Us | Français|
Users may view, print, copy, download and text and data- mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
Adverse events linked to perturbations of cellular genes by vector insertion reported in gene therapy trials and animal models have prompted attempts to better understand the mechanisms directing viral vector integration. The integration profiles of vectors based on MLV, ASLV, SIV, and HIV have all been shown to be non-random, and novel vectors with a safer integration pattern have been sought. Recently we developed a producer cell line called CatPac that packages standard MoMLV vectors with FeLV gag, pol and env gene products. We now report the integration profile of this vector, asking if the FeLV integrase and capsid proteins could modify the MoMLV integration profile, potentially resulting in a less genotoxic pattern. We transduced rhesus macaque CD34+ hematopoietic progenitor cells with CatPac or standard MoMLV vectors, and determined their integration profile by LAM-PCR. We obtained 184 and 175 unique integration sites (IS) respectively for CatPac and standard MoMLV vectors, and these were compared to 10 000 in silico-generated random IS. The integration profile for CatPac vector was similar to MoMLV and equally non-random, with a propensity for integration near transcription start sites and in highly dense gene regions. We found an IS for CatPac vector localized 715 nucleotides upstream of LMO-2, the gene involved in the ALL developed by X-SCID patients treated via gene therapy using MoMLV vectors. In conclusion, we found that replacement of MoMLV env, gag, and pol gene products with FeLV did not alter the basic integration profile. Thus there appears to be no safety advantage for this packaging system. However, considering the stability and efficacy of CatPac vectors, further development is warranted, utilizing potentially safer vector backbones, for instance those with a SIN configuration.
Insertional mutagenesis after gene transfer in animal models1,2 as well as in X-SCID clinical trials3–5 has led numerous laboratories to focus on the safety of the virus-based vectors used to transfer and express genes of interest. A better understanding of the mechanisms that define the characteristics of the integration of virus-based vectors should allow the development of new vectors with increased biosafety. Taking advantage of the subgroup C Feline Leukemia Virus (FeLV-C) as a platform for efficient transduction of target human cell types, including CD34+ stem and progenitor cells, we recently developed an FeLV-C based packaging system named CatPac. HEK293 cells were engineered to stably package MoMLV vectors with FeLV 61E gag-pol gene products, in addition to FeLV-C Sarna env, resulting in a completely FeLV-derived packaging system6. In the current study we assessed the integration profile of CatPac vector in comparison to the same MoMLV vector packaged with MoMLV gag-pol and amphotropic env in order to elucidate the influence of the FeLV component packaging proteins on viral integration. Both primary vector sequences as well as integrase proteins have been implicated in genomic integration site selection for retroviruses, and thus we asked whether this hybrid vector might have a potentially less genotoxic integration pattern than standard MoMLV, due to the presence of the alternative FeLV integrase machinery. FeLV-C and MoMLV are both members of the mammalian C-type retrovirus family, but there is no prior information on the integration profile for either FeLV-C or hybrid MoMLV/FeLV packaging and vector systems.
Rhesus macaque CD34+ cells were obtained as previously described7. Two rhesus macaques (RQ4984 & RQ4972) were treated with a combination of G-CSF and SCF and mobilized circulating hematopoietic progenitor and stem cells were collected by apheresis. PBMNCs contained in apheresis product were purified by density gradient centrifugation over lymphocyte separation media (LSM, MP Biomedicals) and CD34+ cells were positively immunoselected using the 12.8 IgM anti-CD34 biotinylated antibody and MACS streptavidin microbeads (Miltenyi Biotec). The purified CD34+ cells were stimulated for 48 hrs in DMEM supplemented with 10% FCS, SCF, Flt3-L, and TPO and then transduced twice on fibronectin-treated plates (Retronectin, Takara) with the CatPac or control MoMLV vector supernatants, using our standard in vitro transduction conditions8. The vectors contained the GFP marker gene, allowing assessment of transduction efficiency by flow cytometry. In both animals, the transduction efficiencies of CD34+ cells for CatPac and control MoMLV packaging systems were equivalent: for RQ4984 8.1 % and 8.8 % of GFP positive cells for CatPac and MoMLV vectors respectively, and for RQ4972 39.7 % and 20.3% respectively. Genomic DNA of transduced CD34+ cells was isolated using Qiagen DNeasy Blood & Tissue kit (#69506) and LAM-PCR was carried out as previously described9 with 100 ng of total genomic DNA. TasI was used as the restriction enzyme. Linear PCR was carried out with biotinylated primer LTRa (Biot 5′-TGCTTACCACAGATATCCTG-3′). The first exponential PCR was carried out with primer LCI (5′-GACCCGGGAGATCTGAAT-3′) and LTR-R1 primer (5′-CAGCTGTTCCATCTGTTC-3′), whereas the second exponential PCR was carried out with LCIII (5′-AGTGGCACAGCAGTTAGG-3′) and LTR-R2 primer (5′-GCTAGCTTGCCAAACCTA-3′). Sequences obtained were aligned by BLAT or BLAST to the rhesus macaque genome assembly (Mmul 1.0, Jan 2006). We obtained 200 and 187 valid integration sites (IS) for CatPac and MoMLV vectors respectively (Supplemental Information). Sites mapping to 2 or more genomic positions as well as sites mapping within repetitive genomic sequences were omitted to finally obtain 184 and 175 unique IS respectively for CatPac and MoMLV vectors. We also compared both profiles to in silico-randomly generated IS sets as previously described9. Briefly, 10 000 sets of 184 or 175 random IS were designed in silico as follows: An AATT (TasI) site in the genome was selected at random using a random number generator. The in silico IS was placed either upstream or downstream (p=0.5) of the AATT site, at a distance matching the size of one of the sequences obtained experimentally. The in silico IS was validated only when a BLAST alignment of the genomic sequence between the AATT and the IS returned a unique sequence in the genome. This operation was repeated 184 times and 175 times respectively for CatPac and MoMLV vectors to obtain a single matching random dataset. These control datasets were subjected to the same analyses as the experimental datasets, and the results were used to generate empiric p-values. We generated two groups of control genomic coordinates, one with 10 000 sets of 184 coordinates each to mimic the CatPac vector IS, and one with 10 000 sets of 175 coordinates each to mimic the MoMLV IS. For gene annotation, Ensembl release 54 (May 2009) comprising 38 146 predicted gene transcripts was used. The association between the CatPac and MoMLV vectors integration sites was tested using a Chi-square test.
The results obtained for MoMLV are comparable to previously reported integration patterns for standard MoMLV derived vectors10,11. We confirmed the preference of MoMLV for integrating near the TSS, a feature also shared by CatPac vector. Like other retroviral vectors already studied such as MoMLV10,11, ASLV12, SIV10,13, and HIV12,14, CatPac vector exhibited a profile of integration significantly different from random integrations as can be seen in all the tables and figures presented in this report. As shown in Figure 1, out of the 114 CatPac vector and 112 MoMLV IS located in a window of 60 Kb centered on transcription start sites (TSS), 44% (CatPac vector) and 35% (MoMLV) occurred in a 10 Kb window centered on TSS, implicating a preference for integration near TSS. With regard to inter or intragenic insertions, Table 1 shows that approximately half of the vector integrations were found inside of a gene (49.5 and 54.3% for CatPac vector and MoMLV vectors respectively). One quarter of all integrations were found within a 30 Kb window upstream of a gene (Table 1). In these upstream regions CpG islands are motifs found in or in the vicinity of promoters. As shown in Table 2 integration of both vectors in or around (±1 Kb and ±5 Kb) CpG islands was significantly different from the random datasets. A quarter of the insertion sites were found ±5 Kb around CpG islands for CatPac (27.7 %) and MoMLV (25.7 %) vectors. However, this higher percentage probably represents the propensity of CatPac and MoMLV vectors to integrate close to the TSS, rather than a preference for CpG islands. Looking at integrations with regard to gene density, Table 2 shows that less than half of the integrations for both vectors were found in lower gene-density regions (0 to 10 genes / Mb) of the genome, and thus 53.8 % and 57.7 % of integration respectively for CatPac and MoMLV vectors in gene dense regions (11 genes/Mb and up). Although this integration profile is significantly different from in silico-generated integrations, there were no obvious differences in the integration pattern between CatPac and MoMLV vectors with respect to gene density.
Theoretically, virus-based vectors can lead to tumor formation by insertional mutagenesis either by activating an oncogene or by disrupting a tumor suppressor gene15. Most, if not all, cases of malignant disease development following gene therapy for hematological disease have been associated with the activation of a proto-oncogene 2–5. We therefore looked at integration of CatPac and MoMLV vectors in the proximity of oncogenes (Table 3). We assessed both the proximity of proto-oncogenes to IS, and conversely the proximity of IS to proto-oncogenes. We defined proto-oncogenes based on the most recent version of the Sanger oncogene file (dated 2008-12-16, at http://www.sanger.ac.uk/genetics/CGP/Census/). Three hundred eighty four human entries were mapped to 368 unique Ensembl rhesus macaque gene identifiers. As shown in Table 3, with this size dataset, the MoMLV integrations were not notably different from randoms with regard to proximity to oncogenes. However, the CatPac vector integrations were more likely than the random datasets to be in proximity to oncogenes. The 12 CatPac vector IS in or within a 30 kb window of an oncogene were located: upstream of MLLT11, MUTYH, CYTSB, and LMO2; downstream of ARNT; in the intron of LASP1, Q4W6X8_MACMU, AKAP9, ERG, CBFB, and PCM1; and in the exon of SUFU. The 7 MoMLV IS were located: upstream of MLLT11 and TAL1; downstream of KRAS and STIL; in the intron of CDK6, PMS1, and CIITA. The only oncogene in proximity to both CatPac vector and MoMLV integrations was MLLT11, myeloid/lymphoid or mixed-lineage leukemia translocated to chromosome 11, also termed AF1q. Expression of this gene has been shown to be linked to poor prognosis in AML (reviewed in16) and has a function in apoptosis and drug resistance. We did not find any integrations in or near the MDS1-EVI1 gene complex, previously reported to be over-represented in engrafting human, murine, and rhesus macaque hematopoietic stem and progenitor cells transduced with MoMLV vectors17. Interestingly, in the 12 oncogenes targeted by the CatPac vector we found an integration 715 nucleotides upstream of LMO2, the gene activated in X-SCID trials and causing leukemia through a T cell clonal expansion3–5. LMO2 was also reported as an MLV insertion site in the ADA-SCID and CGD gene therapy trials, but so far, no clonal expansion has been observed18–20.
In order to analyze if there was a significant clustering of vector insertions we used the definition of common integration sites (CIS) reported by Suzuki and coworkers21. As shown in Figure 2, there is a difference between experimental data obtained for CatPac vector and random integrations. We found 7 second order CIS for CatPac vector, a higher value than the frequency of second order CIS in the matching random datasets. None of the CIS identified were localized within 200 Kb of a known proto-oncogene.
These investigations contribute to knowledge regarding the determinants of retrovirus integration into the genome. At least two vector-related elements are involved: the integrase/capsid proteins and/or the primary sequence of the vector backbone itself. The relative contributions of each element in determining integration preferences remain under intense investigation, but it appears that the integrase core region is critical. Swapping the integrase between even closely related viruses, such as HIV and FIV, results in a change in the integration pattern determined primarily by the integrase, along with a contribution from gag-encoded capsid proteins22,23. The path of entry into the cell, determined by the env gene product, appears to have little impact on integration preferences24. In a previous study, pseudotyping of a standard MoMLV retroviral vector with the FeLV-C env protein only resulted in more efficient transduction of primitive CD34+ cells compared to a standard GALV env pseudotype, but no change in integration profile, based on a limited number of integration sites retrieved from myeloid cells following in vivo engraftment25. In the present study, we went on to study the impact of using FeLV gag and pol to package an MoMLV backbone and compared it to an MoMLV backbone packaged with standard MoMLV components, to determine if the integrase and capsid proteins of FeLV would change the integration pattern of MoMLV vector backbones. Utilization of FeLV gag, pol, and env components did not alter the integration profile of MoMLV vectors, in contrast to the prior results comparing the impact of swapping integrase between HIV and FIV viruses. Both standard MoMLV and CatPac vectors integrated within genes and in areas of high gene density, near TSS. Other studies have indicated that vector backbone sequences are also important determinants of integration patterns. The pre-integration complex contains viral integrase and capsid proteins, reverse-transcribed vector DNA, and host cell factors, all implicated in both site-selection and the actual integration process26–28. There are no large-scale integration profiles published for FeLV, however CIS identification in tumors induced by FeLV infection indicate activation of an overlapping but not identical set of genes by FeLV as compared to MoMLV29. Swapping the U3 region of the MoLV LTR with the same region of FeLV results in a partial shift in CIS in tumors30. However, the integration profile of CIS in tumors may relate more to the enhancer activity of a specific U3 region in a target cell type as opposed to actual integration characteristics, since analysis of tumor CIS relies to a large degree on in vivo selection of clones with activating insertions, from an initial highly polyclonal pool of cells.
In conclusion, retroviral vectors packaged using the CatPac system integrate into the genome in a non-random fashion, with a general profile very similar to vectors packaged using standard MoMLV gag and pol gene products, unfortunately characterized by a preference for integration within genes and near TSS. Although the study of a larger number of integration sites might uncover subtle differences between FeLV and MoMLV integrase and capsid determinants of integration, it is clear from the data we obtained in this pilot study that their general profiles are similar and characterized by the same risks. Therefore, while CatPac vector may have advantages in terms of ease of high titer vector production and potential improved efficiency of gene transfer to some target cell populations, the integration profile we have characterized does not suggest any increased safety of these vectors compared to standard retrovirus packaged with MoMLV components.
Jean-Yves Métais would like to dedicate this manuscript to his grandmother Marie Pradier born Sautereau.
This research was supported in part by the Intramural Research Programs of the National Heart, Lung, and Blood Institute and the National Human Genome Research Institute, National Institutes of Health.
Raymond T. Doty and Janis L. Abkowitz are the inventors on a patent filed for the CatPac producer cell line which may have potential for commercial use (patent application serial number 61/058,148). Upon request CatPac packaging cells will be supplied to academic investigators. The authors have no other potential conflicts of interest to declare.