PMCC PMCC

Search tips
Search criteria

Advanced
Results 1-10 (10)
 

Clipboard (0)
None

Select a Filter Below

Journals
Year of Publication
Document Types
1.  Conservation and divergence of chemical defense system in the tunicate Oikopleura dioica revealed by genome wide response to two xenobiotics 
BMC Genomics  2012;13:55.
Background
Animals have developed extensive mechanisms of response to xenobiotic chemical attacks. Although recent genome surveys have suggested a broad conservation of the chemical defensome across metazoans, global gene expression responses to xenobiotics have not been well investigated in most invertebrates. Here, we performed genome survey for key defensome genes in Oikopleura dioica genome, and explored genome-wide gene expression using high density tiling arrays with over 2 million probes, in response to two model xenobiotic chemicals - the carcinogenic polycyclic aromatic hydrocarbon benzo[a]pyrene (BaP) the pharmaceutical compound Clofibrate (Clo).
Results
Oikopleura genome surveys for key genes of the chemical defensome suggested a reduced repertoire. Not more than 23 cytochrome P450 (CYP) genes could be identified, and neither CYP1 family genes nor their transcriptional activator AhR was detected. These two genes were present in deuterostome ancestors. As in vertebrates, the genotoxic compound BaP induced xenobiotic biotransformation and oxidative stress responsive genes. Notable exceptions were genes of the aryl hydrocarbon receptor (AhR) signaling pathway. Clo also affected the expression of many biotransformation genes and markedly repressed genes involved in energy metabolism and muscle contraction pathways.
Conclusions
Oikopleura has the smallest number of CYP genes among sequenced animal genomes and lacks the AhR signaling pathway. However it appears to have basic xenobiotic inducible biotransformation genes such as a conserved genotoxic stress response gene set. Our genome survey and expression study does not support a role of AhR signaling pathway in the chemical defense of metazoans prior to the emergence of vertebrates.
doi:10.1186/1471-2164-13-55
PMCID: PMC3292500  PMID: 22300585
2.  Genome sequence of the stramenopile Blastocystis, a human anaerobic parasite 
Genome Biology  2011;12(3):R29.
Background
Blastocystis is a highly prevalent anaerobic eukaryotic parasite of humans and animals that is associated with various gastrointestinal and extraintestinal disorders. Epidemiological studies have identified different subtypes but no one subtype has been definitively correlated with disease.
Results
Here we report the 18.8 Mb genome sequence of a Blastocystis subtype 7 isolate, which is the smallest stramenopile genome sequenced to date. The genome is highly compact and contains intriguing rearrangements. Comparisons with other available stramenopile genomes (plant pathogenic oomycete and diatom genomes) revealed effector proteins potentially involved in the adaptation to the intestinal environment, which were likely acquired via horizontal gene transfer. Moreover, Blastocystis living in anaerobic conditions harbors mitochondria-like organelles. An incomplete oxidative phosphorylation chain, a partial Krebs cycle, amino acid and fatty acid metabolisms and an iron-sulfur cluster assembly are all predicted to occur in these organelles. Predicted secretory proteins possess putative activities that may alter host physiology, such as proteases, protease-inhibitors, immunophilins and glycosyltransferases. This parasite also possesses the enzymatic machinery to tolerate oxidative bursts resulting from its own metabolism or induced by the host immune system.
Conclusions
This study provides insights into the genome architecture of this unusual stramenopile. It also proposes candidate genes with which to study the physiopathology of this parasite and thus may lead to further investigations into Blastocystis-host interactions.
doi:10.1186/gb-2011-12-3-r29
PMCID: PMC3129679  PMID: 21439036
3.  Efficient targeted transcript discovery via array-based normalization of RACE libraries 
Nature methods  2008;5(7):629-635.
RACE (Rapid Amplification of cDNA Ends) is a widely used approach for transcript identification. Random clone selection from the RACE mixture, however, is an ineffective sampling strategy if the dynamic range of transcript abundances is large. Here, we describe a strategy that uses array hybridization to improve sampling efficiency of human transcripts. The products of the RACE reaction are hybridized onto tiling arrays, and the exons detected are used to delineate a series of RT-PCR reactions, through which the original RACE mixture is segregated into simpler RT-PCR reactions. These are independently cloned, and randomly selected clones are sequenced. This approach is superior to direct cloning and sequencing of RACE products: it specifically targets novel transcripts, and often results in overall normalization of transcript abundances. We show theoretically and experimentally that this strategy leads indeed to efficient sampling of novel transcripts, and we investigate multiplexing it by pooling RACE reactions from multiple interrogated loci prior to hybridization.
doi:10.1038/nmeth.1216
PMCID: PMC2713501  PMID: 18500348
4.  Annotating genomes with massive-scale RNA sequencing 
Genome Biology  2008;9(12):R175.
A method for de novo genome annotation using high-throughput cDNA sequencing data.
Next generation technologies enable massive-scale cDNA sequencing (so-called RNA-Seq). Mainly because of the difficulty of aligning short reads on exon-exon junctions, no attempts have been made so far to use RNA-Seq for building gene models de novo, that is, in the absence of a set of known genes and/or splicing events. We present G-Mo.R-Se (Gene Modelling using RNA-Seq), an approach aimed at building gene models directly from RNA-Seq and demonstrate its utility on the grapevine genome.
doi:10.1186/gb-2008-9-12-r175
PMCID: PMC2646279  PMID: 19087247
5.  EGASP: the human ENCODE Genome Annotation Assessment Project 
Genome Biology  2006;7(Suppl 1):S2.
Background
We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment.
Results
The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified.
Conclusion
This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.
doi:10.1186/gb-2006-7-s1-s2
PMCID: PMC1810551  PMID: 16925836
6.  GENCODE: producing a reference annotation for ENCODE 
Genome Biology  2006;7(Suppl 1):S4.
Background
The GENCODE consortium was formed to identify and map all protein-coding genes within the ENCODE regions. This was achieved by a combination of initial manual annotation by the HAVANA team, experimental validation by the GENCODE consortium and a refinement of the annotation based on these experimental results.
Results
The GENCODE gene features are divided into eight different categories of which only the first two (known and novel coding sequence) are confidently predicted to be protein-coding genes. 5' rapid amplification of cDNA ends (RACE) and RT-PCR were used to experimentally verify the initial annotation. Of the 420 coding loci tested, 229 RACE products have been sequenced. They supported 5' extensions of 30 loci and new splice variants in 50 loci. In addition, 46 loci without evidence for a coding sequence were validated, consisting of 31 novel and 15 putative transcripts. We assessed the comprehensiveness of the GENCODE annotation by attempting to validate all the predicted exon boundaries outside the GENCODE annotation. Out of 1,215 tested in a subset of the ENCODE regions, 14 novel exon pairs were validated, only two of them in intergenic regions.
Conclusion
In total, 487 loci, of which 434 are coding, have been annotated as part of the GENCODE reference set available from the UCSC browser. Comparison of GENCODE annotation with RefSeq and ENSEMBL show only 40% of GENCODE exons are contained within the two sets, which is a reflection of the high number of alternative splice forms with unique exons annotated. Over 50% of coding loci have been experimentally verified by 5' RACE for EGASP and the GENCODE collaboration is continuing to refine its annotation of 1% human genome with the aid of experimental validation.
doi:10.1186/gb-2006-7-s1-s4
PMCID: PMC1810553  PMID: 16925838
7.  Evaluation and selection of tandem repeat loci for a Brucella MLVA typing assay 
BMC Microbiology  2006;6:9.
Background
The classification of Brucella into species and biovars relies on phenotypic characteristics and sometimes raises difficulties in the interpretation of the results due to an absence of standardization of the typing reagents. In addition, the resolution of this biotyping is moderate and requires the manipulation of the living agent. More efficient DNA-based methods are needed, and this work explores the suitability of multiple locus variable number tandem repeats analysis (MLVA) for both typing and species identification.
Results
Eighty tandem repeat loci predicted to be polymorphic by genome sequence analysis of three available Brucella genome sequences were tested for polymorphism by genotyping 21 Brucella strains (18 reference strains representing the six 'classical' species and all biovars as well as 3 marine mammal strains currently recognized as members of two new species). The MLVA data efficiently cluster the strains as expected according to their species and biovar. For practical use, a subset of 15 loci preserving this clustering was selected and applied to the typing of 236 isolates. Using this MLVA-15 assay, the clusters generated correspond to the classical biotyping scheme of Brucella spp. The 15 markers have been divided into two groups, one comprising 8 user-friendly minisatellite markers with a good species identification capability (panel 1) and another complementary group of 7 microsatellite markers with higher discriminatory power (panel 2).
Conclusion
The MLVA-15 assay can be applied to large collections of Brucella strains with automated or manual procedures, and can be proposed as a complement, or even a substitute, of classical biotyping methods. This is facilitated by the fact that MLVA is based on non-infectious material (DNA) whereas the biotyping procedure itself requires the manipulation of the living agent. The data produced can be queried on a dedicated MLVA web service site.
doi:10.1186/1471-2180-6-9
PMCID: PMC1513380  PMID: 16469109
8.  Variable Number of Tandem Repeats in Salmonella enterica subsp. enterica for Typing Purposes 
Journal of Clinical Microbiology  2004;42(12):5722-5730.
The genomic sequences of Salmonella enterica subsp. enterica strains CT18, Ty2 (serovar Typhi), and LT2 (serovar Typhimurium) were analyzed for potential variable number tandem repeats (VNTRs). A multiple-locus VNTR analysis (MLVA) of 99 strains of S. enterica supsp. enterica based on 10 VNTRs distinguished 52 genotypes and placed them into four groups. All strains tested were independent human isolates from France and did not reflect isolates from outbreak episodes. Of these 10 VNTRs, 7 showed variability within serovar Typhi, whereas 1 showed variability within serovar Typhimurium. Four VNTRs showed high Nei's diversity indices (DIs) of 0.81 to 0.87 within serovar Typhi (n = 27). Additionally, three of these more variable VNTRs showed DIs of 0.18 to 0.58 within serovar Paratyphi A (n = 10). The VNTR polymorphic site within multidrug-resistant (MDR) serovar Typhimurium isolates (n = 39; resistance to ampicillin, chloramphenicol, spectinomycin, sulfonamides, and tetracycline) showed a DI of 0.81. Cluster analysis not only identified three genetically distinct groups consistent with the present serovar classification of salmonellae (serovars Typhi, Paratyphi A, and Typhimurium) but also discriminated 25 subtypes (93%) within serovar Typhi isolates. The analysis discriminated only eight subtypes within serovar Typhimurium isolates resistant to ampicillin, chloramphenicol, spectinomycin, sulfonamides, and tetracycline, possibly reflecting the emergence in the mid-1990s of the DT104 phage type, which often displays such an MDR spectrum. Coupled with the ongoing improvements in automated procedures offered by capillary electrophoresis, use of these markers is proposed in further investigations of the potential of MLVA in outbreaks of salmonellosis, especially outbreaks of typhoid fever.
doi:10.1128/JCM.42.12.5722-5730.2004
PMCID: PMC535243  PMID: 15583305
9.  High resolution, on-line identification of strains from the Mycobacterium tuberculosis complex based on tandem repeat typing 
BMC Microbiology  2002;2:37.
Background
Currently available reference methods for the molecular epidemiology of the Mycobacterium tuberculosis complex either lack sensitivity or are still too tedious and slow for routine application. Recently, tandem repeat typing has emerged as a potential alternative. This report contributes to the development of tandem repeat typing for M. tuberculosis by summarising the existing data, developing additional markers, and setting up a freely accessible, fast, and easy to use, internet-based service for strain identification.
Results
A collection of 21 VNTRs incorporating 13 previously described loci and 8 newly evaluated markers was used to genotype 90 strains from the M. tuberculosis complex (M. tuberculosis (64 strains), M. bovis (9 strains including 4 BCG representatives), M. africanum (17 strains)). Eighty-four different genotypes are defined. Clustering analysis shows that the M. africanum strains fall into three main groups, one of which is closer to the M. tuberculosis strains, and an other one is closer to the M. bovis strains. The resulting data has been made freely accessible over the internet to allow direct strain identification queries.
Conclusions
Tandem-repeat typing is a PCR-based assay which may prove to be a powerful complement to the existing epidemiological tools for the M. tuberculosis complex. The number of markers to type depends on the identification precision which is required, so that identification can be achieved quickly at low cost in terms of consumables, technical expertise and equipment.
doi:10.1186/1471-2180-2-37
PMCID: PMC140014  PMID: 12456266
10.  A tandem repeats database for bacterial genomes: application to the genotyping of Yersinia pestis and Bacillus anthracis 
BMC Microbiology  2001;1:2.
Background
Some pathogenic bacteria are genetically very homogeneous, making strain discrimination difficult. In the last few years, tandem repeats have been increasingly recognized as markers of choice for genotyping a number of pathogens. The rapid evolution of these structures appears to contribute to the phenotypic flexibility of pathogens. The availability of whole-genome sequences has opened the way to the systematic evaluation of tandem repeats diversity and application to epidemiological studies.
Results
This report presents a database () of tandem repeats from publicly available bacterial genomes which facilitates the identification and selection of tandem repeats. We illustrate the use of this database by the characterization of minisatellites from two important human pathogens, Yersinia pestis and Bacillus anthracis. In order to avoid simple sequence contingency loci which may be of limited value as epidemiological markers, and to provide genotyping tools amenable to ordinary agarose gel electrophoresis, only tandem repeats with repeat units at least 9 bp long were evaluated. Yersinia pestis contains 64 such minisatellites in which the unit is repeated at least 7 times. An additional collection of 12 loci with at least 6 units, and a high internal conservation were also evaluated. Forty-nine are polymorphic among five Yersinia strains (twenty-five among three Y. pestis strains). Bacillus anthracis contains 30 comparable structures in which the unit is repeated at least 10 times. Half of these tandem repeats show polymorphism among the strains tested.
Conclusions
Analysis of the currently available bacterial genome sequences classifies Bacillus anthracis and Yersinia pestis as having an average (approximately 30 per Mb) density of tandem repeat arrays longer than 100 bp when compared to the other bacterial genomes analysed to date. In both cases, testing a fraction of these sequences for polymorphism was sufficient to quickly develop a set of more than fifteen informative markers, some of which show a very high degree of polymorphism. In one instance, the polymorphism information content index reaches 0.82 with allele length covering a wide size range (600-1950 bp), and nine alleles resolved in the small number of independent Bacillus anthracis strains typed here.
PMCID: PMC31411  PMID: 11299044

Results 1-10 (10)