1.  OikoBase: a genomics and developmental transcriptomics resource for the urochordate Oikopleura dioica 
Nucleic Acids Research  2012;41(D1):D845-D853.
We report the development of OikoBase (http://oikoarrays.biology.uiowa.edu/Oiko/), a tiling array-based genome browser resource for Oikopleura dioica, a metazoan belonging to the urochordates, the closest extant group to vertebrates. OikoBase facilitates retrieval and mining of a variety of useful genomics information. First, it includes a genome browser which interrogates 1260 genomic sequence scaffolds and features gene, transcript and CDS annotation tracks. Second, we annotated gene models with gene ontology (GO) terms and InterPro domains which are directly accessible in the browser with links to their entries in the GO (http://www.geneontology.org/) and InterPro (http://www.ebi.ac.uk/interpro/) databases, and we provide transcript and peptide links for sequence downloads. Third, we introduce the transcriptomics of a comprehensive set of developmental stages of O. dioica at high resolution and provide downloadable gene expression data for all developmental stages. Fourth, we incorporate a BLAST tool to identify homologs of genes and proteins. Finally, we include a tutorial that describes how to use OikoBase as well as a link to detailed methods, explaining the data generation and analysis pipeline. OikoBase will provide a valuable resource for research in chordate development, genome evolution and plasticity and the molecular ecology of this important marine planktonic organism.
doi:10.1093/nar/gks1159
PMCID: PMC3531137  PMID: 23185044
2.  The Paramecium Germline Genome Provides a Niche for Intragenic Parasitic DNA: Evolutionary Dynamics of Internal Eliminated Sequences 
PLoS Genetics  2012;8(10):e1002984.
Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of ∼45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a ∼10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. 
We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the genome in which parasitic DNA is not usually tolerated.
Author Summary
Ciliates are unicellular eukaryotes that rearrange their genomes at every sexual generation when a new somatic macronucleus, responsible for gene expression, develops from a copy of the germline micronucleus. In Paramecium, assembly of a functional somatic genome requires precise excision of interstitial DNA segments, the Internal Eliminated Sequences (IES), involving a domesticated piggyBac transposase, PiggyMac. To study IES origin and evolution, we sequenced germline DNA and identified 45,000 IESs. We found that at least some of these unique-copy elements are decayed Tc1/mariner transposons and that IES insertion is likely an ongoing process. After insertion, elements decay rapidly by accumulation of deletions and substitutions. The 93% of IESs shorter than 150 bp display a remarkable size distribution with a periodicity of 10 bp, the helical repeat of double-stranded DNA, consistent with the idea that evolution has only retained IESs that can form a double-stranded DNA loop during assembly of an excision complex. We propose that the ancient domestication of a piggyBac transposase, which provided a precise excision mechanism, enabled transposons to subsequently invade Paramecium coding sequences, a fraction of the genome that does not usually tolerate parasitic DNA.
doi:10.1371/journal.pgen.1002984
PMCID: PMC3464196  PMID: 23071448
3.  Complete Genome Sequence of the Highly Hemolytic Strain Bacillus cereus F837/76 
Journal of Bacteriology  2012;194(6):1630.
The highly hemolytic strain Bacillus cereus F837/76 was isolated in 1976 from a contaminated prostate wound. The complete nucleotide sequence of this strain, reported here, shows nearly 36,500 single-nucleotide differences from the closest sequenced strain, Bacillus thuringiensis Al Hakam. F837/76 also contains a 10-kb plasmid that was not detected in the Al Hakam strain.
doi:10.1128/JB.06719-11
PMCID: PMC3294841  PMID: 22374959
4.  Gene functionalities and genome structure in Bathycoccus prasinos reflect cellular specializations at the base of the green lineage 
Genome Biology  2012;13(8):R74.
Background
Bathycoccus prasinos is an extremely small cosmopolitan marine green alga whose cells are covered with intricate spider's web patterned scales that develop within the Golgi cisternae before their transport to the cell surface. The objective of this work is to sequence and analyze its genome, and to present a comparative analysis with other known genomes of the green lineage.
Results
Its small genome of 15 Mb consists of 19 chromosomes and lacks transposons. Although 70% of all B. prasinos genes share similarities with other Viridiplantae genes, up to 428 genes were probably acquired by horizontal gene transfer, mainly from other eukaryotes. Two chromosomes, one big and one small, are atypical, an unusual synapomorphic feature within the Mamiellales. Genes on these atypical outlier chromosomes show lower GC content and a significant fraction of putative horizontal gene transfer genes. Whereas the small outlier chromosome lacks colinearity with other Mamiellales and contains many unknown genes without homologs in other species, the big outlier shows a higher intron content, increased expression levels and a unique clustering pattern of housekeeping functionalities. Four gene families are highly expanded in B. prasinos, including sialyltransferases, sialidases, ankyrin repeats and zinc ion-binding genes, and we hypothesize that these genes are associated with the process of scale biogenesis.
Conclusion
The minimal genomes of the Mamiellophyceae provide a baseline for evolutionary and functional analyses of metabolic processes in green plants.
doi:10.1186/gb-2012-13-8-r74
PMCID: PMC3491373  PMID: 22925495
5.  Sequencing of the smallest Apicomplexan genome from the human pathogen Babesia microti
Nucleic Acids Research  2012;40(18):9102-9114.
We have sequenced the genome of the emerging human pathogen Babesia microti and compared it with that of other protozoa. B. microti has the smallest nuclear genome among all Apicomplexan parasites sequenced to date with three chromosomes encoding ∼3500 polypeptides, several of which are species specific. Genome-wide phylogenetic analyses indicate that B. microti is significantly distant from all species of Babesidae and Theileridae and defines a new clade in the phylum Apicomplexa. Furthermore, unlike all other Apicomplexa, its mitochondrial genome is circular. Genome-scale reconstruction of functional networks revealed that B. microti has the minimal metabolic requirement for intraerythrocytic protozoan parasitism. B. microti multigene families differ from those of other protozoa in both the copy number and organization. Two lateral transfer events with significant metabolic implications occurred during the evolution of this parasite. The genomic sequencing of B. microti identified several targets suitable for the development of diagnostic assays and novel therapies for human babesiosis.
doi:10.1093/nar/gks700
PMCID: PMC3467087  PMID: 22833609
6.  Expression sequence tag library derived from peripheral blood mononuclear cells of the Chlorocebus sabaeus
BMC Genomics  2012;13:279.
Background
African Green Monkeys (AGM) are amongst the most frequently used nonhuman primate models in clinical and biomedical research; nevertheless, only a few genomic resources exist for this species. Such information would be essential for the development of dedicated new generation technologies in fundamental and pre-clinical research using this model, and would deliver new insights into primate evolution.
Results
We have exhaustively sequenced an Expression Sequence Tag (EST) library made from a pool of Peripheral Blood Mononuclear Cells from sixteen Chlorocebus sabaeus monkeys. Twelve of them were infected with the Simian Immunodeficiency Virus. The mononuclear cells were either left unstimulated or stimulated in vitro with Concanavalin A, with lipopolysaccharides, or through mixed lymphocyte reaction in order to generate a representative and broad library of expressed sequences in immune cells. We report here 37,787 sequences, which were assembled into 14,410 contigs representing an estimated 12% of the C. sabaeus transcriptome. Using data from primate genome databases, 9,029 assembled sequences from C. sabaeus could be annotated. Sequences have been systematically aligned with ten cDNA references of primate species including Homo sapiens, Pan troglodytes, and Macaca mulatta to identify ortholog transcripts. For 506 transcripts, sequences were quasi-complete. In addition, 6,576 transcript fragments are potentially specific to C. sabaeus or correspond to not yet described primate genes.
Conclusions
The EST library we provide here will prove useful in gene annotation efforts for future sequencing of the African Green Monkey genomes. Furthermore, this library, which particularly well represents immunological and hematological gene expression, will be an important resource for the comparative analysis of gene expression in clinically relevant nonhuman primate and human research.
doi:10.1186/1471-2164-13-279
PMCID: PMC3539953  PMID: 22726727
7.  Complete Genome Sequence of the Clinical Streptococcus salivarius Strain CCHSS3
Journal of Bacteriology  2011;193(18):5041-5042.
Streptococcus salivarius is a commensal species commonly found in the human oral cavity and digestive tract, although it is also associated with human infections such as meningitis, endocarditis, and bacteremia. Here, we report the complete sequence of S. salivarius strain CCHSS3, isolated from human blood.
doi:10.1128/JB.05416-11
PMCID: PMC3165645  PMID: 21742894
8.  Complete Genome Sequence of the Commensal Streptococcus salivarius Strain JIM8777
Journal of Bacteriology  2011;193(18):5024-5025.
The commensal bacterium Streptococcus salivarius is a prevalent species of the human oropharyngeal tract with an important role in oral ecology. Here, we report the complete 2.2-Mb genome sequence and annotation of strain JIM8777, which was recently isolated from the oral cavity of a healthy, dentate infant.
doi:10.1128/JB.05390-11
PMCID: PMC3165664  PMID: 21742871
9.  Pichia sorbitophila, an Interspecies Yeast Hybrid, Reveals Early Steps of Genome Resolution After Polyploidization 
G3: Genes|Genomes|Genetics  2012;2(2):299-311.
Polyploidization is an important process in the evolution of eukaryotic genomes, but ensuing molecular mechanisms remain to be clarified. Autopolyploidization or whole-genome duplication events frequently are resolved in resulting lineages by the loss of single genes from most duplicated pairs, causing transient gene dosage imbalance and accelerating speciation through meiotic infertility. Allopolyploidization or formation of interspecies hybrids raises the problem of genetic incompatibility (Bateson-Dobzhansky-Muller effect) and may be resolved by the accumulation of mutational changes in resulting lineages. In this article, we show that an osmotolerant yeast species, Pichia sorbitophila, recently isolated in a concentrated sorbitol solution in industry, illustrates this last situation. Its genome is a mosaic of homologous and homeologous chromosomes, or parts thereof, that corresponds to a recently formed hybrid in the process of evolution. The respective parental contributions to this genome were characterized using existing variations in GC content. The genomic changes that occurred during the short period since hybrid formation were identified (e.g., loss of heterozygosity, unilateral loss of rDNA, reciprocal exchange) and distinguished from those undergone by the two parental genomes after separation from their common ancestor (i.e., NUMT (NUclear sequences of MiTochondrial origin) insertions, gene acquisitions, gene location movements, reciprocal translocation). We found that the physiological characteristics of this new yeast species are determined by specific but unequal contributions of its two parents, one of which could be identified as very closely related to an extant Pichia farinosa strain.
doi:10.1534/g3.111.000745
PMCID: PMC3284337  PMID: 22384408
osmotolerant yeast P. sorbitophila; allopolyploidy; hybridization; genome evolution; loss of heterozygosity
10.  Whole Genome Profiling provides a robust framework for physical mapping and sequencing in the highly complex and repetitive wheat genome 
BMC Genomics  2012;13:47.
Background
Sequencing projects using a clone-by-clone approach require the availability of a robust physical map. The SNaPshot technology, based on pair-wise comparisons of restriction fragment sizes, has been used recently to build the first physical map of a wheat chromosome and to complete the maize physical map. However, restriction fragment sizes shared randomly between two non-overlapping BACs often lead to chimeric contigs and mis-assembled BACs in such large and repetitive genomes. Whole Genome Profiling (WGP™) was developed recently as a new sequence-based physical mapping technology and has the potential to limit this problem.
Results
A subset of the wheat 3B chromosome BAC library covering 230 Mb was used to establish a WGP physical map and to compare it to a map obtained with the SNaPshot technology. We first adapted the WGP-based assembly methodology to cope with the complexity of the wheat genome. Then, the results showed that the WGP map covers the same length as the SNaPshot map but with 30% fewer contigs and, more importantly, with 3.5 times fewer mis-assembled BACs. Finally, we evaluated the benefit of integrating WGP tags in different sequence assemblies obtained after Roche/454 sequencing of BAC pools. We showed that while WGP tag integration improves assemblies performed with unpaired reads and with paired-end reads at low coverage, it does not significantly improve sequence assemblies performed at high coverage (25x) with paired-end reads.
Conclusions
Our results demonstrate that, with a suitable assembly methodology, WGP builds more robust physical maps than the SNaPshot technology in wheat and that WGP can be adapted to any genome. Moreover, WGP tag integration in sequence assemblies improves low quality assembly. However, to achieve a high quality draft sequence assembly, a sequencing depth of 25x paired-end reads is required, at which point WGP tag integration does not provide additional scaffolding value. Finally, we suggest that WGP tags can support the efficient sequencing of BAC pools by enabling reliable assignment of sequence scaffolds to their BAC of origin, a feature that is of great interest when using BAC pooling strategies to reduce the cost of sequencing large genomes.
doi:10.1186/1471-2164-13-47
PMCID: PMC3311077  PMID: 22289472
11.  Feminizing Wolbachia: a transcriptomics approach with insights on the immune response genes in Armadillidium vulgare 
BMC Microbiology  2012;12(Suppl 1):S1.
Background
Wolbachia are vertically transmitted bacteria known to be the most widespread endosymbiont in arthropods. They induce various alterations of the reproduction of their host, including feminization of genetic males in isopod crustaceans. In the pill bug Armadillidium vulgare, the presence of Wolbachia is also associated with detrimental effects on host fertility and lifespan. Deleterious effects have been demonstrated on hemocyte density, phenoloxidase activity, and natural hemolymph septicemia, suggesting that infected individuals could have defective immune capacities. Since nothing is known about the molecular mechanisms involved in Wolbachia-A. vulgare interactions and its secondary immunocompetence modulation, we developed a transcriptomics strategy and compared A. vulgare gene expression between Wolbachia-infected animals (i.e., “symbiotic” animals) and uninfected ones (i.e., “asymbiotic” animals) as well as between animals challenged or not challenged by a pathogenic bacterium.
Results
Since very little genetic data is available on A. vulgare, we produced several EST libraries and generated a total of 28,606 ESTs. Analyses of these ESTs revealed that immune processes were over-represented in most experimental conditions (responses to a symbiont and to a pathogen). Considering canonical crustacean immune pathways, these genes encode antimicrobial peptides or are involved in pathogen recognition, detoxification, and autophagy. By RT-qPCR, we demonstrated a general trend towards gene under-expression in symbiotic whole animals and ovaries whereas the same gene set tends to be over-expressed in symbiotic immune tissues.
Conclusion
This study allowed us to generate the first reference transcriptome ever obtained in the Isopoda group and to identify genes involved in the major known crustacean immune pathways encompassing cellular and humoral responses. Expression of immune-related genes revealed a modulation of host immunity when females are infected by Wolbachia, including in ovaries, the crucial tissue for the Wolbachia route of transmission.
doi:10.1186/1471-2180-12-S1-S1
PMCID: PMC3287506  PMID: 22375708
12.  Host gene response to endosymbiont and pathogen in the cereal weevil Sitophilus oryzae 
BMC Microbiology  2012;12(Suppl 1):S14.
Background
Insects thriving on nutritionally poor habitats have integrated mutualistic intracellular symbiotic bacteria (endosymbionts) in a bacteria-bearing tissue (the bacteriome) that isolates the endosymbionts and protects them against a host systemic immune response. Whilst the metabolic and physiological features of long-term insect associations have been investigated in detail over the past decades, cellular and immune regulations that determine the host response to endosymbionts and pathogens have attracted interest more recently.
Results
To investigate bacteriome cellular specificities and weevil immune responses to bacteria, we have constructed and sequenced 7 cDNA libraries from Sitophilus oryzae whole larvae and bacteriomes. Bioinformatic analysis of 26,886 ESTs led to the generation of 8,941 weevil unigenes. Based on in silico analysis and on the examination of genes involved in the cellular pathways of potential interest to intracellular symbiosis (i.e. cell growth and apoptosis, autophagy, immunity), we have selected and analyzed 29 genes using qRT-PCR, taking into consideration bacteriome specificity and symbiosis impact on the host response to pathogens. We show that the bacteriome tissue accumulates transcripts from genes involved in cellular development and survival, such as the apoptotic inhibitors iap2 and iap3, and endosomal fusion and trafficking, such as Rab7, Hrs, and SNARE. As regards our investigation into immunity, we first strengthen the bacteriome immunomodulation previously reported in S. zeamais. We show that the sarcotoxin, the c-type lysozyme, and the wpgrp2 genes are downregulated in the S. oryzae bacteriome, when compared to aposymbiotic insects and insects challenged with E. coli. Secondly, transcript level comparison between symbiotic and aposymbiotic larvae provides evidence that the immune systemic response to pathogens is decreased in symbiotic insects, as shown by the relatively high expression of wpgrp2, wpgrp3, coleoptericin-B, diptericin, and sarcotoxin genes in aposymbiotic insects.
Conclusions
Library sequencing significantly increased the number of unigenes, allowing for improved functional and genetic investigations in the cereal weevil S. oryzae. Transcriptomic analyses support selective and local immune gene expression in the bacteriome tissue and uncover cellular pathways that are of potential interest to bacteriocyte survival and homeostasis. Bacterial challenge experiments have revealed that the systemic immune response would be less induced in a symbiotic insect, thus highlighting new perspectives on host immunity in long-term invertebrate co-evolutionary associations.
doi:10.1186/1471-2180-12-S1-S14
PMCID: PMC3287511  PMID: 22375912
13.  Influence of Wolbachia on host gene expression in an obligatory symbiosis 
BMC Microbiology  2012;12(Suppl 1):S7.
Background
Wolbachia are intracellular bacteria known to be facultative reproductive parasites of numerous arthropod hosts. Apart from these reproductive manipulations, recent findings indicate that Wolbachia may also modify the host’s physiology, notably its immune function. In the parasitoid wasp, Asobara tabida, Wolbachia is necessary for oogenesis completion, and aposymbiotic females are unable to produce viable offspring. The absence of egg production is also associated with an increase in programmed cell death in the ovaries of aposymbiotic females, suggesting that a mechanism that ensures the maintenance of Wolbachia in the wasp could also be responsible for this dependence. In order to decipher the general mechanisms underlying host-Wolbachia interactions and the origin of the dependence, we developed transcriptomic approaches to compare gene expression in symbiotic and aposymbiotic individuals.
Results
As no genetic data were available on A. tabida, we constructed several Expressed Sequence Tag (EST) libraries, and obtained 12,551 unigenes from this species. Gene expression was compared between symbiotic and aposymbiotic ovaries through in silico analysis and in vitro subtraction (SSH). As pleiotropic functions involved in immunity and development could play a major role in the establishment of dependence, the expression of genes involved in oogenesis, programmed cell death (PCD) and immunity (broad sense) was analyzed by quantitative RT-PCR. We showed that Wolbachia might interfere with these numerous biological processes, in particular some related to oxidative stress regulation. We also showed that Wolbachia may interact with immune gene expression to ensure its persistence within the host.
Conclusions
This study allowed us to constitute the first major dataset of the transcriptome of A. tabida, a species that is a model system for both host/Wolbachia and host/parasitoid interactions. More specifically, our results highlighted that symbiont infection may interfere with numerous pivotal processes at the individual level, suggesting that the impact of Wolbachia should also be investigated beyond reproductive manipulations.
doi:10.1186/1471-2180-12-S1-S7
PMCID: PMC3287518  PMID: 22376153
14.  A Holistic Approach to Marine Eco-Systems Biology 
PLoS Biology  2011;9(10):e1001177.
The structure, robustness, and dynamics of ocean plankton ecosystems remain poorly understood due to sampling, analysis, and computational limitations. The Tara Oceans consortium organizes expeditions to help fill this gap at the global level.
doi:10.1371/journal.pbio.1001177
PMCID: PMC3196472  PMID: 22028628
15.  Genomic Analysis of the Necrotrophic Fungal Pathogens Sclerotinia sclerotiorum and Botrytis cinerea 
Amselem, Joelle | Cuomo, Christina A. | van Kan, Jan A. L. | Viaud, Muriel | Benito, Ernesto P. | Couloux, Arnaud | Coutinho, Pedro M. | de Vries, Ronald P. | Dyer, Paul S. | Fillinger, Sabine | Fournier, Elisabeth | Gout, Lilian | Hahn, Matthias | Kohn, Linda | Lapalu, Nicolas | Plummer, Kim M. | Pradier, Jean-Marc | Quévillon, Emmanuel | Sharon, Amir | Simon, Adeline | ten Have, Arjen | Tudzynski, Bettina | Tudzynski, Paul | Wincker, Patrick | Andrew, Marion | Anthouard, Véronique | Beever, Ross E. | Beffa, Rolland | Benoit, Isabelle | Bouzid, Ourdia | Brault, Baptiste | Chen, Zehua | Choquer, Mathias | Collémare, Jérome | Cotton, Pascale | Danchin, Etienne G. | Da Silva, Corinne | Gautier, Angélique | Giraud, Corinne | Giraud, Tatiana | Gonzalez, Celedonio | Grossetete, Sandrine | Güldener, Ulrich | Henrissat, Bernard | Howlett, Barbara J. | Kodira, Chinnappa | Kretschmer, Matthias | Lappartient, Anne | Leroch, Michaela | Levis, Caroline | Mauceli, Evan | Neuvéglise, Cécile | Oeser, Birgitt | Pearson, Matthew | Poulain, Julie | Poussereau, Nathalie | Quesneville, Hadi | Rascle, Christine | Schumacher, Julia | Ségurens, Béatrice | Sexton, Adrienne | Silva, Evelyn | Sirven, Catherine | Soanes, Darren M. | Talbot, Nicholas J. | Templeton, Matt | Yandava, Chandri | Yarden, Oded | Zeng, Qiandong | Rollins, Jeffrey A. | Lebrun, Marc-Henri | Dickman, Marty | Richardson, Paul M.
PLoS Genetics  2011;7(8):e1002230.
Sclerotinia sclerotiorum and Botrytis cinerea are closely related necrotrophic plant pathogenic fungi notable for their wide host ranges and environmental persistence. These attributes have made these species models for understanding the complexity of necrotrophic, broad host-range pathogenicity. Despite their similarities, the two species differ in mating behaviour and the ability to produce asexual spores. We have sequenced the genomes of one strain of S. sclerotiorum and two strains of B. cinerea. The comparative analysis of these genomes relative to one another and to other sequenced fungal genomes is provided here. Their 38–39 Mb genomes include 11,860–14,270 predicted genes, which share 83% amino acid identity on average between the two species. We have mapped the S. sclerotiorum assembly to 16 chromosomes and found large-scale co-linearity with the B. cinerea genomes. Seven percent of the S. sclerotiorum genome comprises transposable elements compared to <1% of B. cinerea. The arsenal of genes associated with necrotrophic processes is similar between the species, including genes involved in plant cell wall degradation and oxalic acid production. Analysis of secondary metabolism gene clusters revealed an expansion in number and diversity of B. cinerea–specific secondary metabolites relative to S. sclerotiorum. The potential diversity in secondary metabolism might be involved in adaptation to specific ecological niches. Comparative genome analysis revealed the basis of differing sexual mating compatibility systems between S. sclerotiorum and B. cinerea. The organization of the mating-type loci differs, and their structures provide evidence for the evolution of heterothallism from homothallism. These data shed light on the evolutionary and mechanistic bases of the genetically complex traits of necrotrophic pathogenicity and sexual mating. 
This resource should facilitate the functional studies designed to better understand what makes these fungi such successful and persistent pathogens of agronomic crops.
Author Summary
Sclerotinia sclerotiorum and Botrytis cinerea are notorious plant pathogenic fungi with very wide host ranges. They cause vast economic damage during crop cultivation as well as in harvested produce. These fungi are typical examples of necrotrophs: they first kill host plant cells and then colonize the dead tissue. The genome sequences of the two fungi were determined in order to examine commonalities in structure and content and in order to find unique features that may distinguish them from other pathogenic fungi and from saprotrophic fungi. The genomes show high sequence identity and a similar arrangement of genes. S. sclerotiorum and B. cinerea differ in their regulation of sexual reproduction, and the genetic basis and its evolution could be explained from the genome sequence. The genome sequence revealed a striking difference in the number and diversity of secondary metabolism gene clusters, which may be involved in the adaptation to different ecological niches. Altogether, there were no unique features in the genomes of S. sclerotiorum and B. cinerea that could be identified as “silver bullets,” which distinguish these aggressive pathogens from other pathogenic and non-pathogenic fungi. These findings reinforce the quantitative, multigenic nature of necrotrophic pathogenesis.
doi:10.1371/journal.pgen.1002230
PMCID: PMC3158057  PMID: 21876677
16.  Analysis of BAC-end sequences in rainbow trout: Content characterization and assessment of synteny between trout and other fish genomes 
BMC Genomics  2011;12:314.
Background
Rainbow trout (Oncorhynchus mykiss) are cultivated worldwide for aquaculture production and are widely used as a model species to gain knowledge of many aspects of fish biology. The common ancestor of the salmonids experienced a whole genome duplication event, making extant salmonids such as the rainbow trout an excellent model for studying the evolution of tetraploidization and re-diploidization in vertebrates. However, the lack of a reference genome sequence hampers research progress for both academic and applied purposes. In order to enrich the genomic tools already available in this species and provide further insight on the complexity of its genome, we sequenced a large number of rainbow trout BAC-end sequences (BES) and characterized their contents.
Results
A total of 176,485 high quality BES were generated, representing approximately 4% of the trout genome. BES analyses identified 6,848 simple sequence repeats (SSRs), of which 3,854 had high quality flanking sequences for PCR primer design. The first rainbow trout repeat elements database (INRA RT rep1.0) containing 735 putative repeat elements was developed, and identified almost 59.5% of the BES database in base-pairs as repetitive sequence. Approximately 55% of the BES reads (97,846) had more than 100 base pairs of contiguous non-repetitive sequences. The fractions of the 97,846 non-repetitive trout BES reads that had significant BLASTN hits against the zebrafish, medaka and stickleback genome databases were 15%, 16.2% and 17.9%, respectively, while the fractions of the non-repetitive BES reads that had significant BLASTX hits against the zebrafish, medaka, and stickleback protein databases were 10.7%, 9.5% and 9.5%, respectively. Comparative genomics using paired BAC-ends revealed several regions of conserved synteny across all the fish species analyzed in this study.
Conclusions
The characterization of BES provided insights on the rainbow trout genome. The discovery of specific repeat elements will facilitate analyses of sequence content (e.g. for SNP discovery and for transcriptome characterization) and future genome sequence assemblies. The numerous microsatellites will facilitate integration of the linkage and physical maps and serve as a valuable resource for fine mapping QTL and positional cloning of genes affecting aquaculture production traits. Furthermore, comparative genomics through BES can be used for identifying positional candidate genes from QTL mapping studies, aiding future assembly of a reference genome sequence, and elucidating sequence content and complexity in the rainbow trout genome.
doi:10.1186/1471-2164-12-314
PMCID: PMC3125269  PMID: 21672188
17.  Genome sequence of the stramenopile Blastocystis, a human anaerobic parasite 
Genome Biology  2011;12(3):R29.
Background
Blastocystis is a highly prevalent anaerobic eukaryotic parasite of humans and animals that is associated with various gastrointestinal and extraintestinal disorders. Epidemiological studies have identified different subtypes but no one subtype has been definitively correlated with disease.
Results
Here we report the 18.8 Mb genome sequence of a Blastocystis subtype 7 isolate, which is the smallest stramenopile genome sequenced to date. The genome is highly compact and contains intriguing rearrangements. Comparisons with other available stramenopile genomes (plant pathogenic oomycete and diatom genomes) revealed effector proteins potentially involved in the adaptation to the intestinal environment, which were likely acquired via horizontal gene transfer. Moreover, Blastocystis living in anaerobic conditions harbors mitochondria-like organelles. An incomplete oxidative phosphorylation chain, a partial Krebs cycle, amino acid and fatty acid metabolisms and an iron-sulfur cluster assembly are all predicted to occur in these organelles. Predicted secretory proteins possess putative activities that may alter host physiology, such as proteases, protease-inhibitors, immunophilins and glycosyltransferases. This parasite also possesses the enzymatic machinery to tolerate oxidative bursts resulting from its own metabolism or induced by the host immune system.
Conclusions
This study provides insights into the genome architecture of this unusual stramenopile. It also proposes candidate genes with which to study the physiopathology of this parasite and thus may lead to further investigations into Blastocystis-host interactions.
doi:10.1186/gb-2011-12-3-r29
PMCID: PMC3129679  PMID: 21439036
18.  Ancestral Regulatory Circuits Governing Ectoderm Patterning Downstream of Nodal and BMP2/4 Revealed by Gene Regulatory Network Analysis in an Echinoderm 
PLoS Genetics  2010;6(12):e1001259.
Echinoderms, which are phylogenetically related to vertebrates and produce large numbers of transparent embryos that can be experimentally manipulated, offer many advantages for the analysis of the gene regulatory networks (GRN) regulating germ layer formation. During development of the sea urchin embryo, the ectoderm is the source of signals that pattern all three germ layers along the dorsal-ventral axis. How this signaling center controls patterning and morphogenesis of the embryo is not understood. Here, we report a large-scale analysis of the GRN deployed in response to the activity of this signaling center in the embryos of the Mediterranean sea urchin Paracentrotus lividus, in which studies with high spatial resolution are possible. By using a combination of in situ hybridization screening, overexpression of mRNA, recombinant ligand treatments, and morpholino-based loss-of-function studies, we identified a cohort of transcription factors and signaling molecules expressed in the ventral ectoderm, dorsal ectoderm, and interposed neurogenic (“ciliary band”) region in response to the known key signaling molecules Nodal and BMP2/4 and defined the epistatic relationships between the most important genes. The resultant GRN showed a number of striking features. First, Nodal was found to be essential for the expression of all ventral and dorsal marker genes, and BMP2/4 for all dorsal genes. Second, goosecoid was identified as a central player in a regulatory sub-circuit controlling mouth formation, while tbx2/3 emerged as a critical factor for differentiation of the dorsal ectoderm. Finally, and unexpectedly, a neurogenic ectoderm regulatory circuit characterized by expression of “ciliary band” genes was triggered in the absence of TGF beta signaling. 
We propose a novel model for ectoderm regionalization, in which neural ectoderm is the default fate in the absence of TGF beta signaling, and suggest that the stomodeal and neural subcircuits that we uncovered may represent ancient regulatory pathways controlling embryonic patterning.
Author Summary
Echinoderms (sea urchins, starfish, etc.) are marine invertebrates that share a close ancestry with vertebrates. Their embryos offer many advantages for the analysis of transcriptional circuits that control developmental programs. During early development of the common sea urchin Paracentrotus lividus, a signaling center located within the ventral ectoderm sends two key signals, Nodal and BMP2/4, that control patterning of the embryo along the whole dorsal-ventral axis. How this signaling center works is not understood. We have conducted a large-scale functional analysis of the genes responsible for patterning of the ectoderm along the dorsal-ventral axis. We identified direct targets of Nodal and BMP2/4 and identified several key regulators that mediate the effects of these factors and drive essential and probably ancient regulatory circuits that together constitute a transcriptional program controlling morphogenesis of the embryo. In addition, we uncovered a striking parallel between the mouse embryo and the sea urchin embryo by showing that in both models a neurogenic ectoderm is the default state of ectoderm differentiation in the absence of Nodal and BMP signaling. Our results support the idea that inhibition of Nodal and BMP signaling was probably an ancient mechanism to specify neural cells in the ancestor of vertebrates.
doi:10.1371/journal.pgen.1001259
PMCID: PMC3009687  PMID: 21203442
19.  Bioinformatic analysis of ESTs collected by Sanger and pyrosequencing methods for a keystone forest tree species: oak 
BMC Genomics  2010;11:650.
Background
The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the genus Quercus. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks are among the most important deciduous forest tree species in Europe. Despite their ecological and economic importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity.
Results
We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking led us to eliminate 19,941 sequences. Finally, a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using the TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. A BLAST search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but a more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoid biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. Comparative orthologous sequences (COS) with other plant gene models were identified, allowing the oak paleo-history to be unraveled. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html.
Conclusions
This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations.
doi:10.1186/1471-2164-11-650
PMCID: PMC3017864  PMID: 21092232
20.  Insights into metazoan evolution from Alvinella pompejana cDNAs 
BMC Genomics  2010;11:634.
Background
Alvinella pompejana is a representative of Annelids, a key phylum for evo-devo studies that is still poorly studied at the sequence level. A. pompejana inhabits deep-sea hydrothermal vents and is currently known as one of the most thermotolerant Eukaryotes in marine environments, withstanding the largest known chemical and thermal ranges (from 5 to 105°C). This tube-dwelling worm forms dense colonies on the surface of hydrothermal chimneys and can withstand long periods of hypo/anoxia and long phases of exposure to hydrogen sulphides. A. pompejana specifically inhabits chimney walls of hydrothermal vents on the East Pacific Rise. To survive, Alvinella has developed numerous adaptations at the physiological and molecular levels, such as an increase in the thermostability of proteins and protein complexes. It represents an outstanding model organism for studying adaptation to harsh physicochemical conditions and for isolating stable macromolecules resistant to high temperatures.
Results
We have constructed four full-length-enriched cDNA libraries to investigate the biology and evolution of this intriguing animal. Analysis of more than 75,000 high-quality reads led to the identification of 15,858 transcripts and 9,221 putative protein sequences. Our annotation reveals a good coverage of most animal pathways and networks with a prevalence of transcripts involved in oxidative stress resistance, detoxification, anti-bacterial defence, and heat shock protection. Alvinella proteins seem to show a slow evolutionary rate and a higher similarity with proteins from Vertebrates compared to proteins from Arthropods or Nematodes. Their composition shows enrichment in positively charged amino acids that might contribute to their thermostability. The gene content of Alvinella reveals that an important pool of genes previously considered to be specific to Deuterostomes was in fact already present in the last common ancestor of the Bilaterian animals, but has been secondarily lost in model invertebrates. This pool is enriched in glycoproteins that play a key role in intercellular communication, hormonal regulation and immunity.
Conclusions
Our study starts to unravel the gene content and sequence evolution of a deep-sea annelid, revealing key features in eukaryote adaptation to extreme environmental conditions and highlighting the proximity of Annelids and Vertebrates.
doi:10.1186/1471-2164-11-634
PMCID: PMC3018142  PMID: 21080938
21.  The Complete Genome of Propionibacterium freudenreichii CIRM-BIA1T, a Hardy Actinobacterium with Food and Probiotic Applications 
PLoS ONE  2010;5(7):e11748.
Background
Propionibacterium freudenreichii is essential as a ripening culture in Swiss-type cheeses and is also considered for its probiotic use [1]. This species exhibits slow growth, low nutritional requirements, and hardiness in many habitats. It belongs to the taxonomic group of dairy propionibacteria, in contrast to the cutaneous species P. acnes. The genome of the type strain, P. freudenreichii subsp. shermanii CIRM-BIA1 (CIP 103027T), was sequenced with an 11-fold coverage.
Methodology/Principal Findings
The circular chromosome of 2.7 Mb of the CIRM-BIA1 strain has a GC-content of 67% and contains 22 different insertion sequences (3.5% of the genome in base pairs). Using a proteomic approach, 490 of the 2439 predicted proteins were confirmed. The annotation revealed the genetic basis for the hardiness of P. freudenreichii, as the bacterium possesses a complete enzymatic arsenal for de novo biosynthesis of amino acids and vitamins (except pantothenate and biotin) as well as sequences involved in metabolism of various carbon sources, immunity against phages, duplicated chaperone genes and, interestingly, genes involved in the management of polyphosphate, glycogen and trehalose storage. The complete biosynthesis pathway for a bifidogenic compound is described, as well as a high number of surface proteins involved in interactions with the host and present in other probiotic bacteria. By comparative genomics, no pathogenicity factors found in P. acnes or in other pathogenic microbial species were identified in P. freudenreichii, which is consistent with the Generally Recognized As Safe and Qualified Presumption of Safety status of P. freudenreichii. Various pathways for formation of cheese flavor compounds were identified: the Wood-Werkman cycle for propionic acid formation, amino acid degradation pathways resulting in the formation of volatile branched chain fatty acids, and esterases involved in the formation of free fatty acids and esters.
Conclusions/Significance
With the exception of its ability to degrade lactose, P. freudenreichii seems poorly adapted to dairy niches. This genome annotation opens up new prospects for the understanding of the P. freudenreichii probiotic activity.
doi:10.1371/journal.pone.0011748
PMCID: PMC2909200  PMID: 20668525
22.  Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes 
BMC Genomics  2010;11:407.
Background
The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood following certain surgical procedures. It is a current, and potentially also future, source of medically useful molecular factors, such as anticoagulants and antibacterial peptides, which may have evolved as a result of its parasitizing large mammals, including humans. Despite the broad focus of research on this system, little has been done at the genomic or transcriptomic levels and there is a paucity of openly available sequence data. To begin to address this problem, we constructed whole embryo and adult central nervous system (CNS) EST libraries and created a clustered sequence database of the Hirudo transcriptome that is available to the scientific community.
Results
A total of ~133,000 EST clones from two directionally-cloned cDNA libraries, one constructed from mRNA derived from whole embryos at several developmental stages and the other from adult CNS cords, were sequenced in one or both directions by three different groups: Genoscope (French National Sequencing Center), the University of Iowa Sequencing Facility and the DOE Joint Genome Institute. These were assembled using the phrap software package into 31,232 unique contigs and singletons, with an average length of 827 nt. The assembled transcripts were then translated in all six frames and compared to proteins in NCBI's non-redundant (NR) and to the Gene Ontology (GO) protein sequence databases, resulting in 15,565 matches to 11,236 proteins in NR and 13,935 matches to 8,073 proteins in GO. Searching the database for transcripts of genes homologous to those thought to be involved in the innate immune responses of vertebrates and other invertebrates yielded a set of nearly one hundred evolutionarily conserved sequences, representing all known pathways involved in these important functions.
Conclusions
The sequences obtained for Hirudo transcripts represent the first major database of genes expressed in this important model system. Comparison of translated open reading frames (ORFs) with the other openly available leech datasets, the genome and transcriptome of Helobdella robusta, shows an average identity at the amino acid level of 58% in matched sequences. Interestingly, comparison with other available Lophotrochozoans shows similar high levels of amino acid identity, where sequences match, for example, 64% with Capitella capitata (a polychaete) and 56% with Aplysia californica (a mollusk), as well as 58% with Schistosoma mansoni (a platyhelminth). Phylogenetic comparisons of putative Hirudo innate immune response genes present within the Hirudo transcriptome database herein described show a strong resemblance to the corresponding mammalian genes, indicating that this important physiological response may have older origins than what has been previously proposed.
doi:10.1186/1471-2164-11-407
PMCID: PMC2996935  PMID: 20579359
23.  Detection and analysis of alternative splicing in Yarrowia lipolytica reveal structural constraints facilitating nonsense-mediated decay of intron-retaining transcripts 
Genome Biology  2010;11(6):R65.
Background
Hemiascomycetous yeasts have intron-poor genomes with very few cases of alternative splicing. Most of the reported examples result from intron retention in Saccharomyces cerevisiae and some have been shown to be functionally significant. Here we used transcriptome-wide approaches to evaluate the mechanisms underlying the generation of alternative transcripts in Yarrowia lipolytica, a yeast highly divergent from S. cerevisiae.
Results
Experimental investigation of Y. lipolytica gene models identified several cases of alternative splicing, mostly generated by intron retention, principally affecting the first intron of the gene. The retention of introns almost invariably creates a premature termination codon, as a direct consequence of the structure of intron boundaries. An analysis of Y. lipolytica introns revealed that introns of multiples of three nucleotides in length, particularly those without stop codons, were underrepresented. In other organisms, premature termination codon-containing transcripts are targeted for degradation by the nonsense-mediated mRNA decay (NMD) machinery. In Y. lipolytica, homologs of S. cerevisiae UPF1 and UPF2 genes were identified, but not UPF3. The inactivation of Y. lipolytica UPF1 and UPF2 resulted in the accumulation of unspliced transcripts of a test set of genes.
Conclusions
Y. lipolytica is the hemiascomycete with the most intron-rich genome sequenced to date, and it has several unusual genes with large introns or alternative transcription start sites, or introns in the 5' UTR. Our results suggest Y. lipolytica intron structure is subject to significant constraints, leading to the under-representation of stop-free introns. Consequently, intron-containing transcripts are degraded by a functional NMD pathway.
doi:10.1186/gb-2010-11-6-r65
PMCID: PMC2911113  PMID: 20573210
24.  Analysis of Virion Structural Components Reveals Vestiges of the Ancestral Ichnovirus Genome 
PLoS Pathogens  2010;6(5):e1000923.
Many thousands of endoparasitic wasp species are known to inject polydnavirus (PDV) particles into their caterpillar host during oviposition, causing immune and developmental dysfunctions that benefit the wasp larva. PDVs associated with braconid and ichneumonid wasps, bracoviruses and ichnoviruses respectively, both deliver multiple circular dsDNA molecules to the caterpillar. These molecules contain virulence genes but lack core genes typically involved in particle production. This is not completely unexpected given that no PDV replication takes place in the caterpillar. Particle production is confined to the wasp ovary where viral DNAs are generated from proviral copies maintained within the wasp genome. We recently showed that the genes involved in bracovirus particle production reside within the wasp genome and are related to nudiviruses. In the present work we characterized genes involved in ichnovirus particle production by analyzing the components of purified Hyposoter didymator Ichnovirus particles by LC-MS/MS and studying their organization in the wasp genome. Their products are conserved among ichnovirus-associated wasps and constitute a specific set of proteins in the virosphere. Strikingly, these genes are clustered in specialized regions of the wasp genome which are amplified along with proviral DNA during virus particle replication, but are not packaged in the particles. Our results clearly show that ichnovirus and bracovirus particles originated from different viral entities, thus providing an example of convergent evolution where two groups of wasps have independently domesticated viruses to deliver genes into their hosts.
Author Summary
The polydnaviruses (PDVs) are a unique virus type used by an organism (a parasitic wasp) to manipulate the physiology of another organism (a lepidopteran host) in order to ensure successful parasitism. The evolutionary origin of these unusual viruses, found in ∼17,500 braconid wasps (Bracoviruses) and ∼15,000 ichneumonid wasps (Ichnoviruses), has been a major question for the last decade. We thus undertook work aimed at investigating this origin via the characterization of genes encoding structural components for both types of PDVs. The present paper constitutes the first report on the identity and genome organisation of the viral machinery producing Ichnovirus virions. Our results strongly suggest that Ichnoviruses originated from a virus belonging to an as yet uncharacterized group that integrated its genome into that of an ichneumonid wasp ancestor. More importantly, our results demonstrate that the ancestor of Ichnoviruses differs from that of Bracoviruses, which originated from a nudivirus. We have now identified, for the two types of PDVs, the non-packaged viral genes and their products involved in producing particles injected into the host during oviposition. Together, these data provide an example of convergent evolution where different groups of wasps have independently domesticated viruses to deliver genes into their hosts.
doi:10.1371/journal.ppat.1000923
PMCID: PMC2877734  PMID: 20523890
25.  Identification of transcriptional signals in Encephalitozoon cuniculi widespread among Microsporidia phylum: support for accurate structural genome annotation 
BMC Genomics  2009;10:607.
Background
Microsporidia are obligate intracellular eukaryotic parasites with genomes ranging in size from 2.3 Mbp to more than 20 Mbp. The extremely small (2.9 Mbp) and highly compact (~1 gene/kb) genome of the human parasite Encephalitozoon cuniculi has been fully sequenced. The aim of this study was to characterize noncoding motifs that could be involved in regulation of gene expression in E. cuniculi and to show whether these motifs are conserved among the phylum Microsporidia.
Results
To identify such signals, 5' and 3' RACE-PCR experiments were performed on different E. cuniculi mRNAs. This analysis confirmed that transcription overrun occurs in E. cuniculi and may result from stochastic recognition of the AAUAAA polyadenylation signal. Such experiments also showed highly reduced 5' UTRs (<7 nts). Most of the E. cuniculi genes presented a CCC-like motif immediately upstream from the coding start. To characterize other signals involved in differential transcriptional regulation, we then focused our attention on the gene family coding for ribosomal proteins. An AAATTT-like signal was identified upstream from the CCC-like motif. In rare cases the cytosine triplet was shown to be substituted by a GGG-like motif. Comparative genomic studies confirmed that these different signals are also located upstream from genes encoding ribosomal proteins in other microsporidian species including Antonospora locustae, Enterocytozoon bieneusi, Anncaliia algerae (syn. Brachiola algerae) and Nosema ceranae. Based on these results, a systematic analysis of the ~2000 E. cuniculi coding DNA sequences was then performed and revealed that 364 translation initiation codons (18.29% of total CDSs) had been incorrectly predicted.
Conclusion
We identified various signals involved in the maturation of E. cuniculi mRNAs. The presence of such signals in phylogenetically distant microsporidian species suggests that a common regulatory mechanism exists among the microsporidia. Furthermore, because 5' UTRs are strongly reduced, these signals can be used to ensure the accurate prediction of translation initiation codons for microsporidian genes and to improve microsporidian genome annotation.
doi:10.1186/1471-2164-10-607
PMCID: PMC2803860  PMID: 20003517
