Genome research on the Pacific oyster,
Crassostrea gigas, has been facilitated by the recent development of species-specific tools such as linkage maps [
1,
2], large-insert libraries [
3], a public clearing-house [
4], and gene expression profiles [
5-
7]. Several factors motivate further development of genomic resources for
C. gigas: (I) Because this species has the highest annual production of any aquatic organism,
C. gigas has been the subject of a great deal of research to elucidate the molecular basis underlying the physiological and genetic mechanisms of economically-relevant traits. (II) The Pacific oyster's phylogenic position in the Lophotrochozoa, an understudied clade of bilaterian animals, makes molecular data on
C. gigas highly relevant for studies of genome evolution. (III) Oysters play an important role as sentinels in estuarine and coastal marine habitats where increasing human activities exacerbate the impacts of disease and stress in exploited populations. (IV)
C. gigas can be an invasive species when introduced into new habitats [
8]. As a result, the Pacific oyster is becoming an attractive model species for genome-related research activities focusing on comparative immunology [
e.g. [
9-
11]], disease ecology [
e.g. [
12-
14]], stress response to pollutants and parasites [
e.g. [
15]], developmental and reproductive physiology [
e.g. [
16,
17]] and evolutionary genetics [
e.g. [
18-
20]].
The genomic strategies currently employed for the identification of novel and previously-characterized genes affecting phenotypes of interest in the Pacific oyster include the identification of quantitative trait loci (QTL), and high-throughput studies of gene expression [
21]. QTL mapping of genetic variation affecting, for example, resistance to summer mortality [
22] or hybrid vigor [
6] requires a large number of mapped molecular markers and testing for associations between marker genotypes and phenotypes to identify chromosomal regions harbouring genes that directly affect the trait of interest. Recently developed BAC libraries and fingerprinting [
3] (P. Gaffney, Pers. Com.), have facilitated fine mapping of such regions, and ultimately specification of marker position on the genetic linkage map, allowing gene-assisted selection. Functional genomic approaches are also required for gene-expression profiling experiments such as macroarrays [
17], microarrays [
7], SAGE (Serial Analysis of Gene Expression), MPSS (Massively Parallel Signature Sequencing) [
6], or technologies addressing single genes, such RT-qPCR (real-time quantitative PCR). These techniques have potential applications in ecological monitoring [
23], evaluating oyster broodstock for selective breeding and understanding of gene regulation involved, for example, in the molecular pathways associated with responses to stress or pathogens.
In the present paper, we report the generation and analysis of 47,889 ESTs by sequencing clones from the Network of Excellence "Marine Genomics Europe" (MGE) normalized gonad cDNA library (partially published in [
24]), and two other projects: I) the Genoscope project (CEA Evry, France) and II) the European Aquafirst project (Table ). The objective of the Genoscope project (EST sequencing from
Crassostrea gigas) was to substantially expand genomic information on oysters by sequencing ESTs from: (I) a normalized "hemocyte" cDNA library constructed with mRNA from bacteria-challenged and unchallenged hemocytes, and (II) ESTs from an "all developmental stages and Central Nervous System (CNS)" normalized cDNA library derived from mRNA extracted from all embryonic and larval stages, as well as from adult visceral ganglia. The European "Aquafirst" project that uses genetic and functional genomic approaches to develop summer mortality resistance markers in oysters, produced ESTs by suppression subtractive hybridization between Resistant and Sensitive oyster lines in six different tissues [
25]. To maximize the utility of these collections, ESTs from all of these efforts, together with those in public databases (
e.g. [
26]), have been assembled in a unique public database: the GigasDatabase
http://public-contigbrowser.sigenae.org:9090/Crassostrea_gigas/index.html containing 29,745 unique sequences.
| Table 1Summary statistics of the Pacific oyster cDNA libraries. |
This resource is highly valuable for identifying important gene networks controlling physiological processes, it facilitates the development of molecular markers for the construction of a reference genetic map, and it allows large-scale, expression-profiling experiments using microarrays. These tools will be useful to advance our knowledge of the genetic and physiological bases of development, reproduction, immunology, and associated processes that are important for oyster aquaculture. Finally, this work will be very useful for the annotation phase of the entire oyster genome, the principal objective of an international community of oyster biologists [
27] that will provide a critical point of comparison for understanding the early diversification of animals and their genome, as has been recently proposed for the gastropod snail
Lottia gigantea http://genome.jgi-psf.org/Lotgi1/Lotgi1.home.html.