Vertebrates have experienced two rounds of whole-genome duplication (WGD) in the stem lineages of deep nodes within the group and a subsequent duplication event in the stem lineage of the teleosts—a highly diverse group of ray-finned fishes. Here, we present the first full Hox gene sequences for any member of the Acipenseriformes, the American paddlefish, and confirm that an independent WGD occurred in the paddlefish lineage, approximately 42 Ma based on sequences spanning the entire HoxA cluster and eight genes on the HoxD gene cluster. These clusters comprise different HOX loci and maintain conserved synteny relative to bichir, zebrafish, stickleback, and pufferfish, as well as human, mouse, and chick. We also provide a gene genealogy for the duplicated fzd8 gene in paddlefish and present evidence for the first Hox14 gene in any ray-finned fish. Taken together, these data demonstrate that the American paddlefish has an independently duplicated genome. Substitution patterns of the “alpha” paralogs on both the HoxA and HoxD gene clusters suggest transcriptional inactivation consistent with functional diploidization. Further, there are similarities in the pattern of sequence divergence among duplicated Hox genes in paddlefish and teleost lineages, even though they occurred independently approximately 200 Myr apart. We highlight implications on comparative analyses in the study of the “fin-limb transition” as well as gene and genome duplication in bony fishes, which includes all ray-finned fishes as well as the lobe-finned fishes and tetrapod vertebrates.
Polyodon spathula; whole-genome duplication; WGD; rate asymmetry; paralog retention; fin-limb transition
Comparative genomic studies suggest that the modern day assemblage of ray-finned fishes have descended from an ancestral grouping of fishes that possessed 12–13 linkage groups. All jawed vertebrates are postulated to have experienced two whole genome duplications (WGD) in their ancestry (2R duplication). Salmonids have experienced one additional WGD (4R duplication event) compared to most extant teleosts which underwent a further 3R WGD compared to other vertebrates. We describe the organization of the 4R chromosomal segments of the proto-ray-finned fish karyotype in Atlantic salmon and rainbow trout based upon their comparative syntenies with two model species of 3R ray-finned fishes.
Evidence is presented for the retention of large whole-arm affinities between the ancestral linkage groups of the ray-finned fishes, and the 50 homeologous chromosomal segments in Atlantic salmon and rainbow trout. In the comparisons between the two salmonid species, there is also evidence for the retention of large whole-arm homeologous affinities that are associated with the retention of duplicated markers. Five of the 7 pairs of chromosomal arm regions expressing the highest level of duplicate gene expression in rainbow trout share homologous synteny to the 5 pairs of homeologs with the greatest duplicate gene expression in Atlantic salmon. These regions are derived from proto-Actinopterygian linkage groups B, C, E, J and K.
Two chromosome arms in Danio rerio and Oryzias latipes (descendants of the 3R duplication) can, in most instances be related to at least 4 whole or partial chromosomal arms in the salmonid species. Multiple arm assignments in the two salmonid species do not clearly support a 13 proto-linkage group model, and suggest that a 12 proto-linkage group arrangement (i.e., a separate single chromosome duplication and ancestral fusion/fissions/recombination within the putative G/H/I groupings) may have occurred in the more basal soft-rayed fishes. We also found evidence supporting the model that ancestral linkage group M underwent a single chromosome duplication following the 3R duplication. In the salmonids, the M ancestral linkage groups are localized to 5 whole arm, and 3 partial arm regions (i.e., 6 whole arm regions expected). Thus, 3 distinct ancestral linkage groups are postulated to have existed in the G/H and M lineage chromosomes in the ancestor of the salmonids.
Whole-genome duplications (WGDs) have occurred repeatedly in the vertebrate lineage, but their evolutionary significance for phenotypic evolution remains elusive. Here, we have investigated the impact of the fish-specific genome duplication (FSGD) on the evolution of pigmentation pathways in teleost fishes. Pigmentation and color patterning are among the most diverse traits in teleosts, and their pigmentary system is the most complex of all vertebrate groups.
Using a comparative genomic approach including phylogenetic and synteny analyses, the evolution of 128 vertebrate pigmentation genes in five teleost genomes following the FSGD has been reconstructed. We show that pigmentation genes have been preferentially retained in duplicate after the FSGD, so that teleosts have 30% more pigmentation genes compared with tetrapods. This is significantly higher than genome-wide estimates of FSGD gene duplicate retention in teleosts. Large parts of the melanocyte regulatory network have been retained in two copies after the FSGD. Duplicated pigmentation genes follow general evolutionary patterns such as the preservation of protein complex stoichiometries and the overrepresentation of developmental genes among retained duplicates. These results suggest that the FSGD has made an important contribution to the evolution of teleost-specific features of pigmentation, which include novel pigment cell types or the division of existing pigment cell types into distinct subtypes. Furthermore, we have observed species-specific differences in duplicate retention and evolution that might contribute to pigmentary diversity among teleosts.
Our study therefore strongly supports the hypothesis that WGDs have promoted the increase of complexity and diversity during vertebrate phenotypic evolution.
genome duplication; fish; conserved synteny; pigment cell; melanocyte; functional module
Based on the observation of an increased number of paralogous genes in teleost fishes compared with other vertebrates and on the conserved synteny between duplicated copies, it has been shown that a whole genome duplication (WGD) occurred during the evolution of Actinopterygian fish. Comparative phylogenetic dating of this duplication event suggests that it occurred early on, specifically in teleosts. It has been proposed that this event might have facilitated the evolutionary radiation and the phenotypic diversification of the teleost fish, notably by allowing the sub- or neo-functionalization of many duplicated genes.
In this paper, we studied in a wide range of Actinopterygians the duplication and fate of the androgen receptor (AR, NR3C4), a nuclear receptor known to play a key role in sex-determination in vertebrates. The pattern of AR gene duplication is consistent with an early WGD event: it has been duplicated into two genes AR-A and AR-B after the split of the Acipenseriformes from the lineage leading to teleost fish but before the divergence of Osteoglossiformes. Genomic and syntenic analyses in addition to lack of PCR amplification show that one of the duplicated copies, AR-B, was lost in several basal Clupeocephala such as Cypriniformes (including the model species zebrafish), Siluriformes, Characiformes and Salmoniformes. Interestingly, we also found that, in basal teleost fish (Osteoglossiformes and Anguilliformes), the two copies remain very similar, whereas, specifically in Percomorphs, one of the copies, AR-B, has accumulated substitutions in both the ligand binding domain (LBD) and the DNA binding domain (DBD).
The comparison of the mutations present in these divergent AR-B with those known in human to be implicated in complete, partial or mild androgen insensitivity syndrome suggests that the existence of two distinct AR duplicates may be correlated to specific functional differences that may be connected to the well-known plasticity of sex determination in fish. This suggests that three specific events have shaped the present diversity of ARs in Actinopterygians: (i) early WGD, (ii) parallel loss of one duplicate in several lineages and (iii) putative neofunctionalization of the same duplicate in percomorphs, which occurred a long time after the WGD.
Force, Lynch and Conery proposed the duplication-degeneration-complementation (DDC) model in which partitioning of ancestral functions (subfunctionalization) and acquisition of novel functions (neofunctionalization) were the two primary mechanisms for the retention of duplicated genes. The DDC model was tested by analyzing the transcriptional induction of the duplicated fatty acid-binding protein (fabp) genes by clofibrate in zebrafish. Clofibrate is a specific ligand of the peroxisome proliferator-activated receptor (PPAR); it activates PPAR which then binds to a peroxisome proliferator response element (PPRE) to induce the transcriptional initiation of genes primarily involved in lipid homeostasis. Zebrafish was chosen as our model organism as it has many duplicated genes owing to a whole genome duplication (WGD) event that occurred ~230-400 million years ago in the teleost fish lineage. We assayed the steady-state levels of fabp mRNA and heterogeneous nuclear RNA (hnRNA) transcripts in liver, intestine, muscle, brain and heart for four sets of duplicated fabp genes, fabp1a/fabp1b.1/fabp1b.2, fabp7a/fabp7b, fabp10a/fabp10b and fabp11a/fabp11b in zebrafish fed different concentrations of clofibrate.
Electron microscopy showed an increase in the number of peroxisomes and mitochondria in liver and heart, respectively, in zebrafish fed clofibrate. Clofibrate also increased the steady-state level of acox1 mRNA and hnRNA transcripts in different tissues, a gene with a functional PPRE. These results demonstrate that zebrafish is responsive to clofibrate, unlike some other fishes. The levels of fabp mRNA and hnRNA transcripts for the four sets of duplicated fabp genes was determined by reverse transcription, quantitative polymerase chain reaction (RT-qPCR). The level of hnRNA coded by a gene is an indirect estimate of the rate of transcriptional initiation of that gene. Clofibrate increased the steady-state level of fabp mRNAs and hnRNAs for both the duplicated copies of fabp1a/fabp1b.1, and fabp7a/fabp7b, but in different tissues. Clofibrate also increased the steady-state level of fabp10a and fabp11a mRNAs and hnRNAs in liver, but not for fabp10b and fabp11b.
Some duplicated fabp genes have, most likely, retained PPREs, but induction by clofibrate is over-ridden by an, as yet, unknown tissue-specific mechanism(s). Regardless of the tissue-specific mechanism(s), transcriptional control of duplicated zebrafish fabp genes by clofibrate has markedly diverged since the WGD event.
The grasses, Poaceae, are one of the largest and most successful angiosperm families. Like many radiations of flowering plants, the divergence of the major grass lineages was preceded by a whole-genome duplication (WGD), although these events are not rare for flowering plants. By combining identification of syntenic gene blocks with measures of gene pair divergence and different frequencies of ancient gene loss, we have separated the two subgenomes present in modern grasses. Reciprocal loss of duplicated genes or genomic regions has been hypothesized to reproductively isolate populations and, thus, speciation. However, in contrast to previous studies in yeast and teleost fishes, we found very little evidence of reciprocal loss of homeologous genes between the grasses, suggesting that post-WGD gene loss may not be the cause of the grass radiation. The sets of homeologous and orthologous genes and predicted locations of deleted genes identified in this study, as well as links to the CoGe comparative genomics web platform for analyzing pan-grass syntenic regions, are provided along with this paper as a resource for the grass genetics community.
polyploidy; gene loss; synteny; Poaceae; speciation
Large-scale—even genome-wide—duplications have repeatedly been invoked as an explanation for major radiations. Teleosts, the most species-rich vertebrate clade, underwent a “fish-specific genome duplication” (FSGD) that is shared by most ray-finned fish lineages. We investigate here the Hox complement of the goldeye (Hiodon alosoides), a representative of Osteoglossomorpha, the most basal teleostean clade. An extensive PCR survey reveals that goldeye has at least eight Hox clusters, indicating a duplicated genome compared to basal actinopterygians. The possession of duplicated Hox clusters is uncoupled to species richness. The Hox system of the goldeye is substantially different from that of other teleost lineages, having retained several duplicates of Hox genes for which crown teleosts have lost at least one copy. A detailed analysis of the PCR fragments as well as full length sequences of two HoxA13 paralogs, and HoxA10 and HoxC4 genes places the duplication event close in time to the divergence of Osteoglossomorpha and crown teleosts. The data are consistent with—but do not conclusively prove—that Osteoglossomorpha shares the FSGD.
Electronic supplementary material
The online version of this article (doi:10.1007/s12064-009-0056-1) contains supplementary material, which is available to authorized users.
Hox clusters; Fish-specific genome duplication; Goldeye Hiodon alosoides
Although the vertebrate skeleton arose in the sea 500 million years ago, our understanding of the molecular fingerprints of chondrocytes and osteoblasts may be biased because it is informed mainly by research on land animals. In fact, the molecular fingerprint of teleost osteoblasts differs in key ways from that of tetrapods, but we do not know the origin of these novel gene functions. They either arose as neofunctionalization events after the teleost genome duplication (TGD), or they represent preserved ancestral functions that pre-date the TGD. Here, we provide evolutionary perspective to the molecular fingerprints of skeletal cells and assess the role of genome duplication in generating novel gene functions. We compared the molecular fingerprints of skeletogenic cells in two ray-finned fish: zebrafish (Danio rerio)--a teleost--and the spotted gar (Lepisosteus oculatus)--a "living fossil" representative of a lineage that diverged from the teleost lineage prior to the TGD (i.e., the teleost sister group). We analyzed developing embryos for expression of the structural collagen genes col1a2, col2a1, col10a1, and col11a2 in well-formed cartilage and bone, and studied expression of skeletal regulators, including the transcription factor genes sox9 and runx2, during mesenchymal condensation.
Results provided no evidence for the evolution of novel functions among gene duplicates in zebrafish compared to the gar outgroup, but our findings shed light on the evolution of the osteoblast. Zebrafish and gar chondrocytes both expressed col10a1 as they matured, but both species' osteoblasts also expressed col10a1, which tetrapod osteoblasts do not express. This novel finding, along with sox9 and col2a1 expression in developing osteoblasts of both zebrafish and gar, demonstrates that osteoblasts of both a teleost and a basally diverging ray-fin fish express components of the supposed chondrocyte molecular fingerprint.
Our surprising finding that the "chondrogenic" transcription factor sox9 is expressed in developing osteoblasts of both zebrafish and gar can help explain the expression of chondrocyte genes in osteoblasts of ray-finned fish. More broadly, our data suggest that the molecular fingerprint of the osteoblast, which largely is constrained among land animals, was not fixed during early vertebrate evolution.
Recent genomic studies have revealed a teleost-specific third-round whole genome duplication (3R-WGD) event occurred in a common ancestor of teleost fishes. However, it is unclear how the genes duplicated in this event were lost or persisted during the diversification of teleosts, and therefore, how many of the duplicated genes contribute to the genetic differences among teleosts. This subject is also important for understanding the process of vertebrate evolution through WGD events. We applied a comparative evolutionary approach to this question by focusing on the genes involved in long-term potentiation, taste and olfactory transduction, and the tricarboxylic acid cycle, based on the whole genome sequences of four teleosts; zebrafish, medaka, stickleback, and green spotted puffer fish.
We applied a state-of-the-art method of maximum-likelihood phylogenetic inference and conserved synteny analyses to each of 130 genes involved in the above biological systems of human. These analyses identified 116 orthologous gene groups between teleosts and tetrapods, and 45 pairs of 3R-WGD-derived duplicate genes among them. This suggests that more than half [(45×2)/(116+45)] = 56.5%) of the loci, probably more than ten thousand genes, present in a common ancestor of the four teleosts were still duplicated after the 3R-WGD. The estimated temporal pattern of gene loss suggested that, after the 3R-WGD, many (71/116) of the duplicated genes were rapidly lost during the initial 75 million years (MY), whereas on average more than half (27.3/45) of the duplicated genes remaining in the ancestor of the four teleosts (45/116) have persisted for about 275 MY. The 3R-WGD-derived duplicates that have persisted for a long evolutionary periods of time had significantly larger number of interacting partners and longer length of protein coding sequence, implying that they tend to be more multifunctional than the singletons after the 3R-WGD.
We have shown firstly the temporal pattern of gene loss process after 3R-WGD on the basis of teleost phylogeny and divergence time frameworks. The 3R-WGD-derived duplicates have not undergone constant exponential decay, suggesting that selection favoured the long-term persistence of a subset of duplicates that tend to be multi-functional. On the basis of these results obtained from the analysis of 116 orthologous gene groups, we propose that more than ten thousand of 3R-WGD-derived duplicates have experienced lineage-specific evolution, that is, the differential sub-/neo-functionalization or secondary loss between lineages, and contributed to teleost diversity.
One of the main explanations for the stunning diversity of teleost fishes (~29,000 species, nearly half of all vertebrates) is that a fish-specific whole-genome duplication event (FSGD) in the ancestor to teleosts triggered their subsequent radiation. However, one critical assumption of this hypothesis, that diversification rates in teleosts increased soon after the acquisition of a duplicated genome, has never been tested.
Here we show that one of three major diversification rate shifts within ray-finned fishes occurred at the base of the teleost radiation, as predicted by the FSGD hypothesis. We also find evidence for two rate increases that are much younger than the inferred age of the FSGD: one in the common ancestor of most ostariophysan fishes, and a second one in the common ancestor of percomorphs. The biodiversity contained within these two clades accounts for more than 88% of living fish species.
Teleosts diversified explosively in their early history and this burst of diversification may have been caused by genome duplication. However, the FSGD itself may be responsible for a little over 10% of living teleost biodiversity. ~88% of species diversity is derived from two relatively recent radiations of freshwater and marine fishes where genome duplication is not suspected. Genome duplications are a common event on the tree of life and have been implicated in the diversification of major clades like flowering plants, vertebrates, and gnathostomes. However our results suggest that the causes of diversification in large clades are likely to be complex and not easily ascribed to a single event, even a dramatic one such as a whole genome duplication.
The genomic basis of teleost phenotypic complexity remains obscure, despite increasing availability of genome and transcriptome sequence data. Fish-specific genome duplication cannot provide sufficient explanation for the morphological complexity of teleosts, considering the relatively large number of extinct basal ray-finned fishes.
In this study, we performed comparative genomic analysis to discover the Conserved Teleost-Specific Genes (CTSGs) and orphan genes within zebrafish and found that these two sets of lineage-specific genes may have played important roles during zebrafish embryogenesis. Lineage-specific genes within zebrafish share many of the characteristics of their counterparts in other species: shorter length, fewer exon numbers, higher GC content, and fewer of them have transcript support. Chromosomal location analysis indicated that neither the CTSGs nor the orphan genes were distributed evenly in the chromosomes of zebrafish. The significant enrichment of immunity proteins in CTSGs annotated by gene ontology (GO) or predicted ab initio may imply that defense against pathogens may be an important reason for the diversification of teleosts. The evolutionary origin of the lineage-specific genes was determined and a very high percentage of lineage-specific genes were generated via gene duplications. The temporal and spatial expression profile of lineage-specific genes obtained by expressed sequence tags (EST) and RNA-seq data revealed two novel properties: in addition to being highly tissue-preferred expression, lineage-specific genes are also highly temporally restricted, namely they are expressed in narrower time windows than evolutionarily conserved genes and are specifically enriched in later-stage embryos and early larval stages.
Our study provides the first systematic identification of two different sets of lineage-specific genes within zebrafish and provides valuable information leading towards a better understanding of the molecular mechanisms of the genomic basis of teleost phenotypic complexity for future studies.
Teleost; Lineage-specific gene; Transcriptome; Zebrafish embryogenesis
We explore how whole-genome duplications (WGDs) may have given rise to complex innovations in cellular networks, innovations that could not have evolved through sequential single-gene duplications. We focus on two classical WGD events, one in bakers' yeast and the other at the base of vertebrates (i.e., two rounds of whole-genome duplication: 2R-WGD). Two complex adaptations are discussed in detail: aerobic ethanol fermentation in yeast and the rewiring of the vertebrate developmental regulatory network through the 2R-WGD. These two examples, derived from diverged branches on the eukaryotic tree, boldly underline the evolutionary potential of WGD in facilitating major evolutionary transitions. We close by arguing that the evolutionary importance of WGD may require updating certain aspects of modern evolutionary theory, perhaps helping to synthesize a new evolutionary systems biology.
Whole genome duplication (WGD) is a special case of gene duplication, observed rarely in animals, whereby all genes duplicate simultaneously through polyploidisation. Two rounds of WGD (2R-WGD) occurred at the base of vertebrates, giving rise to an enormous wave of genetic novelty, but a systematic analysis of functional consequences of this event has not yet been performed.
We show that 2R-WGD affected an overwhelming majority (74%) of signalling genes, in particular developmental pathways involving receptor tyrosine kinases, Wnt and transforming growth factor-β ligands, G protein-coupled receptors and the apoptosis pathway. 2R-retained genes, in contrast to tandem duplicates, were enriched in protein interaction domains and multifunctional signalling modules of Ras and mitogen-activated protein kinase cascades. 2R-WGD had a fundamental impact on the cell-cycle machinery, redefined molecular building blocks of the neuronal synapse, and was formative for vertebrate brains. We investigated 2R-associated nodes in the context of the human signalling network, as well as in an inferred ancestral pre-2R (AP2R) network, and found that hubs (particularly involving negative regulation) were preferentially retained, with high connectivity driving retention. Finally, microarrays and proteomics demonstrated a trend for gradual paralog expression divergence independent of the duplication mechanism, but inferred ancestral expression states suggested preferential subfunctionalisation among 2R-ohnologs (2ROs).
The 2R event left an indelible imprint on vertebrate signalling and the cell cycle. We show that 2R-WGD preferentially retained genes are associated with higher organismal complexity (for example, locomotion, nervous system, morphogenesis), while genes associated with basic cellular functions (for example, translation, replication, splicing, recombination; with the notable exception of cell cycle) tended to be excluded. 2R-WGD set the stage for the emergence of key vertebrate functional novelties (such as complex brains, circulatory system, heart, bone, cartilage, musculature and adipose tissue). A full explanation of the impact of 2R on evolution, function and the flow of information in vertebrate signalling networks is likely to have practical consequences for regenerative medicine, stem cell therapies and cancer treatment.
Corticosteroid receptors include mineralocorticoid (MR) and glucocorticoid (GR) receptors. Teleost fishes have a single MR and duplicate GRs that show variable sensitivities to mineralocorticoids and glucocorticoids. How these receptors compare functionally to tetrapod MR and GR, and the evolutionary significance of maintaining two GRs, remains unclear.
We used up to seven steroids (including aldosterone, cortisol and 11-deoxycorticosterone [DOC]) to compare the ligand specificity of the ligand binding domains of corticosteroid receptors between a mammal (Mus musculus) and the midshipman fish (Porichthys notatus), a teleost model for steroid regulation of neural and behavioral plasticity. Variation in mineralocorticoid sensitivity was considered in a broader phylogenetic context by examining the aldosterone sensitivity of MR and GRs from the distantly related daffodil cichlid (Neolamprologus pulcher), another teleost model for neurobehavioral plasticity. Both teleost species had a single MR and duplicate GRs. All MRs were sensitive to DOC, consistent with the hypothesis that DOC was the initial ligand of the ancestral MR. Variation in GR steroid-specificity corresponds to nine identified amino acid residue substitutions rather than phylogenetic relationships based on receptor sequences.
The mineralocorticoid sensitivity of duplicate GRs in teleosts is highly labile in the context of their evolutionary phylogeny, a property that likely led to neo-functionalization and maintenance of two GRs.
Background and Aims
To assess the number and phylogenetic distribution of large-scale genome duplications in the ancestry of Actinidia, publicly available expressed sequenced tags (ESTs) for members of the Actinidiaceae and related Ericales, including tea (Camellia sinensis), were analysed.
Synonymous divergences (Ks) were calculated for all duplications within gene families and examined for evidence of large-scale duplication events. Phylogenetic comparisons for a selection of orthologues among several related species in Ericales and two outgroups permitted placement of duplication events in relation to lineage divergences. Gene ontology (GO) categories were analysed for each whole-genome duplication (WGD) and the whole transcriptome.
Evidence for three ancient WGDs in Actinidia was found. Analyses of paleologue GO categories indicated a different pattern of retained genes for each genome duplication, but a pattern consistent with the dosage-balance hypothesis among all retained paleologues.
This study provides evidence for one independent WGD in the ancestry of Actinidia (Ad-α), a WGD shared by Actinidia and Camellia (Ad-β), and the well-established At-γ WGD that occurred prior to the divergence of all taxa examined. More ESTs in other taxa are needed to elucidate which groups in Ericales share the Ad-β or Ad-α duplications and their impact on diversification.
Paleopolyploidy; Actinidiaceae; Ericales; Actinidia; Camellia; kiwi; genome duplication; dosage balance
The relationship between genotypic and phenotypic divergence over evolutionary time varies widely, and cases of rapid phenotypic differentiation despite genetic similarity have attracted much attention. Here, we report an extreme case of the reverse pattern—morphological stasis in a tropical fish despite massive genetic divergence. We studied the enigmatic African freshwater butterfly fish (Pantodon buchholzi), whose distinctive morphology earns it recognition as a monotypic family. We sequenced the mitochondrial genome of Pantodon from the Congo basin and nine other osteoglossomorph taxa for comparison with previous mitogenomic profiles of Pantodon from the Niger basin and other related taxa. Pantodon populations form a monophyletic group, yet their mitochondrial coding sequences differ by 15.2 per cent between the Niger and Congo basins. The mitogenomic divergence time between these populations is estimated to be greater than 50 Myr, and deep genetic divergence was confirmed by nuclear sequence data. Among six sister-group comparisons of osteoglossomorphs, Pantodon exhibits the slowest rate of morphological divergence despite a level of genetic differentiation comparable to both species-rich (e.g. Mormyridae) and species-poor (e.g. Osteoglossidae) families. Morphological stasis in these two allopatric lineages of Pantodon offers a living vertebrate model for investigating phenotypic stability over millions of generations in the face of profound fluctuations in environmental conditions.
living fossil; morphological stasis; rate of evolution; cryptic diversity; Osteoglossomorpha
Long wave-sensitive (LWS) opsin genes have undergone multiple lineage-specific duplication events throughout the evolution of teleost fishes. LWS repertoire expansions in live-bearing fishes (family Poeciliidae) have equipped multiple species in this family with up to four LWS genes. Given that color vision, especially attraction to orange male coloration, is important to mate choice within poeciliids, LWS opsins have been proposed as candidate genes driving sexual selection in this family. To date the genomic organization of these genes has not been described in the family Poeciliidae, and little is known about the mechanisms regulating the expression of LWS opsins in any teleost.
Two BAC clones containing the complete genomic repertoire of LWS opsin genes in the green swordtail fish, Xiphophorus helleri, were identified and sequenced. Three of the four LWS loci identified here were linked in a tandem array downstream of two tightly linked short wave-sensitive 2 (SWS2) opsin genes. The fourth LWS opsin gene, containing only a single intron, was not linked to the other three and is the product of a retrotransposition event. Genomic and phylogenetic results demonstrate that the LWS genes described here share a common evolutionary origin with those previously characterized in other poeciliids. Using qualitative RT-PCR and MSP we showed that each of the LWS and SWS2 opsins, as well as three other cone opsin genes and a single rod opsin gene, were expressed in the eyes of adult female and male X. helleri, contributing to six separate classes of adult retinal cone and rod cells with average λmax values of 365 nm, 405 nm, 459 nm, 499 nm, 534 nm and 568 nm. Comparative genomic analysis identified two candidate teleost opsin regulatory regions containing putative CRX binding sites and hormone response elements in upstream sequences of LWS gene regions of seven teleost species, including X. helleri.
We report the first complete genomic description of LWS and SWS2 genes in poeciliids. These data will serve as a reference for future work seeking to understand the relationship between LWS opsin genomic organization, gene expression, gene family evolution, sexual selection and speciation in this fish family.
Comparative genomics has shown that ray-finned fish (Actinopterygii) contain more copies of many genes than other vertebrates. A large number of these additional genes appear to have been produced during a genome duplication event that occurred early during the evolution of Actinopterygii (i.e. before the teleost radiation). In addition to this ancient genome duplication event, many lineages within Actinopterygii have experienced more recent genome duplications. Here we introduce a curated database named Wanda that lists groups of orthologous genes with one copy from man, mouse and chicken, one or two from tetraploid Xenopus and two or more ancient copies (i.e. paralogs) from ray-finned fish. The database also contains the sequence alignments and phylogenetic trees that were necessary for determining the correct orthologous and paralogous relationships among genes. Where available, map positions and functional data are also reported. The Wanda database should be of particular use to evolutionary and developmental biologists who are interested in the evolutionary and functional divergence of genes after duplication. Wanda is available at http://www.evolutionsbiologie.uni-konstanz.de/Wanda/.
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue.
BK channels; hearing; gene duplications
Teleost fishes have three distinct oestrogen receptor (ER) subtypes: ER-α, ER-βa (or ER-γ) and ER-βb. ER-βa and ER-βb arose from a duplication of an ancestral ER-β gene early in the teleost lineage. Here, we describe the distribution of the three ER mRNAs in the hypothalamus and cerebellum of the Atlantic croaker to address two issues: the specific functions of multiple ERs in the neuroendocrine system and the evolution and fate of duplicated genes. ER-α was detected in nuclei of the preoptic area (POA) and hypothalamus previously shown to possess ER-αs in teleosts. AcER-βb, but not ER-βa, labelling was detected in the magnocellular neurons of the POA, nucleus posterior tuberis, the nucleus recessus posterior and cerebellum. By contrast, acER-βa, but not ER-βb, was detected in the dorsal anterior parvocellular POA and suprachiasmatic nucleus. Both ER-βs were found in posterior parvocellular and ventral anterior POA nuclei, the ventral hypothalamus, and periventricular dorsal hypothalamus. The differences we observed in ER subtype mRNA distribution within well-characterized brain nuclei suggest that ER-βa and ER-βb have distinct functions in the neuroendocrine control of reproduction and behaviour, and provide evidence that the teleost ER-β paralogues have partitioned functions of the ancestral ER-β gene they shared with tetrapods.
oestrogen receptor; gene duplication; teleost fishes; neuroendocrine regulation; hypothalamus; brain
Whole-genome duplications (WGDs) have recurred in the evolution of angiosperms, resulting in many duplicated chromosomal segments. Local gene duplications are also widespread in angiosperms. WGD-derived duplicates, that is, ohnologs, and local duplicates often show contrasting patterns of gene retention and evolution. However, many genes in angiosperms underwent multiple gene duplication events, possibly by different modes, indicating that different modes of gene duplication are not mutually exclusive. In two representative angiosperm genomes, Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa), we found that 9.6% and 11.3% of unique ohnologs, corresponding to 15.5% and 17.1% of ohnolog pairs, were also involved in local duplications, respectively. Locally duplicated ohnologs are widely distributed in different duplicated chromosomal segments and functionally biased. Coding sequence divergence between duplicated genes is denoted by nonsynonymous (Ka) and synonymous (Ks) substitution rates. Locally duplicated ohnolog pairs tend to have higher Ka, Ka/Ks, and gene expression divergence than nonlocally duplicated ohnolog pairs. Locally duplicated ohnologs also tend to have higher interspecies sequence divergence. These observations indicate that locally duplicated ohnologs evolve faster than nonlocally duplicated ohnologs. This study highlights the necessity to take local duplications into account when analyzing the evolutionary dynamics of ohnologs.
local gene duplication; whole-genome duplication; ohnolog; divergence; colinearity
We characterized ectopic gene conversions in the genome of ten hemiascomycete yeast species. Of the ten species, three diverged prior to the whole genome duplication (WGD) event present in the yeast lineage and seven diverged after it. We analyzed gene conversions from three separate datasets: paralogs from the three pre-WGD species, paralogs from the seven post-WGD species, and common ohnologs from the seven post-WGD species. Gene conversions have similar lengths and frequency and occur between sequences having similar degrees of divergence, in paralogs from pre- and post-WGD species. However, the sequences of ohnologs are both more divergent and less frequently converted than those of paralogs. This likely reflects the fact that ohnologs are more often found on different chromosomes and are evolving under stronger selective pressures than paralogs. Our results also show that ectopic gene conversions tend to occur more frequently between closely linked genes. They also suggest that the mechanisms responsible for the loss of introns in S. cerevisiae are probably also involved in the gene 3′-end gene conversion bias observed between the paralogs of this species.
Based on a wide variety of data, it is now clear that the brains of birds and teleost (bony) fish possess a core “social behavior network” within the basal forebrain and midbrain that is homologous to the social behavior network of mammals. The nodes of this network are reciprocally connected, contain receptors for sex steroid hormones, and are involved in multiple forms of social behavior. Other hodological features and neuropeptide distributions are likewise very similar across taxa. This evolutionary conservation represents a boon for experiments on phenotypic behavioral variation, as the extraordinary social diversity of teleost fish and songbirds can now be used to generate broadly relevant insights into issues of brain function that are not particularly tractable in other vertebrate groups. Two such lines of research are presented here, each of which addresses functional variation within the network as it relates to divergent patterns of social behavior. In the first set of experiments, we have used a sexually polymorphic fish to demonstrate that natural selection can operate independently on hypothalamic neuroendocrine functions that are relevant for 1) gonadal regulation and 2) sex-typical behavioral modulation. In the second set of experiments, we have exploited the diversity of avian social organizations and ecologies to isolate species-typical group size as a quasi-independent variable. These experiments have shown that specific areas and peptidergic components of the social behavior network possess functional properties that evolve in parallel with divergence and convergence in sociality.
sociality; aggression; sexual behavior; communication; vocalization; arginine vasopressin; arginine vasotocin; isotocin; mesotocin; oxytocin; fish; bird; bed nucleus of the stria terminalis; amygdala; lateral septum; anterior hypothalamus; ventromedial hypothalamus; preoptic area; periaqueductal gray; nucleus intercollicularis; c-fos; egr-1; Zenk
Genomic resources for hundreds of species of evolutionary, agricultural, economic, and medical importance are unavailable due to the expense of well-assembled genome sequences and difficulties with multigenerational studies. Teleost fish provide many models for human disease but possess anciently duplicated genomes that sometimes obfuscate connectivity. Genomic information representing a fish lineage that diverged before the teleost genome duplication (TGD) would provide an outgroup for exploring the mechanisms of evolution after whole-genome duplication. We exploited massively parallel DNA sequencing to develop meiotic maps with thrift and speed by genotyping F1 offspring of a single female and a single male spotted gar (Lepisosteus oculatus) collected directly from nature utilizing only polymorphisms existing in these two wild individuals. Using Stacks, software that automates the calling of genotypes from polymorphisms assayed by Illumina sequencing, we constructed a map containing 8406 markers. RNA-seq on two map-cross larvae provided a reference transcriptome that identified nearly 1000 mapped protein-coding markers and allowed genome-wide analysis of conserved synteny. Results showed that the gar lineage diverged from teleosts before the TGD and its genome is organized more similarly to that of humans than teleosts. Thus, spotted gar provides a critical link between medical models in teleost fish, to which gar is biologically similar, and humans, to which gar is genomically similar. Application of our F1 dense mapping strategy to species with no prior genome information promises to facilitate comparative genomics and provide a scaffold for ordering the numerous contigs arising from next generation genome sequencing.
Acipenseriformes are composed of 25 sturgeon species and two paddlefish species distributed exclusively in the northern hemisphere. The Danube River and the Black Sea were originally inhabited by six sturgeon species but two are extinct and only four are still reproducing currently in the Lower Danube: Huso huso, Acipenser stellatus, A. gueldenstaedtii and A. ruthenus. Sturgeon species hybridize more easily than other fish and the determination of pure species or hybrid status is important for conservation and for breeding in fish farms. This survey demonstrated that morphological determination of this status is not reliable and a molecular tool, based on eight microsatellites genotypes is proposed. This method, based on three successive statistical analyses including Factorial Correspondence Analysis (FCA), Structure assignation and NewHybrids status determination, showed a high efficiency in discriminating pure species specimens from F1, F2 and two kinds of backcross individuals involving three of the four reproducing Lower Danube sturgeon species.
sturgeons; microsatellites; hybrids; statistical analysis