PMCC PMCC

Search tips
Search criteria

Advanced
Results 1-25 (741327)

Clipboard (0)
None

Related Articles

1.  Evidence for Widespread Degradation of Gene Control Regions in Hominid Genomes 
PLoS Biology  2005;3(2):e42.
Although sequences containing regulatory elements located close to protein-coding genes are often only weakly conserved during evolution, comparisons of rodent genomes have implied that these sequences are subject to some selective constraints. Evolutionary conservation is particularly apparent upstream of coding sequences and in first introns, regions that are enriched for regulatory elements. By comparing the human and chimpanzee genomes, we show here that there is almost no evidence for conservation in these regions in hominids. Furthermore, we show that gene expression is diverging more rapidly in hominids than in murids per unit of neutral sequence divergence. By combining data on polymorphism levels in human noncoding DNA and the corresponding human–chimpanzee divergence, we show that the proportion of adaptive substitutions in these regions in hominids is very low. It therefore seems likely that the lack of conservation and increased rate of gene expression divergence are caused by a reduction in the effectiveness of natural selection against deleterious mutations because of the low effective population sizes of hominids. This has resulted in the accumulation of a large number of deleterious mutations in sequences containing gene control elements and hence a widespread degradation of the genome during the evolution of humans and chimpanzees.
A comparison of hominid and rodent lineages reveals that the gene control regions of hominids are not conserved and are accumulating mutations, suggesting widespread degradation of the hominid genome
doi:10.1371/journal.pbio.0030042
PMCID: PMC544929  PMID: 15678168
2.  Rapid Sequence Evolution of Transcription Factors Controlling Neuron Differentiation in Caenorhabditis 
Molecular Biology and Evolution  2009;26(10):2373-2386.
Whether phenotypic evolution proceeds predominantly through changes in regulatory sequences is a controversial issue in evolutionary genetics. Ample evidence indicates that the evolution of gene regulatory networks via changes in cis-regulatory sequences is an important determinant of phenotypic diversity. However, recent experimental work suggests that the role of transcription factor (TF) divergence in developmental evolution may be underestimated. In order to help understand what levels of constraints are acting on the coding sequence of developmental regulatory genes, evolutionary rates were investigated among 48 TFs required for neuronal development in Caenorhabditis elegans. Allelic variation was then sampled for 28 of these genes within a population of the related species Caenorhabditis remanei. Neuronal TFs are more divergent, both within and between species, than structural genes. TFs affecting different neuronal classes are under different levels of selective constraints. The regulatory genes controlling the differentiation of chemosensory neurons evolve particularly fast and exhibit higher levels of within- and between-species nucleotide variation than TFs required for the development of several neuronal classes and TFs required for motorneuron differentiation. The TFs affecting chemosensory neuron development are also more divergent than chemosensory genes expressed in the neurons they differentiate. These results illustrate that TFs are not as highly constrained as commonly thought and suggest that the role of divergence in developmental regulatory genes during the evolution of gene regulatory networks requires further attention.
doi:10.1093/molbev/msp142
PMCID: PMC2766936  PMID: 19589887
Caenorhabditis; transcription factors; nucleotide variation; molecular evolution; chemosensory genes; neurons
3.  Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi 
Background
Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold.
Results
Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes.
Conclusions
The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
doi:10.1186/1471-2148-10-387
PMCID: PMC3009673  PMID: 21156082
4.  A Comprehensive Analysis of Gene Expression Evolution Between Humans and Mice 
Evolutionary changes in gene expression account for most phenotypic differences between species. Advances in microarray technology have made the systematic study of gene expression evolution possible. In this study, gene expression patterns were compared between human and mouse genomes using two published methods. Specifically, we studied how gene expression evolution was related to GO terms and tried to decode the relationship between promoter evolution and gene expression evolution. The results showed that (1) the significant enrichment of biological processes in orthologs of expression conservation reveals functional significance of gene expression conservation. The more conserved gene expression in some biological processes than is expected in a purely neutral model reveals negative selection on gene expression. However, fast evolving genes mainly support the neutrality of gene expression evolution, and (2) gene expression conservation is positively but only slightly correlated with promoter conservation based on a motif-count score of the promoter alignment. Our results suggest a neutral model with negative selection for gene expression evolution between humans and mice, and promoter evolution could have some effects on gene expression evolution.
PMCID: PMC2747126  PMID: 19812728
gene expression; evolution; ortholog; neutral model; promoter
5.  Relative Contributions of Intrinsic Structural–Functional Constraints and Translation Rate to the Evolution of Protein-Coding Genes 
A long-standing assumption in evolutionary biology is that the evolution rate of protein-coding genes depends, largely, on specific constraints that affect the function of the given protein. However, recent research in evolutionary systems biology revealed unexpected, significant correlations between evolution rate and characteristics of genes or proteins that are not directly related to specific protein functions, such as expression level and protein–protein interactions. The strongest connections were consistently detected between protein sequence evolution rate and the expression level of the respective gene. A recent genome-wide proteomic study revealed an extremely strong correlation between the abundances of orthologous proteins in distantly related animals, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster. We used the extensive protein abundance data from this study along with short-term evolutionary rates (ERs) of orthologous genes in nematodes and flies to estimate the relative contributions of structural–functional constraints and the translation rate to the evolution rate of protein-coding genes. Together the intrinsic constraints and translation rate account for approximately 50% of the variance of the ERs. The contribution of constraints is estimated to be 3- to 5-fold greater than the contribution of translation rate.
doi:10.1093/gbe/evq010
PMCID: PMC2940324  PMID: 20624725
protein evolution; structural–functional constraints; misfolding; protein abundance
6.  Housekeeping genes tend to show reduced upstream sequence conservation 
Genome Biology  2007;8(7):R140.
Mammalian housekeeping genes show significantly lower promoter sequence conservation, especially upstream of position -500 with respect to the transcription start site, than genes expressed in a subset of tissues.
Background
Understanding the constraints that operate in mammalian gene promoter sequences is of key importance to understand the evolution of gene regulatory networks. The level of promoter conservation varies greatly across orthologous genes, denoting differences in the strength of the evolutionary constraints. Here we test the hypothesis that the number of tissues in which a gene is expressed is related in a significant manner to the extent of promoter sequence conservation.
Results
We show that mammalian housekeeping genes, expressed in all or nearly all tissues, show significantly lower promoter sequence conservation, especially upstream of position -500 with respect to the transcription start site, than genes expressed in a subset of tissues. In addition, we evaluate the effect of gene function, CpG island content and protein evolutionary rate on promoter sequence conservation. Finally, we identify a subset of transcription factors that bind to motifs that are specifically over-represented in housekeeping gene promoters.
Conclusion
This is the first report that shows that the promoters of housekeeping genes show reduced sequence conservation with respect to genes expressed in a more tissue-restricted manner. This is likely to be related to simpler gene expression, requiring a smaller number of functional cis-regulatory motifs.
doi:10.1186/gb-2007-8-7-r140
PMCID: PMC2323216  PMID: 17626644
7.  The Evolution of Gene Expression QTL in Saccharomyces cerevisiae 
PLoS ONE  2007;2(8):e678.
Understanding the evolutionary forces that influence patterns of gene expression variation will provide insights into the mechanisms of evolutionary change and the molecular basis of phenotypic diversity. To date, studies of gene expression evolution have primarily been made by analyzing how gene expression levels vary within and between species. However, the fundamental unit of heritable variation in transcript abundance is the underlying regulatory allele, and as a result it is necessary to understand gene expression evolution at the level of DNA sequence variation. Here we describe the evolutionary forces shaping patterns of genetic variation for 1206 cis-regulatory QTL identified in a cross between two divergent strains of Saccharomyces cerevisiae. We demonstrate that purifying selection against mildly deleterious alleles is the dominant force governing cis-regulatory evolution in S. cerevisiae and estimate the strength of selection. We also find that essential genes and genes with larger codon bias are subject to slightly stronger cis-regulatory constraint and that positive selection has played a role in the evolution of major trans-acting QTL.
doi:10.1371/journal.pone.0000678
PMCID: PMC1925141  PMID: 17668057
8.  Divergence of nucleosome positioning between two closely related yeast species: genetic basis and functional consequences 
Inter-species hybrids can be used to dissect the relative contribution of cis and trans effects to the evolution of nucleosome positioning. Most (∼70%) differences in nucleosome positioning between two closely related yeast species are due to cis effects.Cis effects are primarily due to divergence of AT-rich nucleosome-disfavoring sequences, but are not associated with divergence of nucleosome-favoring sequences.Differences in nucleosome positioning propagate to multiple adjacent nucleosomes, supporting the statistical positioning hypothesis.Divergence of nucleosome positioning is excluded from regulatory elements and is not correlated with gene expression divergence, suggesting a neutral mode of evolution.
Phenotypic diversity is often due to changes in gene regulation, and recent studies have characterized extensive differences between the gene expression programs of closely related species (Khaitovich et al, 2006; Tirosh et al, 2009). However, very little is known about the mechanisms that drive this divergence. Here, we analyze the evolution of nucleosome positioning, by comparing the patterns of nucleosomes between two yeast species, as well as generating the allele-specific nucleosome profile in their hybrid. We ask two main questions: (1) what is the genetic basis of inter-species differences in nucleosome positioning? and (2) what is the regulatory function of these differences?
Generally speaking, we can classify the genetic basis of the divergence in nucleosome positioning into two mechanisms. First, mutations in the local DNA sequence may influence the ability to bind nucleosomes at this region; we refer to these as cis effects. Second, mutations may affect the activity of various proteins that alter nucleosome positioning either actively (e.g. chromatin-remodeling enzymes) or by simply competing with nucleosomes for binding to the same DNA sequence (e.g. transcription factors); we refer to these as trans effects.
To classify the observed inter-species differences into cis versus trans effects, we measured allele-specific nucleosome positions within the inter-specific hybrid of the two species (Wittkopp et al, 2004; Tirosh et al, 2009). The hybrid contains the alleles of both species; hence, cis effects, which involve mutations that discriminate between the two alleles, will be maintained in the hybrid so that nucleosome positioning will be different between the alleles coming from the different species. Trans effects, in contrast, will not discriminate between the two hybrid alleles from the different species, as these two alleles reside together at the same trans environment (hybrid nucleus) and are thus regulated by the same set of proteins—the combination of proteins from the two species. Using this approach, we found that ∼70% of the inter-species differences in nucleosome positioning are due to cis effects, whereas the rest is due to trans effects.
The local DNA sequence is indeed known to affect nucleosome positions, and many features of DNA sequences were proposed to influence nucleosome binding, either by rejecting nucleosomes, or by being favorable for nucleosome binding (Segal et al, 2006; Lee et al, 2007; Kaplan et al, 2009). We find, however, that nucleosome positions diverged primarily through changes in AT-rich sequences, which exclude nucleosomes, whereas mutations in sequences that correlate with high-nucleosome occupancy do not influence inter-species divergence.
Nucleosomes restrict the access of proteins to the DNA and may thus affect DNA-related processes such as transcription, recombination or replication. Indeed, promoters and regulatory sequences are often depleted of nucleosomes, and highly transcribed genes are associated with low occupancy of nucleosomes at their promoters (Lee et al, 2007). Several earlier studies also suggested that evolutionary divergence of gene expression is driven by changes in chromatin structure (Lee et al, 2006; Choi and Kim, 2008; Tirosh et al, 2008; Field et al, 2009). However, we find that nucleosome positions (or occupancy) at regulatory elements are largely conserved, and furthermore, that the inter-species differences in nucleosome positions do not correlate with gene expression differences. These results suggest that nucleosome positioning is not a central mechanism for evolutionary changes in gene regulation and that most of the observed changes may be due to neutral drift.
Does the apparent low influence of nucleosome positioning on gene expression divergence implies that nucleosome positions do not have a function in gene regulation? To address this, we examined two additional modes of gene regulation: transcriptional response to changes in growth conditions (glucose versus glycerol media), and the expression differences between different cell types (haploid versus diploid cells). Consistent with earlier studies, we found that the response to growth conditions is significantly, albeit weakly, associated with changes in nucleosome positioning. Interestingly, we also found a strikingly strong association between gene expression and nucleosomal changes in the two cell types. Taken together, these results suggest that nucleosome positioning is used preferentially for biological processes in which genes are turned on and off (e.g. different cell type), but less so during divergence of closely related species in which gradual changes accumulate over time.
Gene regulation differs greatly between related species, constituting a major source of phenotypic diversity. Recent studies characterized extensive differences in the gene expression programs of closely related species. In contrast, virtually nothing is known about the evolution of chromatin structure and how it influences the divergence of gene expression. Here, we compare the genome-wide nucleosome positioning of two closely related yeast species and, by profiling their inter-specific hybrid, trace the genetic basis of the observed differences into mutations affecting the local DNA sequences (cis effects) or the upstream regulators (trans effects). The majority (∼70%) of inter-species differences is due to cis effects, leaving a significant contribution (30%) for trans factors. We show that cis effects are well explained by mutations in nucleosome-disfavoring AT-rich sequences, but are not associated with divergence of nucleosome-favoring sequences. Differences in nucleosome positioning propagate to multiple adjacent nucleosomes, supporting the statistical positioning hypothesis, and we provide evidence that nucleosome-free regions, but not the +1 nucleosome, serve as stable border elements. Surprisingly, although we find that differential nucleosome positioning among cell types is strongly correlated with differential expression, this does not seem to be the case for evolutionary changes: divergence of nucleosome positioning is excluded from regulatory elements and is not correlated with gene expression divergence, suggesting a primarily neutral mode of evolution. Our results provide evolutionary insights to the genetic determinants and regulatory function of nucleosome positioning.
doi:10.1038/msb.2010.20
PMCID: PMC2890324  PMID: 20461072
evolution; gene regulation; nucleosome positioning
9.  Negative Correlation between Expression Level and Evolutionary Rate of Long Intergenic Noncoding RNAs 
Genome Biology and Evolution  2011;3:1390-1404.
Mammalian genomes contain numerous genes for long noncoding RNAs (lncRNAs). The functions of the lncRNAs remain largely unknown but their evolution appears to be constrained by purifying selection, albeit relatively weakly. To gain insights into the mode of evolution and the functional range of the lncRNA, they can be compared with much better characterized protein-coding genes. The evolutionary rate of the protein-coding genes shows a universal negative correlation with expression: highly expressed genes are on average more conserved during evolution than the genes with lower expression levels. This correlation was conceptualized in the misfolding-driven protein evolution hypothesis according to which misfolding is the principal cost incurred by protein expression. We sought to determine whether long intergenic ncRNAs (lincRNAs) follow the same evolutionary trend and indeed detected a moderate but statistically significant negative correlation between the evolutionary rate and expression level of human and mouse lincRNA genes. The magnitude of the correlation for the lincRNAs is similar to that for equal-sized sets of protein-coding genes with similar levels of sequence conservation. Additionally, the expression level of the lincRNAs is significantly and positively correlated with the predicted extent of lincRNA molecule folding (base-pairing), however, the contributions of evolutionary rates and folding to the expression level are independent. Thus, the anticorrelation between evolutionary rate and expression level appears to be a general feature of gene evolution that might be caused by similar deleterious effects of protein and RNA misfolding and/or other factors, for example, the number of interacting partners of the gene product.
doi:10.1093/gbe/evr116
PMCID: PMC3242500  PMID: 22071789
long noncoding RNA; ncRNA; RNA expression; genomic alignments; introns; RNA folding
10.  Patterns and rates of intron divergence between humans and chimpanzees 
Genome Biology  2007;8(2):R21.
An analysis of human-chimpanzee intron divergence shows strong correlations between intron length and divergence and GC-content.
Background
Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results.
Results
We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns).
Conclusion
We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints.
doi:10.1186/gb-2007-8-2-r21
PMCID: PMC1852421  PMID: 17309804
11.  Positive and Purifying Selection Influence the Evolution of Doublesex in the Anastrepha fraterculus Species Group 
PLoS ONE  2012;7(3):e33446.
The gene doublesex (dsx) is considered to be under strong selective constraint along its evolutionary history because of its central role in somatic sex differentiation in insects. However, previous studies of dsx used global estimates of evolutionary rates to investigate its molecular evolution, which potentially miss signals of adaptive changes in generally conserved genes. In this work, we investigated the molecular evolution of dsx in the Anastrepha fraterculus species group (Diptera, Tephritidae), and test the hypothesis that this gene evolved solely by purifying selection using divergence-based and population-based methods. In the first approach, we compared sequences from Anastrepha and other Tephritidae with other Muscomorpha species, analyzed variation in nonsynonymous to synonymous rate ratios (dN/dS) in the Tephritidae, and investigated radical and conservative changes in amino acid physicochemical properties. We show a general selective constraint on dsx, but with signs of positive selection mainly in the common region. Such changes were localized in alpha-helices previously reported to be involved in dimer formation in the OD2 domain and near the C-terminal of the OD1 domain. In the population-based approach, we amplified a region of 540 bp that spanned almost all of the region common to both sexes from 32 different sites in Brazil. We investigated patterns of selection using neutrality tests based on the frequency spectrum and locations of synonymous and nonsynonymous mutations in a haplotype network. As in the divergence-based approach, these analyses showed that dsx has evolved under an overall selective constraint, but with some events of positive selection. In contrast to previous studies, our analyses indicate that even though dsx has indeed evolved as a conserved gene, the common region of dsx has also experienced bouts of positive selection, perhaps driven by sexual selection, during its evolution.
doi:10.1371/journal.pone.0033446
PMCID: PMC3302808  PMID: 22428050
12.  A Case-by-Case Evolutionary Analysis of Four Imprinted Retrogenes 
Retroposition is a widespread phenomenon resulting in the generation of new genes that are initially related to a parent gene via very high coding sequence similarity. We examine the evolutionary fate of four retrogenes generated by such an event; mouse Inpp5f_v2, Mcts2, Nap1l5, and U2af1-rs1. These genes are all subject to the epigenetic phenomenon of parental imprinting. We first provide new data on the age of these retrogene insertions. Using codon-based models of sequence evolution, we show these retrogenes have diverse evolutionary trajectories, including divergence from the parent coding sequence under positive selection pressure, purifying selection pressure maintaining parent-retrogene similarity, and neutral evolution. Examination of the expression pattern of retrogenes shows an atypical, broad pattern across multiple tissues. Protein 3D structure modeling reveals that a positively selected residue in U2af1-rs1, not shared by its parent, may influence protein conformation. Our case-by-case analysis of the evolution of four imprinted retrogenes reveals that this interesting class of imprinted genes, while similar in regulation and sequence characteristics, follow very varied evolutionary paths.
doi:10.1111/j.1558-5646.2010.01213.x
PMCID: PMC3107425  PMID: 21166792
Epigenetics; gene expression; imprinting; molecular evolution; retrogene
13.  Divergence of imprinted genes during mammalian evolution 
Background
In contrast to the majority of mammalian genes, imprinted genes are monoallelically expressed with the choice of the active allele depending on its parental origin. Due to their special inheritance patterns, maternally and paternally expressed genes might be under different evolutionary pressure. Here, we aimed at assessing the evolutionary history of imprinted genes.
Results
In this study, we investigated the conservation of imprinted genes in vertebrate genomes and their exposition to natural selection. In a genome-wide comparison, orthologs of imprinted genes show a stronger divergence on cDNA and protein level in mammals. This pattern is most pronounced for maternally expressed genes in rodents in comparison to their non-rodent orthologs. The divergence is not attributable to increased mutation of CpG positions. It is contrasted by strong conservation of paternally expressed genes in mouse and rat. Interestingly, we found that the early divergence of imprinted genes was accompanied by an unusually strict conservation of their paralogs.
Conclusions
The apparent degeneration of maternally expressed genes may reflect a relaxation of selective pressure due to counteracting effects on maternal and embryonic fitness. Functional redundancy provided by the presence of highly conserved (non-imprinted) paralogs may have facilitated the divergence. Moreover, intensification of imprinting in modern rodents seems to have shifted the evolutionary fate of imprinted genes towards strong purifying selection.
doi:10.1186/1471-2148-10-116
PMCID: PMC2875234  PMID: 20429903
14.  Gene Expression Divergence is Coupled to Evolution of DNA Structure in Coding Regions 
PLoS Computational Biology  2011;7(11):e1002275.
Sequence changes in coding region and regulatory region of the gene itself (cis) determine most of gene expression divergence between closely related species. But gene expression divergence between yeast species is not correlated with evolution of primary nucleotide sequence. This indicates that other factors in cis direct gene expression divergence. Here, we studied the contribution of DNA three-dimensional structural evolution as cis to gene expression divergence. We found that the evolution of DNA structure in coding regions and gene expression divergence are correlated in yeast. Similar result was also observed between Drosophila species. DNA structure is associated with the binding of chromatin remodelers and histone modifiers to DNA sequences in coding regions, which influence RNA polymerase II occupancy that controls gene expression level. We also found that genes with similar DNA structures are involved in the same biological process and function. These results reveal the previously unappreciated roles of DNA structure as cis-effects in gene expression.
Author Summary
The unique phenotype of each organism is partly determined by gene expression. Changes in gene expression are an important source of phenotypic variation, and can be caused by changes in regulatory and coding sequences of the gene itself (cis) and changes in regulatory factors (trans). The contribution of cis regulation to gene expression divergence between closely related species is much greater than that of trans regulation. However, evolution of primary nucleotide sequences is not correlated with gene expression divergence in yeast, suggesting that other factors in cis drive gene expression divergence. Here, we found that evolution of DNA structure in coding regions is coupled to gene expression divergence in yeast. We also found that DNA structure is associated with specific gene characteristics. Genes with similar DNA structures are involved in the same biological process and function. These results demonstrate the important roles of DNA structure in directing gene expression.
doi:10.1371/journal.pcbi.1002275
PMCID: PMC3219629  PMID: 22125484
15.  Complex patterns of divergence among green-sensitive (RH2a) African cichlid opsins revealed by Clade model analyses 
Background
Gene duplications play an important role in the evolution of functional protein diversity. Some models of duplicate gene evolution predict complex forms of paralog divergence; orthologous proteins may diverge as well, further complicating patterns of divergence among and within gene families. Consequently, studying the link between protein sequence evolution and duplication requires the use of flexible substitution models that can accommodate multiple shifts in selection across a phylogeny. Here, we employed a variety of codon substitution models, primarily Clade models, to explore how selective constraint evolved following the duplication of a green-sensitive (RH2a) visual pigment protein (opsin) in African cichlids. Past studies have linked opsin divergence to ecological and sexual divergence within the African cichlid adaptive radiation. Furthermore, biochemical and regulatory differences between the RH2aα and RH2aβ paralogs have been documented. It thus seems likely that selection varies in complex ways throughout this gene family.
Results
Clade model analysis of African cichlid RH2a opsins revealed a large increase in the nonsynonymous-to-synonymous substitution rate ratio (ω) following the duplication, as well as an even larger increase, one consistent with positive selection, for Lake Tanganyikan cichlid RH2aβ opsins. Analysis using the popular Branch-site models, by contrast, revealed no such alteration of constraint. Several amino acid sites known to influence spectral and non-spectral aspects of opsin biochemistry were found to be evolving divergently, suggesting that orthologous RH2a opsins may vary in terms of spectral sensitivity and response kinetics. Divergence appears to be occurring despite intronic gene conversion among the tandemly-arranged duplicates.
Conclusions
Our findings indicate that variation in selective constraint is associated with both gene duplication and divergence among orthologs in African cichlid RH2a opsins. At least some of this variation may reflect an adaptive response to differences in light environment. Interestingly, these patterns only became apparent through the use of Clade models, not through the use of the more widely employed Branch-site models; we suggest that this difference stems from the increased flexibility associated with Clade models. Our results thus bear both on studies of cichlid visual system evolution and on studies of gene family evolution in general.
doi:10.1186/1471-2148-12-206
PMCID: PMC3514295  PMID: 23078361
Codon substitution model; Visual pigment evolution; Nonsynonymous-to-synonymous substitution rate ratio; dN/dS; Clade model; Maximum likelihood; Gene family evolution
16.  A Scan for Positively Selected Genes in the Genomes of Humans and Chimpanzees 
PLoS Biology  2005;3(6):e170.
Since the divergence of humans and chimpanzees about 5 million years ago, these species have undergone a remarkable evolution with drastic divergence in anatomy and cognitive abilities. At the molecular level, despite the small overall magnitude of DNA sequence divergence, we might expect such evolutionary changes to leave a noticeable signature throughout the genome. We here compare 13,731 annotated genes from humans to their chimpanzee orthologs to identify genes that show evidence of positive selection. Many of the genes that present a signature of positive selection tend to be involved in sensory perception or immune defenses. However, the group of genes that show the strongest evidence for positive selection also includes a surprising number of genes involved in tumor suppression and apoptosis, and of genes involved in spermatogenesis. We hypothesize that positive selection in some of these genes may be driven by genomic conflict due to apoptosis during spermatogenesis. Genes with maximal expression in the brain show little or no evidence for positive selection, while genes with maximal expression in the testis tend to be enriched with positively selected genes. Genes on the X chromosome also tend to show an elevated tendency for positive selection. We also present polymorphism data from 20 Caucasian Americans and 19 African Americans for the 50 annotated genes showing the strongest evidence for positive selection. The polymorphism analysis further supports the presence of positive selection in these genes by showing an excess of high-frequency derived nonsynonymous mutations.
Humans and chimps diverged about 5 million years ago. This study seeks to find the genes that have undergone positive selection during the evolution of both lineages since that time.
doi:10.1371/journal.pbio.0030170
PMCID: PMC1088278  PMID: 15869325
17.  Is Transcription Factor Binding Site Turnover a Sufficient Explanation for Cis-Regulatory Sequence Divergence? 
The molecular evolution of cis-regulatory sequences is not well understood. Comparisons of closely related species show that cis-regulatory sequences contain a large number of sites constrained by purifying selection. In contrast, there are a number of examples from distantly related species where cis-regulatory sequences retain little to no sequence similarity but drive similar patterns of gene expression. Binding site turnover, whereby the gain of a redundant binding site enables loss of a previously functional site, is one model by which cis-regulatory sequences can diverge without a concurrent change in function. To determine whether cis-regulatory sequence divergence is consistent with binding site turnover, we examined binding site evolution within orthologous intergenic sequences from 14 yeast species defined by their syntenic relationships with adjacent coding sequences. Both local and global alignments show that nearly all distantly related orthologous cis-regulatory sequences have no significant level of sequence similarity but are enriched for experimentally identified binding sites. Yet, a significant proportion of experimentally identified binding sites that are conserved in closely related species are absent in distantly related species and so cannot be explained by binding site turnover. Depletion of binding sites depends on the transcription factor but is detectable for a quarter of all transcription factors examined. Our results imply that binding site turnover is not a sufficient explanation for cis-regulatory sequence evolution.
doi:10.1093/gbe/evq066
PMCID: PMC2997565  PMID: 21068212
evolution; regulation; yeast
18.  On Theoretical Models of Gene Expression Evolution with Random Genetic Drift and Natural Selection 
PLoS ONE  2009;4(11):e7943.
Background
The relative contributions of natural selection and random genetic drift are a major source of debate in the study of gene expression evolution, which is hypothesized to serve as a bridge from molecular to phenotypic evolution. It has been suggested that the conflict between views is caused by the lack of a definite model of the neutral hypothesis, which can describe the long-run behavior of evolutionary change in mRNA abundance. Therefore previous studies have used inadequate analogies with the neutral prediction of other phenomena, such as amino acid or nucleotide sequence evolution, as the null hypothesis of their statistical inference.
Methodology/Principal Findings
In this study, we introduced two novel theoretical models, one based on neutral drift and the other assuming natural selection, by focusing on a common property of the distribution of mRNA abundance among a variety of eukaryotic cells, which reflects the result of long-term evolution. Our results demonstrated that (1) our models can reproduce two independently found phenomena simultaneously: the time development of gene expression divergence and Zipf's law of the transcriptome; (2) cytological constraints can be explicitly formulated to describe long-term evolution; (3) the model assuming that natural selection optimized relative mRNA abundance was more consistent with previously published observations than the model of optimized absolute mRNA abundances.
Conclusions/Significance
The models introduced in this study give a formulation of evolutionary change in the mRNA abundance of each gene as a stochastic process, on the basis of previously published observations. This model provides a foundation for interpreting observed data in studies of gene expression evolution, including identifying an adequate time scale for discriminating the effect of natural selection from that of random genetic drift of selectively neutral variations.
doi:10.1371/journal.pone.0007943
PMCID: PMC2776274  PMID: 19936214
19.  Extensive parallelism in protein evolution 
Biology Direct  2007;2:20.
Background
Independently evolving lineages mostly accumulate different changes, which leads to their gradual divergence. However, parallel accumulation of identical changes is also common, especially in traits with only a small number of possible states.
Results
We characterize parallelism in evolution of coding sequences in three four-species sets of genomes of mammals, Drosophila, and yeasts. Each such set contains two independent evolutionary paths, which we call paths I and II. An amino acid replacement which occurred along path I also occurs along path II with the probability 50–80% of that expected under selective neutrality. Thus, the per site rate of parallel evolution of proteins is several times higher than their average rate of evolution, but still lower than the rate of evolution of neutral sequences. This deficit may be caused by changes in the fitness landscape, leading to a replacement being possible along path I but not along path II. However, constant, weak selection assumed by the nearly neutral model of evolution appears to be a more likely explanation. Then, the average coefficient of selection associated with an amino acid replacement, in the units of the effective population size, must exceed ~0.4, and the fraction of effectively neutral replacements must be below ~30%. At a majority of evolvable amino acid sites, only a relatively small number of different amino acids is permitted.
Conclusion
High, but below-neutral, rates of parallel amino acid replacements suggest that a majority of amino acid replacements that occur in evolution are subject to weak, but non-trivial, selection, as predicted by Ohta's nearly-neutral theory.
Reviewers
This article was reviewed by John McDonald (nominated by Laura Landweber), Sarah Teichmann and Subhajyoti De, and Chris Adami.
doi:10.1186/1745-6150-2-20
PMCID: PMC2020468  PMID: 17705846
20.  Ontogeny and phylogeny: molecular signatures of selection, constraint, and temporal pleiotropy in the development of Drosophila 
BMC Biology  2009;7:42.
Background
Karl Ernst Von Baer noted that species tend to show greater morphological divergence in later stages of development when compared to earlier stages. Darwin originally interpreted these observations via a selectionist framework, suggesting that divergence should be greatest during ontogenic stages in which organisms experienced varying 'conditions of existence' and opportunity for differential selection. Modern hypotheses have focused on the notion that genes and structures involved in early development will be under stronger purifying selection due to the deleterious pleiotropic effects of mutations propagating over the course of ontogeny, also known as the developmental constraint hypothesis.
Results
Using developmental stage-specific expressed sequence tag (EST) libraries, we tested the 2 hypotheses by comparing the rates of evolution of 7,180 genes obtained from 6 species of the Drosophila melanogaster group with respect to ontogeny, and sex and reproduction-related functions in gonadal tissues. Supporting morphological observations, we found evidence of a pattern of increasing mean evolutionary rate in genes that are expressed in subsequent stages of development. Furthermore, supporting expectations that early expressed genes are constrained in divergence, we found that embryo stage genes are involved in a higher mean number of interactions as compared to later stages. We noted that the accelerated divergence of genes in the adult stage is explained by those expressed specifically in the male gonads, whose divergence is driven by positive selection. In addition, accelerated gonadal gene divergence occurs only in the adult stage, suggesting that the effects of selection are observed primarily at the stages during which they are expected occur. Finally, we also found a significant correlation between temporal specificity of gene expression and evolutionary rate, supporting expectations that genes with ubiquitous expression are under stronger constraint.
Conclusion
Taken together, these results support both the developmental constraint hypothesis limiting the divergence of early expressed developmentally important genes, leading to a gradient of divergence rates over ontogeny (embryonic < larval/pupal < adult), as well as Darwin's 'selection opportunity' hypothesis leading to increased divergence in adults, particularly in the case of reproductive tissues. We suggest that a constraint early/opportunity late model best explains divergence over ontogeny.
doi:10.1186/1741-7007-7-42
PMCID: PMC2722573  PMID: 19622136
21.  Adaptive Gene Expression Divergence Inferred from Population Genomics 
PLoS Genetics  2007;3(10):e187.
Detailed studies of individual genes have shown that gene expression divergence often results from adaptive evolution of regulatory sequence. Genome-wide analyses, however, have yet to unite patterns of gene expression with polymorphism and divergence to infer population genetic mechanisms underlying expression evolution. Here, we combined genomic expression data—analyzed in a phylogenetic context—with whole genome light-shotgun sequence data from six Drosophila simulans lines and reference sequences from D. melanogaster and D. yakuba. These data allowed us to use molecular population genetics to test for neutral versus adaptive gene expression divergence on a genomic scale. We identified recent and recurrent adaptive evolution along the D. simulans lineage by contrasting sequence polymorphism within D. simulans to divergence from D. melanogaster and D. yakuba. Genes that evolved higher levels of expression in D. simulans have experienced adaptive evolution of the associated 3′ flanking and amino acid sequence. Concomitantly, these genes are also decelerating in their rates of protein evolution, which is in agreement with the finding that highly expressed genes evolve slowly. Interestingly, adaptive evolution in 5′ cis-regulatory regions did not correspond strongly with expression evolution. Our results provide a genomic view of the intimate link between selection acting on a phenotype and associated genic evolution.
Author Summary
Changes in patterns of gene expression likely contribute greatly to phenotypic differences among closely related organisms. However, the evolutionary mechanisms, such as Darwinian selection and random genetic drift, which are underlying differences in patterns of expression, are only now being understood on a genomic level. We combine measurements of gene expression and whole-genome sequence data to investigate the relationship between the forces driving sequence evolution and expression divergence among closely related fruit flies. We find that Darwinian selection acting on regions that may control gene expression is associated with increases in gene expression levels. Investigation of the functional consequences of adaptive evolution on regulating gene expression is clearly warranted. The genetic tools available in Drosophila make functional experiments possible and will shed light on how closely related species have responded to reproductive, pathogenic, and environmental pressures.
doi:10.1371/journal.pgen.0030187
PMCID: PMC2042001  PMID: 17967066
22.  Functional protein divergence in the evolution of Homo sapiens 
Genome Biology  2008;9(2):R33.
Quantification of the divergence of proteins by functional category shows that morphological changes in metazoa have been driven by variation in regulatory genes.
Background
Protein-coding regions in a genome evolve by sequence divergence and gene gain and loss, altering the gene content of the organism. However, it is not well understood how this has given rise to the enormous diversity of metazoa present today.
Results
To obtain a global view of human genomic evolution, we quantify the divergence of proteins by functional category at different evolutionary distances from human.
Conclusion
This analysis highlights some general systems-level characteristics of human evolution: regulatory processes, such as signal transducers, transcription factors and receptors, have a high degree of plasticity, while core processes, such as metabolism, transport and protein synthesis, are largely conserved. Additionally, this study reveals a dynamic picture of selective forces at short, medium and long evolutionary timescales. Certain functional categories, such as 'development' and 'organogenesis', exhibit temporal patterns of sequence divergence in eukaryotes relative to human. This framework for a grammar of human evolution supports previously postulated theories of robustness and evolvability.
doi:10.1186/gb-2008-9-2-r33
PMCID: PMC2374701  PMID: 18279504
23.  Effects of X-Linkage and Sex-Biased Gene Expression on the Rate of Adaptive Protein Evolution in Drosophila 
Molecular Biology and Evolution  2008;25(8):1639-1650.
Patterns of polymorphism and divergence in Drosophila protein-coding genes suggest that a considerable fraction of amino acid differences between species can be attributed to positive selection and that genes with sex-biased expression, that is, those expressed predominantly in one sex, have especially high rates of adaptive evolution. Previous studies, however, have been restricted to autosomal sex-biased genes and, thus, do not provide a complete picture of the evolutionary forces acting on sex-biased genes across the genome. To determine the effects of X-linkage on sex-biased gene evolution, we surveyed DNA sequence polymorphism and divergence in 45 X-linked genes, including 17 with male-biased expression, 13 with female-biased expression, and 15 with equal expression in the 2 sexes. Using both single- and multilocus tests for selection, we found evidence for adaptive evolution in both groups of sex-biased genes. The signal of adaptive evolution was particularly strong for X-linked male-biased genes. A comparison with data from 91 autosomal genes revealed a “fast-X” effect, in which the rate of adaptive evolution was greater for X-linked than for autosomal genes. This effect was strongest for male-biased genes but could be seen in the other groups as well. A genome-wide analysis of coding sequence divergence that accounted for sex-biased expression also uncovered a fast-X effect for male-biased and unbiased genes, suggesting that recessive beneficial mutations play an important role in adaptation.
doi:10.1093/molbev/msn111
PMCID: PMC2727381  PMID: 18477586
positive selection; gene expression; sex linkage; fast-X effect; Drosophila
24.  Comparable contributions of structural-functional constraints and expression level to the rate of protein sequence evolution 
Biology Direct  2008;3:40.
Background
Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constrains were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate.
Results
This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a mutlidomain protein, all domains are translated at the same rate. We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude.
Conclusion
Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution.
Reviewers
This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section.
doi:10.1186/1745-6150-3-40
PMCID: PMC2572155  PMID: 18840284
25.  Many genes in fish have species-specific asymmetric rates of molecular evolution 
BMC Genomics  2006;7:20.
Background
Gene and genome duplication events increase the amount of genetic material that might then contribute to an increase in the genomic and phenotypic complexity of organisms during evolution. Thus, it has been argued that there is a relationship between gene copy number and morphological complexity and/or species diversity. This hypothesis implies that duplicated genes have subdivided or evolved novel functions compared to their pre-duplication proto-orthologs. Such a functional divergence might be caused by an increase in evolutionary rates in one ortholog, by changes in expression, regulatory evolution, insertion of repetitive elements, or due to positive Darwinian selection in one copy. We studied a set of 2466 genes that were present in Danio rerio, Takifugu rubripes, Tetraodon nigroviridis and Oryzias latipes to test (i) for forces of positive Darwinian selection; (ii) how frequently duplicated genes are retained, and (iii) whether novel gene functions might have evolved.
Results
25% (610) of all investigated genes show significantly smaller or higher genetic distances in the genomes of particular fish species compared to their human ortholog than their orthologs in other fish according to relative rate tests. We identified 49 new paralogous pairs of duplicated genes in fish, in which one of the paralogs is under positive Darwinian selection and shows a significantly higher rate of molecular evolution in one of the four fish species, whereas the other copy apparently did not undergo adaptive changes since it retained the original rate of evolution. Among the genes under positive Darwinian selection, we found a surprisingly high number of ATP binding proteins and transcription factors.
Conclusion
The significant rate difference suggests that the function of these rate-changed genes might be essential for the respective fish species. We demonstrate that the measurement of positive selection is a powerful tool to identify divergence rates of duplicated genes and that this method has the capacity to identify potentially interesting candidates for adaptive gene evolution.
doi:10.1186/1471-2164-7-20
PMCID: PMC1413527  PMID: 16466575

Results 1-25 (741327)