Description of Data
From BLAT searches on archived genomes, we recovered six Nav channel genes from
Xenopus, nine from lizard, six from platypus (plus two smaller fragments that were not used for analysis), eight from opossum, and nine from chicken. Where possible, predicted
X. tropicalis gene sequences were confirmed by ESTs, and in two cases (
xt236 and
xt464b), incomplete genomic sequences were filled in by overlap with ESTs (
supplementary fig. 1,
Supplementary Material online).
We derived pieces of Nav channel genes from elephant shark. Due to the low coverage, each contig had one or at most a few exons, and contigs could not be unambiguously connected. However, we identified 3–4 distinct pieces corresponding to most exons of the Nav channel gene that suggested a total of four Nav channel genes. We recovered four contigs with all or part of the most 3′ exon (the longest exon in all vertebrate Nav channel genes); one contig included two other exons. This gave us sequences of from ~200 to 500 amino acids.
By RT-PCR, we cloned three Nav channel genes from lungfish, one each from brain and spinal cord, muscle, and heart tissue; two from horn shark, one from muscle and one from heart; and one from skate brain. These were not complete sequences but covered domains II–III (lungfish muscle and skate brain) or domains I–III (shark and lungfish heart, lungfish brain and lungfish muscle). Given the limitations of our tissue samples and the RT-PCR method, we do not claim that this provides the full complement of Nav channel genes of these species.
As we show below, we have a strong case for homologizing the amniote Nav channel genes; therefore, we use the mammalian gene nomenclature (SCNxA, where x = a number) for Nav channel genes from mammals, birds, and lizards (protein designation = Nav; gene designation = SCN). But because not all the nonamniote genes are orthologs of amniote genes, we did not use the mammalian nomenclature for these.
Expansion of Nav Channel Genes Occurred in the Devonian and Carboniferous Periods
We generated phylogenetic trees without (alignment1: ) and with (alignment2:
supplementary fig. 2,
Supplementary Material online) the four elephant shark sequences. Resulting tree topologies of alignment1 were identical for the two runs with minor variation in the posterior probabilities for a few branches. Alignment2 was run only once due to the low support values in the parts of the tree including the elephant shark sequences (
supplementary fig. 2,
Supplementary Material online). We will focus on alignment1.
SCN8A has a simple history with no duplications tracing back to an ancestral gene that is also represented in elasmobranchs (
skatebrain), lungfish (
lungfishbrain), and amphibians (
xt67) (: shaded in light blue). There are two duplicates in zebrafish due to a teleost-specific WGD. As in humans,
SCN8A orthologs of frog, chicken, and lizard reside alone on a chromosome (
supplementary table I,
Supplementary Material online).
SCN4A shows a similar history with amniote orthologs grouping with genes from frog (
xt43), lungfish (
lungfishmuscle), and shark (
sharkmuscle) and a pair of duplicate genes in teleosts (: shaded in yellow). The orthologs of
SCN4A also reside singly on a chromosome (
supplementary table I,
Supplementary Material online).
Orthologs of the three Nav channel genes on human chromosome 3 (SCN5A, SCN10A, and SCN11A) are found in other mammals, chicken, and lizard, and these three genes have shared synteny (: shaded in light red). They derive from a single gene represented in our sample by orthologs in shark (sharkheart), zebrafish (SCN5Laa and SCN5Lab), lungfish (lungfishheart), and frog (xt28). The single frog gene is syntenically related to the amniote genes (). The presence of a single gene at the amphibian–amniote split and of three genes before the synapsid (mammals)–diapsid (reptiles and birds) divergence means that two duplications of the ancestral gene occurred in a 30-My window at the origin of amniotes in the lower to mid-Carboniferous periods.
The history of the Nav channel genes on human chromosome 2 is more complex. These genes derive from a fourth ancestral gene (
Novak et al. 2006), although we were unable to detect an ortholog of this gene in lungfish tissues or skate brain despite extensive attempts to amplify it from RNA and genomic DNA. As expected, there are two gene duplicates in zebrafish (
SCN1Laa and
SCN1Lab). The gene tree suggests that a single ancestral gene underwent independent duplications in amphibians and amniotes (: shaded in green). In this scenario, multiple duplications of a putative ancestral Nav channel gene would have generated
SCN1A,
SCN2A,
SCN3A, and
SCN7A/SCN9A (the precursor to separate
SCNA7a and
SCN9A genes). Because these all have orthologs in mammals, lizards, and chicken, these duplications would have occurred within the same 30-My window as the triplicated genes on human chromosome 3 (). The final duplication of
SCN7A/
SCN9A into
SCN7A and
SCN9A likely occurred after the divergence of monotreme and therian mammals (220 Ma) preceding the marsupial–placental split (175 Ma). However, given the low values of the posterior probabilities in our trees, there is some uncertainty about the timing of this duplication.
On the other hand, the most parsimonious interpretation of the synteny () is that two duplications had already occurred in the common ancestor of amphibians and amniotes. This is because the Nav channel and other genes in this region of the amphibian and amniote chromosomes have the same syntenic relationships. Additionally, one Nav channel gene in each lineage and in the same relative chromosomal position (amphibian xt464b and amniote SCN2A) is oppositely oriented on the chromosome to all the other Nav channel genes (arrows, ). This suggests that most of the duplications occurred at a slower rate over 130 My.
What might account for the discrepancy in interpretation between the gene tree and synteny data? It is unlikely to be due to
Xenopus Nav channel genes that were missed or misassembled in genome sequencing. First, all the ESTs that we uncovered uniquely matched a specific gene (in most cases multiple ESTs were mapped to each gene), and all
Xenopus Nav channel genes were represented in the EST database (
supplementary fig. 1,
Supplementary Material online). Second, the few additional amphibian Nav channel genes available from GenBank of sufficient length to align (e.g., newt and xlaev1.2) appeared to be orthologs of genes that we had already uncovered in
X. tropicalis. Finally, the
Xenopus scaffolds were assembled from overlapping reads of shotgun sequence de novo so that the apparent synteny is not an artifact (
Hellsten et al. 2010). It is possible, but seems unlikely, that independent duplications in amphibians and amniotes could have resulted in identical patterns of synteny. If the duplications had occurred in the common ancestor of amphibians and amniotes as suggested by synteny, then the nonoverlapping clustering of amphibian and amniote genes in the tree might be explained by some amount of gene conversion within the amniote and/or amphibian lineages, as sometimes occurs following gene duplications (
Kellis et al. 2004).
Alignment2 included the fragments from elephant shark (
supplementary fig. 2,
Supplementary Material online). As expected, the inclusion of these short sequences resulted in low posterior probabilities for them and their neighboring branches; these were too low to trust their exact positioning in the tree. However, each of the 4 sequences grouped with 1 of the 4 clades of Nav channel genes with extremely strong (posterior probabilities = 100) support. Furthermore, a BLAT search of GenBank with the nucleotide sequences of each of the four 3′ exons had top matches with sequences in one specific clade. The inclusion of these sequences supports our contention (
Lopreato et al. 2001;
Novak et al. 2006) that the ancestor of teleosts and tetrapods had four Nav channel genes and gives an indication of the “missing” ancestor of the fourth Nav channel gene clade.
In sum, we suggest that after the second WGD (estimated at ~550 Ma,
Meyer and Schartl 1999;
Dehal and Boore 2005;
Panopoulou and Poustka 2005;
Blomme et al. 2006) brought the number of Nav channel genes to four, there was a period of stasis. Then a series of tandem Nav channel gene duplications occurred in a 30- to 130-My period during early tetrapod evolution, following which the Nav channel gene family remained largely stable for another 300 My ().
Genes Flanking the Nav Channel Genes Did Not Duplicate or Were Not Retained
We posit that the retention of the Nav channel gene paralogs was due to selection. The null hypothesis is that the Nav channel gene expansion was simply a consequence of instability in the regions of these two chromosomes in which the Nav channel genes reside that led to duplication and retention of all the genes in this region. We tested the null hypothesis by examining whether flanking genes show a similar history of duplication and retention. We sampled only those flanking genes that were located on chromosomes or scaffolds of the species for which we had synteny information. In our sample ( and ) 14 of 15 genes showed no tandem duplications on these chromosomes, with SLC22A13 and SLC22A14 being a duplicate pair. Given that the Nav channel genes on these two chromosomes have both duplicated (2 out of 2: this analysis ignores the fact that the Nav channel genes underwent multiple duplications), this is significantly greater than expected given the number of duplications of the flanking non-Nav channel genes on these two chromosomes (1 out of 14) (P < 0.0001, two-tailed chi-square).
However, it is also possible that these flanking genes duplicated but were then dispersed throughout the genome, whereas the Nav channel genes were retained where they duplicated. We derived sequences for the 16 flanking genes in and from GenBank or searches from genome databases and constructed trees for them. We found that most showed indications of the initial two rounds of WGD but no duplications in the 450 My following the actinopterygian–sarcopterygian split (). The only exception, mentioned above, was the duplicate pair of SLC22A members13 and 14. This gene family has a history of extensive duplication (>25 members) with the “center of gravity” of duplication elsewhere in the genome.
We conclude that the genes flanking the Nav channel genes either did not duplicate or, if they did so, were not retained. This supports the hypothesis that the Nav channel gene duplications were retained as a result of selection.
Other Ion Channel Gene Families Did Not Duplicate or Were Not Retained
Voltage-dependent ion channels are fundamental to the electrical activity of the brain. We next asked whether there was a general expansion of other six transmembrane (6TM) voltage-dependent ion channel gene families during tetrapod evolution or whether this expansion was specific to the Nav channel gene family. We addressed this question using both published literature (
Saito and Shingai 2006;
Hoegg and Meyer 2007;
Jackson et al. 2007) and gene trees that we constructed with sequences from teleost, human, frog, and chicken databases. The channel that we investigated included the major depolarizing (Ca
2+, transient receptor potential [TRP], and hyperpolarization-activated cyclic nucleotide-gated ion channel [HCN]) and hyperpolarizing (Kv, ether-a-go-go related [ERG], and slo) channels. We sampled all 17 members of the voltage-dependent K
+ channel (Kv), 4 members of the HCN), 7 members of the canonical TRP, 6 members of the TRP, vanilloid sensitive (TRPV), 4 members of the TRP, melastatin (TRPM), 1 member of the TRP ankyrin repeat, 8 members of the ERG, 3 members of the large-conductance calcium-activated K
+ family (slo), and 10 members of the voltage-dependent calcium (
CACNA1 or Cav) gene families ().
Most (54 out of 60) of the ion channel genes in our sample showed no duplications following the teleost–tetrapod divergence. There is a mammalian-specific duplication in the slo gene family (
slo3). A number of TRP channels do not have teleost orthologs (TRPV1/TRPV2, TRPV3, TRPV5/TRPV6, TRPM6, and TRPM8). However, the absence of TRPM8 and TRPV3 in teleosts is likely due to losses in teleosts rather than duplications in amniotes (
Saito and Shingai 2006). TRPV5/TRPV6 are a pair that clearly duplicated in amniotes (
Saito and Shingai 2006). The timing of the TRPV1/TRPV2 duplication is not resolvable (
Saito and Shingai 2006), but it may represent a tetrapod-specific duplication. Even assuming that all apparent duplications in tetrapods are real rather than reflecting losses of teleost genes, Nav channel genes duplicated (2 out of 4: again, this analysis ignores repeated duplications of each ancestral paralog) significantly more than other ion channel genes (6 out of 60) (
P < 0.007, two-tailed chi-square).