Comparative mapping is a powerful tool to transfer genomic information from sequenced genomes to closely related species for which whole genome sequence data are not yet available. Such an approach was initially demonstrated by Fujiyama
et al. [
1] for the construction of the human-chimpanzee comparative map. In these closely related primate species, approximately 98% of chimpanzee BAC end sequences (BES) had significant BLAST hits to the human genome sequence allowing putative orthologues to be identified [
1]. A similar approach was used for the construction of the human-mouse comparative map [
2]. Subsequently, this approach was extensively used in mammals including construction of the human-cattle, the human-horse, and the human-porcine comparative maps [
3-
5]. Most recently, this approach was utilized one step further for the construction of the comparative genome contig (CGC)-based physical map of the sheep genome [
6], where CGC is established based on anchorage of the sheep BES onto the genome sequences of dog, cow, and human. These successes depended on high percentage of BLAST hits and/or high levels of genome collinearity.
Five teleost fish genomes have been fully sequenced
http://www.ensembl.org/index.html including zebrafish (
Danio rerio, from the order of Cypriniformes), Japanese pufferfish (
Fugu rubripes, from the order of Tetraodontiformes), green spotted pufferfish (
Tetraodon nigroviridis, from the order of Tetraodontiformes), medaka (
Oryzias latipes, from the order of Beloniformes), and three-spined stickleback (
Gasterosteus aculeatus, from the order of Gasterosteiformes), while whole genome sequencing is also underway for tilapia
http://www.cichidgenome.org;
http://www.broad.mit.edu/science/projects/mammals-models/vertebrates-invertebrates/tilapia/tilapia-genome-sequencing-project. The availability of these whole genome sequences lends great opportunities for comparative genome analysis. Recently, major genomic resources have been developed from a number of fish species such as Atlantic salmon (
Salmo salar) [
7-
9], rainbow trout (
Oncorhynchus mykiss) [
10,
11], tilapia [
12,
13], gilthead sea bream (
Sparus auratus) [
14-
17], European sea bass (
Dicentrarchus labrax) [
18,
19], and channel catfish (
Ictalurus punctatus) (for a review, see [
20,
21]).
Catfish is the major aquaculture species in the United States. It is one of the six species included in the U.S. National Animal Genome Project NRSP-8. A number of genome resources have been developed in catfish including a large number of molecular markers [
22-
25], genetic linkage maps [
26-
28], several hundred thousands of ESTs ([
29-
33]; Z. Liu, unpublished data), microarray platforms [
34-
38], BAC libraries [
39,
40], and BAC-based physical maps [
41,
42]. To enable BAC end sequence-based comparative genome analysis, we previously reported generation of 20,366 BES in catfish [
25]. In spite of the great value of those BES for the characterization of genome repeat structures [
43] and for the identification of microsatellite markers, our previous comparative genome analysis using BES revealed very limited conservation between the catfish and zebrafish genomes. Of the 141 mate-paired BES with genes on both ends of the BAC inserts, only 34 (24.1%) were found in nearby genomic locations in the zebrafish genome, suggesting high levels of chromosomal rearrangements [
40]. Such findings were in strong contrast to the situations found between medaka-sea bream,
Tetraodon-sea bream, medaka-stickleback,
Tetraodon-medaka, stickleback-sea bream,
Tetraodon-stickleback genome comparisons where almost complete genome collinearities were found [
44]. We speculated that our earlier inability to discover large extent of genome collinearity between catfish and zebrafish could be a result of the low numbers of BES and the lack of a physical map. Therefore, in this study, we extended our efforts in BAC end sequencing and generated additional 43,021 BES, bringing the total to 63,387 (25,676 mate-paired). Using these catfish BES and its BAC contig-based physical map [
42], genetic linkage mapping of BAC end-anchored microsatellites, and the genome sequence of zebrafish, here we conducted extensive comparative genome analysis. We report the identification of conserved syntenies and demonstrate the construction of super scaffolds of contigs by genetic linkage mapping of BAC end-associated microsatellites.