The Rosaceae is one of the most economically important and morphologically diverse plant families with over 90 genera containing more than 3000 species. The family contains three sub-families; the Dryadoideae, the Rosoideae and the Spireaeoideae, with the economically-important genera
Prunus and
Malus contained within the Spireaeoideae, whilst
Fragaria is a member of the Rosoideae [
1]. The base chromosome number of the many genera within the family ranges from
x = 7 to
x = 17, and recent research has suggested that the ancestral chromosome number for Rosaceae may have been
x = 9 [
2,
3]. As in many other plant families, comparative genomics will enhance our understanding of genome structure and function and the evolutionary forces that have led to the current chromosomal configurations of the numerous Rosaceous species, and in turn to the mechanisms responsible for the wealth of morphological diversity encompassed by the family. An understanding of the degree of conservation of genome structure and function between related genera will enable inferences to be made about the genomic positions of genes controlling common traits among genera and permit information gained in one species to inform investigations in another.
The recent availability of whole genome sequences has permitted the delineation of syntenic blocks at high resolution and from this the evolutionary history in plant lineages can be inferred. In the grasses, paleogenomic modeling, using sequences of the maize, rice, and sorghum genomes as well as large sets of genetically mapped genes in wheat and barley, led to the proposal of an ancestral grass karyotype for the five ancestral chromosomes [
4,
5] from which all modern grass genomes evolved. The recent sequencing of the
Brachypodium genome [
6] revealed a whole-genome paleo-duplication in
Brachypodium chromosomes, whilst comparisons of the
Brachypodium, rice and sorghum genome sequences revealed orthologous relationships that were consistent with the evolution of the extant
Brachypodium genome from an ancestral genome containing five chromosomes.
Similarly, in the dicots, whole genome sequencing has revealed patterns of genome evolution that it had not been possible to detect using comparative mapping of orthologous markers. The sequencing of the grapevine genome [
7] and its comparison to the genomes of
Arabidopsis and poplar permitted the identification of a paleo-hexaploidisation event in the common lineage of the three species which occurred after the monocotyledonous and dicotyledenous plant lineages diverged. This hexaploidisation event had not previously been identified, despite the whole genome sequences of
Arabidopsis and poplar being available for some time [
8,
9]. This was primarily due to the subsequent polyploidisation events that had occurred in the genomes of these species (once in the case of poplar, and twice in the case of
Arabidopsis) since they diverged from a common ancestor. Thus, analyses based on higher levels of resolution, particularly those based on whole genome sequence data, reveal evermore complex patterns of genome evolution between species, but at the same time provide compelling evidence to support models of genome evolution and deduced ancestral chromosomal configurations.
So far no studies have been performed that have compared whole genome sequences of plant species that belong to different genera of the same family. In Rosaceae, as well as in other economically important plant families including Poaceae, Solanaceae, Brassicaceae and Fabaceae [
10-
14], the comparative genomics studies have been performed using conserved genetic markers. Dirlewanger et al [
15] first identified high levels of conservation of marker presence and order between three of the eight linkage groups of the
Prunus reference map [
16], and seven of the 17 linkage groups of the apple map [
17], demonstrating that markers mapping to a single
Prunus linkage group were located on two homeologous linkage groups on the
Malus linkage map and that large conserved syntenic blocks were clearly identifiable within the two genera. A number of other studies were also performed using PCR-based markers that had been developed from both
Malus and
Fragaria, which were applied to comparative mapping between
Prunus and these other members of the Rosaceae [
18,
19]. High level of co-linearity within the sub-family Maloideae between the genomes of
Malus and
Pyrus has also shown by comparative mapping using simple sequence repeat (SSR) markers [
20]. Vilanova et al [
2] reported a genome-wide inter-generic comparison of genetically mapped orthologous markers between diploid
Fragaria and
Prunus showing sufficiently well conserved macro-synteny to enable the reconstruction of a hypothetical ancestral genome for Rosaceae containing nine chromosomes. The study however also revealed a number of large-scale chromosomal rearrangements, including translocations of large syntenic blocks and numerous fusion-fission events that had occurred in the evolutionary history of the two genera. More recently, using the whole genome sequence from the apple cultivar 'Golden Delicious' [
21] and sequence data from 1,473 markers mapped in
Prunus and
Fragaria, including Rosaceous conserved orthologous sequences (RosCOS) [
22], Illa et al [
3] performed a genome-wide comparison between all three genera. Analyses based on the positions of the 129 markers revealed clear, conserved, syntenic blocks that were common to all three genomes, with a single syntenic block in
Prunus corresponding to one or two syntenic regions in
Fragaria, and two or four syntenic regions in apple. Illa et al [
3] reconstructed a hypothetical ancestral genome for the Rosaceae containing nine chromosomes (
x = 9), consistent with the report of Vilanova et al [
2]. The data suggested that the resolution of studies based on modest numbers of markers was perhaps not sufficient to elucidate the true number of small scale genomic inversions that have taken place in genome evolution within the Rosaceae, which may have played an important role in speciation within the family. Thus, an evaluation of the conservation of synteny between
Fragaria,
Malus and
Prunus based on whole genome sequence data may reveal much about sequence evolution in this closely-related, yet morphologically diverse family that has been hitherto undetected.
The genomes of three Rosaceous genera of significant economic importance,
Fragaria [
23],
Malus [
21] and
Prunus [
24] have recently been sequenced, presenting an exciting opportunity for high-resolution genome comparison. Here we report results from comparison of whole genome sequences of the three species of Rosaceae and the genome of
Vitis vinifera, included as an outgroup species representing a basal rosid genome. We were able to identify the orthologous regions among the three Rosaceous species at a much higher-resolution than has previously been reported. This higher-resolution enabled us to detect different patterns of genome evolution between the sub-families of Rosaceae. Furthermore, we reconstructed a hypothetical Rosaceae ancestral genome using the Multiple Genome Rearrangements and Ancestors (MGRA) algorithm and further manual analyses.