There is a great deal of interest in mammalian communication by pheromones, and much attention has been focused on rodents. This has been especially true of the house mouse, Mus musculus, where at least three large gene families encoding proteinaceous pheromones have been described: androgen-binding proteins (ABPs), exocrine gland–secreting peptides (ESPs), and major urinary proteins (MUPs).
ABPs have been shown to mediate assortative mate selection, based on subspecies recognition that potentially limits gene exchange between subspecies where they meet (
Laukaitis et al. 1997;
Talley et al. 2001). There is evidence that ABP-mediated mate preference across a transect of the European mouse hybrid zone is a case of reproductive character displacement as predicted by reinforcement (
Bimova et al. 2005).
ESPs constitute a newly described family of mouse proteinaceous pheromones (
Kimoto et al. 2005). Female mice respond to direct facial exposure to an ESP expressed in male exorbital lacrimal glands and released into tear fluid by upregulating
c-Fos and
egr1 gene expression in vomeronasal sensory neurons (
Kimoto et al. 2007). The same response occurs after close contact with the face or bedding of male mice, and a recombinant ESP protein stimulates electrical activity in an isolated female vomeronasal organ. The male response to similar signals is unremarkable (
Kimoto et al. 2005,
2007).
The MUPs are a family of lipocalins shown to mediate female recognition of potential mates (for a review, see
Hurst [2009]). For the most part,
MUP genes are expressed in liver and the products are passed through the kidneys into the urine. Each adult mouse expresses a pattern of 8–14 different MUP isoforms in its urine, which is determined by its genotype and by its sex because some
MUP genes show sex-limited expression (
Hurst 2009). This individual recognition profile has been likened to a protein “bar code” (
Robertson et al. 1996;
Beynon and Hurst 2003;
Armstrong et al. 2005;
Cheetham et al. 2007;
Logan et al. 2008). MUPs have also been implicated in male–male aggression.
Chamero et al. (2007) isolated high molecular weight components of male urine that activated dissociated vomeronasal neurons and were sufficient to cause male–male aggressive behavior when painted onto previously castrated males.
The major histocompatibility complex (MHC) has also been implicated in a mechanism that reduces inbreeding by mediating assortative mate selection based on MHC genotype. It has been suggested that nine-amino acid peptide ligands bind specifically to different MHC proteins and that these ligands are released when the MHC proteins break down (for a review, see
Hurst [2009]). The released ligands are filtered through the kidneys and secreted into the urine where they act as pheromones communicating information about the MHC genotype. Some researchers have pointed to difficulties with this model (
Hurst 2009), and to date, the genetic basis for the putative peptide pheromones is unknown. Thus, we cannot include the MHC pheromone system in the genomic comparison reported here.
All three families of proteinaceous pheromone genes,
Abps,
ESPs, and
MUPs, are encoded by large gene clusters in different regions of the mouse genome. The structure of the
Abp linkage group is complex in part because the ABP protein is a dimer composed of an alpha subunit disulfide bridged to a beta–gamma subunit, and so two different subunit genes are required to make the subunits for a functional dimer. The
Abp gene region is comprised of 64 paralogs mapping in a cluster on chromosome 7 (
Laukaitis et al. 2008). Of these, 30 are
Abpa genes, which encode ABP alpha subunits, and 34 are
Abpbg genes, which encode the beta–gamma subunits. The majority of
Abp genes occur in 27 alpha/beta–gamma pairs in a 5′-5’ orientation and numbered 1–27 (
Laukaitis et al. 2008). We designate these pairs <
Abpa-Abpbg> or <
Abpbg-Abpa> modules, where the arrowheads point in the 3’ direction.
A group of at least 38
ESP genes, ten of which are putative pseudogenes, map in a cluster on chromosome 17 (
Kimoto et al. 2007). The expressed genes encode the monomeric ESP pheromones. The gene family encoding the monomeric MUPs consists of at least 40–42 genes (there are still gaps in this region of the mouse genome), half of which are pseudogenes. All of these map in a cluster on chromosome 4 (
Logan et al. 2008;
Mudge et al. 2008). As in the case of the
Abp region, this cluster of genes is relatively complex, in that the center of the linkage group consists of at least 15 gene pairs, each containing a gene and a pseudogene in 5′-5’ orientation (
Logan et al. 2008).
Each of these gene families,
Abp, ESP, and
MUP, has expanded dramatically during house mouse evolutionary history (
Kimoto et al. 2007;
Laukaitis et al. 2008;
Logan et al. 2008), and two questions that fascinate us are: What mechanism(s) is responsible for the rapid expansions of these families of pheromone genes? Has the same mechanism caused the expansion of all three gene families?
We suspect that the process responsible for this gene family expansion may have resulted in copy number variation (CNV) through recent gene birth (duplication) and death (deletion). It is also possible that some events in
Abp evolutionary history, and potentially in the evolutionary histories of the other two families, have been obscured by gene conversion. In undertaking this study, it was our objective to determine the mechanism(s) by which duplication has produced the complex family of
Abp genes (
Emes et al. 2004 and
Laukaitis et al. 2008). We also compared the
ESP and
MUP gene regions studied by others for evidence of the same mechanisms operating in the
Abp region. We evaluated the three pheromone gene regions for signs of duplication as low copy repeats (LCRs;
Lupski 1998;
Stankiewicz and Lupski 2002) that have duplicated by nonallelic homologous recombination (NAHR;
Shaffer and Lupski 2000;
Stankiewicz and Lupski 2002), and we evaluated volatility in all three regions, using the Mouse Paralogy Browser (
She et al. 2008). One of them, the
Abp region, shows substantial variation among a number of mouse strains. We especially noted an extensive history of these differences in wild-derived
M. m. musculus strains, in classical inbred strains with part or all of an
M. m. musculus Abp region, and in one wild-derived
M. m. castaneus strain. No particularly remarkable volatility was observed in the other two gene families, and we discuss the significance of this in light of the various roles proposed for the three families of mouse proteinaceous pheromones.