Our findings show that ROH are frequent across all breeds and that certain ROH length categories can be used as an indication of consanguinity. They can also inform on breed population history as the effects of population bottlenecks, selection pressure and breeding management on the bovine genome may potentially leave an imprint on ROH length.
The bovine HD SNP assay allows an analysis of ROH at similar density to that employed to generate genomic signatures of endogamy that differ markedly among human populations
[
1-
3]. Moreover, cattle allow a first comprehensive testing of the inference drawn from ROH through comparison with estimates of inbreeding from calculations based on the detailed pedigree data available for many breeds. The strong correlation between the pedigree inbreeding coefficient and sum of ROH of length

>

0.5 kb suggests that, in the absence of an animal’s pedigree data, the extent of a genome under ROH may be used to infer aspects of recent population history even from relatively few samples, as previously suggested by McQuillan et al. (2008). However, 44% of the variance in ROH distribution remains unexplained by pedigree inbreeding and may partly reflect the limitations of ancestry recording in cattle where founder animals are generally, and often inaccurately, assumed to be unrelated (McParland et al., 2007). Additionally, the propensity for multiple megabase scale ancestral haplotypes in certain genome regions to persist even in outbred animals, perhaps due to localised low levels of recombination and high levels of LD may contribute
[
14]. Lastly, we note that pedigree relatedness gives an expected, not actual, proportion of genomic identity by descent among individuals and it might be anticipated that genotype-based estimates provide greater accuracy on relatedness
[
15].
Whereas HD SNP panels facilitate more accurate detection of ROH, the vast majority of cattle SNP genotype data, and emerging data in other livestock, is available at ~50,000 SNP density. It is therefore of interest as to whether these sparser genotypic data can reliably inform on ROH and inbreeding history. We found that HD ROH were not accurately identified in such a reduced panel if between 0.5-1 Mb in length. However, the SNP50 density genotypes were sufficient to recognise almost all ROH >5 Mb but they also has the potential to inflate ROH length. Importantly, ROH levels at the lower SNP density correlate equally well with the pedigree estimates of inbreeding. We conclude that this prevalent marker density is appropriate in identifying ROH.
We used three published
[
7-
9] SNP50 genotype collections to examine patterns in ROH distribution and compare aspects of population history among a range of cattle breeds. The domestication process itself featured a limited sampling from the wild with a more recent bottleneck detectable 50–100 generations ago, presumably corresponding to breed formation
[
16]. However, this traditional breed formation is largely a European phenomenon and its absence is most apparent in the data from African cattle. These samples, including
B. taurus breeds, humped
B. Indicus breeds and indicine/taurine hybrids, tended toward low levels of ROH per genome, reflecting traditional management practices in Africa, characterised by less controlled mating
[
17].
An open village breeding system may also predispose to random consanguineous matings and many African breeds show outlying highly inbred individuals (Figure
). The length distribution of ROH can help to distinguish different types of parental relatedness. Samples from human populations where cousin marriage is common show an excess of long ROH; whereas for example, Papuan and Melanesian human populations show an excess of shorter ROH, consistent with effects of reduced population and isolation rather than first degree relative unions (Kirin et al. 2010). Figure
compares the contributions of long (> 20 Mb) and short ROH to breed homozygosity in order to differentiate the effects of ancient and more recent relatedness among ancestors. Here, the three African taurine breeds (Oulmes zaer, Somba and Lagune) with higher homozygosity clearly show a strong influence of ROH of length greater than 20 Mb and hence of recent inbreeding. We note that Gautier et al. (2009) reported a high FIS value, as well as extensive linkage disequilibrium within the Lagune breed.
Zebu-taurus hybridisation is also a dynamic and contemporary process within Africa
[
18-
20]. This acts to increase genetic diversity and contributes to the interruption of stretches of homozygous genotypes within individuals. The effects of this process are evident in three hybrid breeds which show the lowest extent of ROH in African breeds. These include the Kuri; where previous work has shown a near 50:50 genetic admixture between surrounding zebu and themselves
[
21], and the Sheko breed, where the original taurine African
Y chromosome is in danger of disappearing from this breed due to the use of zebu bulls
[
20].
Within European breeds, British breeds tended toward higher quantities of ROH, reflecting results of previous microsatellite research, where British Isles breeds had lower levels of observed heterozygosities and gene diversities in comparison to other Mediterranean and Northern European breeds analysed
[
22]. The Channel Island breeds showed strong influence from long ROH reflecting their unusually closed population histories due to strict importation restrictions on both the Jersey and Guernsey Islands implemented during the 1800 s
[
23].
The zebu breeds represented in this study have contrasting histories. The mainland African zebu breeds (Bororo and Fulani) which are products of ancient introductions from South Asia and are all hybrids to some extent, had much lower quantities of average ROH in comparison to the American zebu breeds analysed (Figure
). Within the American and Madagascan zebu populations a stronger homozygosity signal, with a weighting toward smaller length ROH, suggests that these breeds were initially established by small founding populations but were not particularly affected by recent inbreeding (Figure
). The initial introduction of the now prolific zebu animals in the Americas featured very limited numbers during the 19
th and 20
th century and Madagascan zebu were founded by ancient importations from Asia and East Africa which were probably limited in scope due to the isolation of the island
[
20,
24].
The ascertainment bias
[
8] towards European
Bos taurus breeds that is associated with the Bovine SNP50 genotyping chip, does not seem to invalidate the trends in ROH levels observed here, as ROH levels were in fact higher in those breeds with a higher number of polymorphic SNPs, as validated by Illumina ( Additional file
7). Also, the existence of long ROH (>20 Mb) for example in many of the less polymorphic African village breeds (Oulmes zaer, Somba and Lagune) are unlikely to be artefactual due to the vanishingly small probability of long contiguous homozygous SNPs occurring by chance. However, some bias may exist in the
Bos Indicus ROH levels, as an over estimation of ROH amounts is possible due to low amount of polymorphic markers found in these breeds due to the design of the genotyping chip
[
8], as a result some caution must be taken when inferring ROH levels within these breeds. The bovine HD genotyping chip was designed from a more comprehensive range of breeds comprising several temperate and tropically adapted
Bos taurus, Bos indicus and hybrid breeds and thus does not exhibit the same level of ascertainment bias
[
25].
The Hapmap population data also allow comparison with an alternative inference of past population size. Linkage disequilibrium may be used to infer past population size where higher r
2 indicates lower effective population size with LD at longer genetic distances corresponding to younger time depths
[
26,
27]. Interestingly, the Hapmap breed samples analysed here show a strikingly similar ranking in LD at distances of >200 kb to that which they show in average ROH
[
28].
Analyses of human ROH have previously established a correlation between extensive LD, locally low rates of recombination and high incidence of homozygous runs
[
2]. Intensive selection intensity in cattle has possibly acted to maintain long lengths of homozygous tracts. Previous work carried out in over 500 animals from 8 breeds noted that high levels of LD, particularly in the Holstein breed, existed on chromosomes 14 and 16, the two chromosomes with highest proportions of ROH in our study
[
14]. Conversely, chromosome 12 was found to have higher than average recombination rates and lower levels of LD (r
2 <0.2) than the majority of chromosomes
[
14] and, interestingly, showed the highest proportion of SNPs uninvolved in a ROH within our sample population. The existence of recombination hotspots throughout the genome also can impact ROH, with multiple genomic regions that remained uninvolved in any ROH such as those on chromosomes 12 and 23 found to be well documented human and cattle recombination hotspots
[
29-
31].
The existence of QTL in ROH have been well documented in human studies
[
5,
6,
32]. Here, several of the highly involved genomic regions located on chromosomes 7, 14, 16 and 18 (Figure
) all potentially contain genes of importance in cattle with associations ranging from immunity through to carcass and dystocia related traits
[
33-
35] when explored using three QTL databases available online (
http://genomes.sapac.edu.au/bovineqtl/index.html, a
http://www.animalgenome.org/QTLdb/cattle.html,
http://www.ncbi.nlm.nih.gov/). In particular, Chromosomes 9 and 5, which had the highest amount of long ROHs (>20 Mb), are well documented to contain QTL pertaining to milk fat yield and weight related traits respectively
[
36-
39].