This study presents an analysis of the extent of LD in Spanish Churra sheep using 43,784 SNPs distributed across the autosomal genome, although the draft stage of the version of the Ovine Assembly it is based on should be taken into account. Future refinements and updates in the physical maps can lead to changes in the estimations reported here. To enable comparison with previous studies in sheep and other domestic species, we estimated two pairwise statistics:
D’ and
r2.
D’ values were higher than those estimated for
r2. This might be because rare alleles and unobserved haplotypes tend to inflate
D’ but not
r2[
1].
Comparing the level of LD obtained in different studies is difficult because of different sample sizes, LD measures, marker types, marker densities and recent and historical population demographics [
4]. Previous reports in sheep based on microsatellite marker analysis have described LD as extensive (up to 20

cM) [
1,
19], although its magnitude and significance was shown to vary markedly between different breeds [
1]. The results reported for wild sheep [
21] also showed LD extended over long distances (half-length
r2 of 4.6

Mb), which contrasts with the short extension of LD reported here for Churra sheep (half-length
r2 value 2.5

Mb). A recent assessment of LD based on the analysis of 51,446 SNPs in a sample of Sarda rams showed a similar level of LD than in Churra sheep, with an average
r2 value over 1,000

kb of 0.072 [
40]. Compared with the results based on SNP genotyping described for cattle [
4,
14,
15,
41], LD estimates between syntenic and nonsyntenic loci in Churra sheep was two times lower. Initial results from the analysis of 74 domestic sheep breeds with the Illumina Ovine SNP50 BeadChip [
20] were in concordance with our findings, which suggests a relatively low level of LD in sheep and a substantially lower LD in sheep when compared with a wide range of cattle breeds, including dairy and beef cattle [
42]. This analysis also showed Churra sheep as one of the breeds with a more remarkable decay of LD with the distance between markers when compared with other breeds [
20]. Average
r2 between nonsyntenic SNP pairs provides an idea of the LD that can be expected by chance. None of the nonsyntenic SNP pairs tested showed a ‘high’ LD value (
r2
>

0.25).
Differences in LD between chromosomes have already been reported in Holstein cattle [
4,
15]. These can be attributed to recombination rates varying between and within chromosomes, heterozygosity, genetic drift and effects of selection [
4]. Our results for average LD within a chromosome are in concordance with the block structure across the genome. Chromosomes showing higher LD also have more and longer blocks than chromosomes with lower average LD. In Churra, 88% of the blocks contained just two SNPs. Preliminary results from the SheepHapMap project also identified an overall limited genome coverage in haplotype blocks (of at least three SNPs) for domestic breeds with Churra showing the lowest coverage (0.8%), whereas wild Soay sheep showed a large genome coverage (21.84%) [
20]. Compared with domestic sheep, the genomic distribution and coverage of the genome by haplotype blocks is higher in other species, such as cattle [
4,
43], as expected according to the higher LD between markers reported in these species.
Also within the framework of the SheepHapMap project genomic regions containing signals of selection have been identified across a wide range of sheep breeds [
44]. Higher homozygosity and LD is expected in regions that have undergone selection and are now fixed in the breed under study. Also, more and longer haplotype blocks are expected in those regions. Although there were haplotype blocks close to some of the regions related to selection, none of the high-Fst SNPs depicted by those authors [
44] were involved in blocks in Spanish Churra sheep. For example, the region of OAR10 containing 8 haploblocks in Churra sheep is close to the
polled locus (
RXFP2 gene), which is related to the presence of horns in sheep [
44]. However, none of the SNPs linked to the polled phenotype were included in the Churra haploblocks. The longest haplotype block across the genome found in this study, which involved 8 SNPs and was located in OAR2 (111.9 – 113.1 Mbp), comprises the
HERC2 gene, which has already been related to pigmentation in cattle [
45].
We also investigated the past effective population size (
Ne) in the Churra sheep commercial population under study. First historical references from the existence of Spanish Churra sheep date from Middle Ages (thirteenth century) approximately 800

years ago [
46]. Therefore, the time points chosen in this work were based on this historical information. The correlation between the results of the three different cM/Mb calculations was over 0.99. Major differences between the estimates based on the three different ratios are found at small distances, corresponding to more than 75 generations ago (Figure ). Changes in the effective population size reflect past events that occurred in the corresponding populations. In Spanish Churra, the
Ne value has been descending through time until the selection scheme began. From that point on, no major changes are found. Effective population size estimated 50 generations ago in Churra (
Ne
=

467 animals) is in agreement with the observations reported within the framework of the SheepHapMap project, where most of the sheep breeds displayed high
Ne, and only two populations showed a narrow genetic base comparable to the current
Ne of domestic cattle breeds (
Ne
<

150) [
44]. No other
Ne estimations have been reported so far in sheep. High selection pressure and the use of artificial insemination are the main reasons for the low Ne values obtained in cattle [
42]. To ensure an animal population is long-term viable, a threshold of
Ne
=

100 has been given [
47]. Our results of current effective population size (
Ne
=

128) are above the threshold, but care should be taken on this regard to ensure that the effective population size is maintained.
The LD estimates reported in this work can serve to assess the utility of the Ovine SNP50 Beadchip to address fine-mapping studies in Churra sheep. In cattle, McKay et al. [
48] showed that at a physical distance of 100

kb separating flanking SNP loci, the average
r2 was 0.15-0.2; considering a bovine genome length of 2.87 Gb, they concluded that 28,700 fully informative markers would be needed to saturate the cattle genome at an average resolution of 100

kb. Considering the lower value of LD estimates reported in this study, one can easily estimate that to obtain similar resolution in Churra sheep the marker density needs to be at least two times higher than the currently analyzed dataset. Hence, to implement genomic selection in this population with appropriate accuracy, a SNP array of higher density would be valuable. In this regard, following previous reported estimations [
2,
3] we can estimate how many SNPs will be needed to accurately estimate breeding values in Churra sheep. Considering a marker density of 20 SNPs per genome effective segment, which represents each independent chromosome segment [
3], a population with
Ne of 128 animals and a genome of 30 Morgans, the SNP chip should include approximately 95,000 SNPs (assuming the same percentage of successful genotyping obtained in this study) to improve the accuracy of genetic breeding values estimation.
Pedigree-based inbreeding calculations rely on the completeness and accuracy of the available pedigree. The results reported herein based on the available Churra pedigree showed 94% of the animals included in the analysis were ‘non-inbred’, although this is due to the lack of a deep pedigree. We obtained some negative values for inbreeding coefficients, which corresponded to animals with lower homozygosity than the average population. This could be because we estimated the allele frequencies from the currently genotyped population instead of the base population. Correlation between the different methods to estimate inbreeding ranged from 0.27 (
F1vs. F2) to 0.83 (
F2vs. F3), with
F1 as the most different. In general, values calculated using pedigree information were lower than those obtained through marker analysis. The latter could be inflated because we assumed a homogeneous population [
13], while there is a structure due to the experimental design of the linkage-based mapping studies for which the resource population had initially been selected. Comparing between the three marker-based methods, a different percentage of the analyzed animals showed an inbreeding value higher than the critical level (6.25%, obtained when mating cousins, [
49]). This proportion varied from 8.45% (
F1) to 1.6% (
F3), which is lower than that described for Finnsheep [
49]. This percentage was very high when analyzing results from 2002 (method
F1), 50% of the animals had an
F1
>

0.0625. Moreover, we were not able to compare between methods as most of the animals for
F2 and
F3 had negative values which might also explain the low correlation found between
F1 and
F2. In agreement with previous studies, the results presented here show that genomic data sets can provide useful information on a per sample basis in cases of complex genealogies or in the absence of genealogical data [
13].