Linkage disequilibrium and within-breed genetic diversity in Iranian Zandi sheep

Ghoreishifar, Seyed Mohammad; Moradi-Shahrbabak, Hossein; Parna, Nahid; Davoudi, Pourya; Khansefid, Majid

doi:https://doi.org/10.5194/aab-62-143-2019

Articles | Volume 62, issue 1

https://doi.org/10.5194/aab-62-143-2019

Articles | Volume 62, issue 1

Original study

02 Apr 2019

Original study |

| 02 Apr 2019

Linkage disequilibrium and within-breed genetic diversity in Iranian Zandi sheep

Seyed Mohammad Ghoreishifar, Hossein Moradi-Shahrbabak, Nahid Parna, Pourya Davoudi, and Majid Khansefid

Abstract

This research aimed to measure the extent of linkage disequilibrium (LD), effective population size (N_e), and runs of homozygosity (ROHs) in one of the major Iranian sheep breeds (Zandi) using 96 samples genotyped with Illumina Ovine SNP50 BeadChip. The amount of LD (r²) for single-nucleotide polymorphism (SNP) pairs in short distances (10–20 kb) was 0.21±0.25 but rapidly decreased to 0.10±0.16 by increasing the distance between SNP pairs (40–60 kb). The N_e of Zandi sheep in past (approximately 3500 generations ago) and recent (five generations ago) populations was estimated to be 6475 and 122, respectively. The ROH-based inbreeding was 0.023. We found 558 ROH regions, of which 37 % were relatively long (> 10 Mb). Compared with the rate of LD reduction in other species (e.g., cattle and pigs), in Zandi, it was reduced more rapidly by increasing the distance between SNP pairs. According to the LD pattern and high genetic diversity of Zandi sheep, we need to use an SNP panel with a higher density than Illumina Ovine SNP50 BeadChip for genomic selection and genome-wide association studies in this breed.

Download & links

Article (PDF, 1160 KB)

Supplement (815 KB)

Download & links

Article (1160 KB)
Supplement (815 KB)
BibTeX
EndNote

How to cite

How to cite.

Dates

Received: 30 Aug 2018 – Revised: 05 Mar 2019 – Accepted: 08 Mar 2019 – Published: 02 Apr 2019

1 Introduction

The population of sheep in Iran, consisting of many different breeds, is estimated to be approximately 50 million, which constitutes the most important economic area in the livestock industry of the country (Moradi et al., 2012). According to Rashidi et al. (2008), the sheep industry contributes about 35 % of the meat production in Iran. As the supply of meat production from sheep does not satisfy the demand, a breeding plan seems reasonable to enhance the profitability of sheep production by improving such traits as litter size, body conformation, lamb weight, and other economically important traits (Ghafouri-Kesbi et al., 2008). The Zandi breed is one of the major, indigenous, fat-tailed sheep breeds of Iran that has adapted to survive in harsh weather conditions and poor and mountainous pastures in the central regions of the country (Mohammadi et al., 2013). Probably because of its large population in Iran (about 2 million heads), there is much interest in genetically improving Zandi sheep for growth traits (Ghafouri-Kesbi et al., 2008; Mohammadi et al., 2013).

The adoption of such new technologies as genomic selection can play an essential role in satisfying the demand for meat production. One advantage of genomic over traditional selection methods is that selection candidates can be evaluated more accurately at very early ages, which increases the rate of genetic gain (Hayes et al., 2013). Also, the accuracy of genomic selection is regularly higher than that of traditional pedigree-based selection (Goddard and Hayes, 2011). So, there is potential to increase genetic gain in Zandi sheep by adopting genomic selection technology; however, the success of genomic selection relies on the level of genome-wide linkage disequilibrium (LD) between markers and causal mutations (Hayes et al., 2013). The nonrandom association between alleles of different variants throughout the genome in the population is known as LD. The main reason for this association, or correlation between alleles of different markers, is their physical closeness. However, some other factors, such as genetic drift, migration, and, more importantly, either artificial or natural selection, also affect the LD and its pattern (Wang, 2005). Measuring the amount of genome-wide LD in a population can help us estimate the number of single-nucleotide polymorphism (SNP) markers required to have robust genomic evaluations (Hayes and Goddard, 2001) and genome-wide association studies (GWASs) (Carlson et al., 2004).

The extent of genome-wide LD can also help us estimate past effective population sizes (N_e) and rates of recombination in a population. Information on past N_e is critical to define optimal selection pressures (Rexroad and Vallejo, 2009) for obtaining breeding goals while sustaining acceptable levels of genetic heterogeneity in breeding populations (Scraggs et al., 2014). Reduction in genetic diversity can reduce the profitability of animal production, which is often referred to as the “inbreeding effect” (Leroy, 2014; Notter, 1999). A meta-analysis of 57 studies on seven livestock species and a wide range of traits showed that approximately 0.14 % of the mean of the traits decreased for every 1 % increase in inbreeding (Leroy, 2014). The inbreeding coefficient can be investigated by studying the length of identical haplotype segments inherited from parents to progeny. In such investigations, the successive homozygous genotypes in the progeny are known as runs of homozygosity (ROHs). Hence, the aims of this research were to (1) measure the amount and pattern of LD, (2) examine genetic diversity by measuring N_e and ROH-based inbreeding, and (3) estimate the minimum number of SNPs required for robust genomic selection and GWASs in Zandi sheep.

2 Materials and methods

2.1 Sample collection and quality control

Blood samples were taken from 99 Zandi sheep at the Khojir sheep breeding station in Iran. All experimental procedures were reviewed and approved by the research ethics committee of the University of Tehran, College of Agriculture and Natural Resources. The genomic DNA was extracted using the standard salting-out protocol (Helms, 1990). The samples were genotyped using Illumina Ovine SNP50 BeadChip for 54 241 SNP markers; the version of the ovine assembly used in our research was Ovis_aries_v4.0 obtained from the SNPchiMP v3 database (Nicolazzi et al., 2015). The filtering process was completed using PLINK V1.9 (Chang et al., 2015). Of 99 genotyped animals, three samples were excluded due to low genotype quality (missing genotypes > 10 %), and the remaining 96 samples had a genotyping call rate > 98.2 %. SNPs were removed due to (1) having an unknown map position (n=387), (2) being located on sex chromosomes (n=1449), (3) having an SNP missing rate of more than 10 % (n=948), (4) deviating from Hardy–Weinberg equilibrium (HWE) with a p value < 10⁻⁶ (n=132), and (5) having a minor allele frequency (MAF) of less than 0.01 (n=2068). Then, using a genomic relationship matrix (VanRaden, 2008), we ran a principal component analysis (PCA) to explore the covariance between animals.

2.2 LD calculation

LD was calculated by r² (Hill and Robertson, 1968) for two loci (A and B) with two alleles (allele 1 and allele 2) using Eq. (1):

\begin{matrix} (1) & r^{2} = \frac{\{[f (A_{1} B_{1}) f (A_{2} B_{2})] - [f (A_{1} B_{2}) f (A_{2} B_{1})]\}^{2}}{f (A_{1}) f (B_{1}) f (A_{2}) f (B_{2})}, \end{matrix}

where f(A₁B₁), f(A₁B₂), f(A₂B₁), and f(A₂B₂) are the frequencies of haplotypes A₁B₁, A₁B₂, A₂B₁, and A₂B₂; f(A₁), f(A₂), f(B₁), and f(B₂) are the frequencies of the first and second allele of loci A and B. The expected frequency for haplotype A₁B₁ is f(A₁)f(B₁). Thus, the difference between the observed and expected frequency of a given haplotype indicates the LD between the two markers.

To measure LD, haplotypes were inferred using Beagle v3.3.2 (Browning and Browning, 2007), separately for each ovine autosome. Missing genotypes after quality control (0.018 %) were also imputed during haplotype phasing. The phased haplotypes were read into HAPLOVIEW v4.2 (Barrett et al., 2004) to calculate LD. For each autosome, r² was computed for all pairs of SNPs located up to 5 Mb apart. The pairwise LDs calculated by HAPLOVIEW were assigned into different classes according to pairwise distances (i.e., 0–10, 10–20, 20–40, 40–60, 60–100, 100–200, and 200–500 kb; 0.5–1.0, 1.0–2.0, and 2.0–5.0 Mb), and the average r² was calculated in each category using R software (R Core Team, 2014).

2.3 Estimation of N_e

The N_e was estimated using SNeP v1.1 (Barbato et al., 2015) using Eq. (2) (Corbin et al., 2012):

\begin{matrix} (2) & N_{e (t)} = [\frac{1}{4 f (c_{t})}] [\frac{1}{E (r_{adjusted}^{2} | c_{t})} - α], \end{matrix}

where N_e(t) is an estimation of N_e t generations ago; c_t is the pairwise genomic physical distance showing the recombination rate t generations ago (i.e., $t = 1 / 2 c$ ); and α is a constant in the equation to correct for the occurrence of mutations (if required). However, instead of assuming 1 cM =1 Mb, a recombination rate modifier in Eq. (3) was used to calculate c (Sved, 1971):

\begin{matrix} (3) & c = d \frac{1 - (d / 2)}{(1 - d)^{2}}, \end{matrix}

where d is the linkage at distance c, which can be estimated using r² adjusted for sample size (adjustment required if the sample size is small). In this study, the default parameters of “1” and “no correction” were used for α and sample size adjustment, respectively.

2.4 ROH mapping

ROHs in the genotyped sheep were found using PLINK V1.9 according to the following criteria. Each ROH had at least one SNP per 100 kb and a minimum length of 4 Mb. The sliding window under examination was allowed to contain up to one heterozygous SNP, and the minimum number of SNPs for a given ROH was determined using Eq. (4) (Lencz et al., 2007):

\begin{matrix} (4) & l = \frac{\log_{e} \frac{α}{n_{i} n_{s}}}{\log_{e} (1 - het)}, \end{matrix}

where n_s and n_i represent the number of genotyped SNPs per animal and the number of animals, respectively; α is the percentage of false positive ROHs (set to 0.05 in the current study), and “het” is the mean SNP heterozygosity across all SNPs. Using R software (R Core Team, 2014), the detected ROH was assigned into different length categories, including 4–10, 10–20, 20–30, and > 30 Mb, and the frequency of different ROH lengths was calculated. Moreover, ROH-based inbreeding (F_ROH) for each individual was calculated using Eq. (5) (McQuillan et al., 2008):

\begin{matrix} (5) & F_{{ROH}_{i}} = \frac{\sum_{j = 1}^{n} L_{{ROH}_{j}}}{L_{Genome}} \end{matrix}

where $F_{{ROH}_{i}}$ is the F_ROH (inbreeding) for the ith animal; n and $L_{{ROH}_{j}}$ are the total number of ROHs and the length of the jth ROH for the ith animal, respectively; and L_Genome is the genome length covered by the SNP markers (i.e., 2645.2 Mb in our study).

3 Results

3.1 Descriptive statistics

After quality control, the final dataset consisted of 96 animals genotyped for 49 257 SNPs covering 2.6452 Gb of the ovine genome. The average distance between adjacent SNPs was 53.73±55.60 kb, and the average MAF was 0.28±0.01. The number of SNPs varied on ovine autosomes, with OAR1 as the longest and OAR24 as the shortest ovine autosome containing 11.3 % (n=5563) and 1.4 % (n=702) of SNPs, respectively. The average inter-marker distance was the shortest in OAR9 (49.9 kb) and the longest in OAR21 (65.2 kb). Further details for the SNPs located on the same chromosome are presented in Table 1. The distribution of MAF for the SNPs that passed the quality control process is shown as a histogram in Supplement Fig. S1; approximately 48 % of the SNPs had MAF ≥0.30 and approximately 5.1 % of the SNPs had MAF between 0.01 and 0.05. As illustrated in Fig. 1, the relationship between the first two principal components described only 10.5 % of the total variation among samples.

Table 1Summary statistics of single-nucleotide polymorphism (SNP) markers for each Ovine autosome.

^* Ovis aries autosome.

Download Print Version | Download XLSX

3.2 Extent of genome-wide LD

The reduction in LD (r²) by increasing the pairwise distance between SNPs is presented in Table 2. The average r² was 0.26±0.30 when the pairwise distance between SNPs was < 10 kb, but it rapidly decreased to 0.21±0.25 when the distance increased to 10–20 kb. At the pairwise distance of 40–60 kb, which was close to the average inter-marker space in this study, the average r² was only 0.10±0.16. The percentage of pairwise SNPs with high LD (r²≥0.20) up to 10 kb apart was 39 %, which decreased to 16 % for the SNP pairs with 40–60 kb of distance (Table 2). The average LD between SNPs with up to 5 Mb of distance (4 481 704 SNP pairs) is presented for all 26 ovine autosomes (Table S1 in the Supplement). Moreover, r² was calculated for only the adjacent SNP pairs (0.13±0.19), of which up to 51 % of SNP pairs had r²<0.05 and only 19.1 % had r²>0.2 (Fig. S2).

https://www.arch-anim-breed.net/62/143/2019/aab-62-143-2019-f01

Figure 1PCA plot based on a genomic relationship matrix. The first two principal components (PCs) and the variance explained by each component is shown on the corresponding axis. The amount of variance explained by the first two components is relatively small (10.5 %), representing a great deal of genetic diversity among samples.

Download

Table 2Summary statistics of average of r² calculated for each distance category.

Download Print Version | Download XLSX

3.3 N_e

The historical and recent N_e values were estimated using average r² calculated in different pairwise distances between SNPs. The N_e of approximately 3500 generations ago was estimated to be 6475, which decreased to approximately 3000 about 500 generations ago and then continued to decline more rapidly until recent generations. We estimated that the N_e for Zandi sheep five generations ago was 122 (Fig. 2). A plot magnifying the changes in N_e of more recent generations is also illustrated in Fig. S3.

https://www.arch-anim-breed.net/62/143/2019/aab-62-143-2019-f02

Figure 2Estimation of effective population size (N_e) of Zandi sheep using genome-wide linkage disequilibrium (r²) between SNPs. N_e was estimated using average r² between markers at different distances by SNeP software. The N_e in the past (approximately 3500 generations ago) was 6475, which decreased to 3000 approximately 500 generations ago and then continued to decrease more rapidly until recent generations. The N_e of recent populations (up to five generations ago) was estimated to be 122.

Download

3.4 ROHs and ROH-based inbreeding (F_ROH)

In total, 558 ROHs were detected for 89 sheep with an average length of 10.77±8.36 Mb (Fig. 3), and no ROH was found for the 7 remaining sheep (Table S2). The ROH segments with a length of 4–10 Mb were the most frequent (62 %), and those with a length of more than 30 Mb were the least frequent (3.2 %) (Fig. 3). The average number and median of ROHs detected in the 89 sheep were 6.3±4.2 and 5, respectively. The maximum number of ROHs found in one animal was 19. The longest ROH (72.4 Mb) consisted of 1425 SNPs, and the shortest ROH (4.02 Mb) consisted of 64 SNPs (Table S2).

https://www.arch-anim-breed.net/62/143/2019/aab-62-143-2019-f03

Figure 3Frequency and proportion of detected ROHs with different lengths (Mb). In total, 558 ROHs (with average length of 10.77±8.36 Mb) were detected in 89 sheep, and in 7 remaining sheep no ROHs were found. The ROH segments with the length of 4–10 Mb were the most frequent (62.2 %), and those with the length of > 30 Mb were the least frequent (3.2 %).

Download

https://www.arch-anim-breed.net/62/143/2019/aab-62-143-2019-f04

Figure 4Number of detected ROHs and percent of coverage per autosome. In total, 558 ROHs were detected. For each autosome, the average length of a ROH (for the sheep with at least one ROH) was divided by the length of the chromosome to calculate the percentage of each autosome covered by ROHs. OAR1 with 60 ROHs (average length of 12.9±8.8 Mb) had the highest number of ROHs, and OAR14, OAR16, and OAR24 with 9 ROHs (average length of 10.3±5.9 Mb) had the lowest number of ROHs.

Download

In ovine autosomes of Zandi sheep in our study, OAR1 had 60 ROHs, but OAR14, 16, and 24 had only 9 ROHs (Fig. 4). Hence, these chromosomes had the most and the minimum number of ROHs, respectively. We found a strong correlation (Pearson's correlation coefficient +0.95) between the number of ROHs and chromosome length and also between percentage of coverage and chromosome length (Pearson's correlation coefficient −0.79) (see Table 1 and Fig. 4). The average of ROH-based inbreeding (F_ROH) for Zandi sheep with and without including the seven animals that had no ROHs was 0.023 and 0.026, respectively. The four animals with the highest degree of homozygosity indicated 302.4, 275.1, 234.9, and 178.4 Mb of their genome classified as ROHs, covering close to 9 % of their total genome length (Table S2).

4 Discussion

One of the main factors affecting LD measurement is population structure. Overestimation of LD occurs when samples are close relatives; in fact, they share longer haplotypes than distant relatives (Ghoreishifar et al., 2018; Gusev et al., 2011). In the Zandi breed, LD remained at a moderate level ( $r^{2} = 0.26 \pm 0.30$ ) only up to 10 kb and decreased rapidly to 0.10±0.16 when the average SNP pair distance was 40–60 kb. This pattern of LD decay illustrates that short-length haplotypes are prevalent in the population, which means that samples are not close relatives. Also, only 10.5 % of the total variation is explained by the first two principal components, representing a great deal of diversity among samples.

The average of r² varied across autosomes (ranging from 0.16 in OAR26 to 0.34 in OAR10 for the pairwise distance of < 10 kb), which is in line with previous reports in sheep (Liu et al., 2017; Prieur et al., 2017; Zhao et al., 2014), beef cattle (Bohmanova et al., 2010; Edea et al., 2015), and dairy cattle (Qanbari et al., 2010). This phenomenon can be due to variation in the recombination rate in different autosomes, natural or artificial selection, and genetic drift (Liu et al., 2017; Mastrangelo et al., 2017; Qanbari et al., 2010). Moreover, the variation in r² estimated for different chromosomes was higher in short SNP pair distances (e.g., SD =0.048 in 0–10 kb vs. SD =0.001 in 4.0–5.0 Mb), which is in line with the results reported by Liu et al. (2017).

The r² we calculated for the Zandi breed was close to the amount of LD reported in some other sheep breeds. For example, in Churra sheep, the average r² for the pairwise space of 0–10 kb and 0.5–1.0 Mb was 0.33 and 0.05, respectively (García-Gámez et al., 2012). In Chinese Merino sheep, the average r² for the same marker intervals was 0.25 and 0.02, respectively (Liu et al., 2017). However, in Barbaresca sheep, with a small sample size, the average r² for the inter-marker distance of 0.5–1.0 Mb was 0.12 (Mastrangelo et al., 2017). The average r² for SNPs within 10 kb of distance for Australian Border Leicester, Poll Dorset, and Merino sheep breeds were 0.34, 0.33, and 0.27, respectively (Al-Mamun et al., 2015). Thus, the variation in the reported r² in different breeds suggests that the characteristics of LD are highly breed-specific in sheep.

The power of quantitative trait locus (QTL) mapping and the accuracy of genomic predictions largely depend on the amount of LD between SNPs and quantitative trait nucleotides (QTNs) (Hayes et al., 2013). Generally, when the extent of LD between markers decreases rapidly by increasing the inter-marker distances, the number of markers required for a robust genomic study should be kept high. Hayes and Goddard (2001) demonstrated that the accuracy of genomic predictions for dairy cattle reached 85 % when the average r² between adjacent markers was ≥0.2. However, higher LD between markers (r²≥0.3) might be required for precise QTL mapping (Corbin et al., 2012). In our study, the average r² was 0.21±0.25 when the distance between pairwise SNPs was 10–20 kb. However, the average inter-marker distance was 53.73±55.60 kb, and the average r² for only adjacent SNPs was 0.13±0.19 (19 % of adjacent SNPs had r²>0.2). Hence, our findings support the necessity of denser SNP panels for a successful genomic selection scheme in Zandi sheep.

The level of LD in Zandi sheep was less than it is in cattle (Biegelmeyer et al., 2016; Bohmanova et al., 2010; Jasielczuk et al., 2016) and pigs (Grossi et al., 2017) but close to the level of LD reported for other sheep breeds (Al-Mamun et al., 2015; García-Gámez et al., 2012; Liu et al., 2017). For example, Qanbari et al. (2010) reported r² of 0.20±0.24 for SNPs within 50–75 kb in German Holstein dairy cattle. Moreover, the average r² for inter-marker distance < 10 kb in Hereford and Bradford beef cattle breeds was reported at 0.49 and 0.43, respectively (Biegelmeyer et al., 2016). We found that for a very short inter-marker distance (< 10 kb), the calculated r² in Zandi sheep was 0.26±0.30.

For the very recent generations (i.e., fewer than five generations ago), the accuracy of estimating N_e is negatively affected by the low level of LD between SNPs at long distances (Liu et al., 2017). Thus, we estimated the N_e in Zandi sheep up to five generations ago. The estimated N_e in Zandi sheep for 59 and 80 generations ago was 786 and 1035, respectively, which is in line with the large N_e (> 500) reported for various breeds of sheep in the HapMap project (Kijas et al., 2012). The low level of r² even at relatively short distances shows the N_e in Zandi sheep was large in recent past generations compared with other species. For example, the r² for SNP pairs within 0.9–1.0 Mb and the N_e in recent generations in Duroc pigs were reported to be 0.2 and 75, respectively (Grossi et al., 2017). The r² (same SNP pair distance) and the N_e in recent generations of Zandi sheep were 0.07 and 122. The estimated genome-based N_e (i.e., N_e SNP=122) in the current study was higher than the pedigree-based N_e reported for Zandi (i.e., N_e PED=66) in Ghafouri-Kesbi et al. (2008), in which the estimated N_e PED is likely to be less accurate due to the lack of precise and in-depth pedigree records for this breed. However, results should be interpreted with caution since other factors such as sampling itself, for example, can potentially affect the results. The threshold of 100 for N_e has been recommended for many species to maintain genetic diversity in the population (Meuwissen, 2009). The N_e of Zandi sheep five generations ago (122) was above the recommended critical threshold. However, given the sharp drop in N_e in recent generations, we should be careful to maintain the N_e larger than 100.

We observed that the N_e decreased more intensively from roughly 550 generations ago, which is in agreement with the results reported for a couple of Iranian fat-tailed sheep breeds by Moradi et al. (2017). Considering a generation interval of 4 to 5 years for sheep, this point coincides with the period when the first archaeological evidence of Iranian fat-tailed sheep breeds at Takht-Jamshid, Iran, was obtained (Moradi et al., 2017). These results may support the hypothesis that selection for fat-tailed trait in Iranian indigenous sheep populations occurred nearly 2500 years ago and has continued since then. However, more evidence is required to reinforce this hypothesis.

One of the main factors affecting autozygosity estimates is the density of SNP chips applied to generate the data for ROH studies. Marras et al. (2015), in a study of ROHs applying a medium-density chip, reported that when heterozygous SNPs were allowed, the number of longer ROHs increased dramatically and suggested not using them in the ROH. Ferenčaković et al. (2013) reported that the 50K SNP panel led to the overestimation of short-length ROHs (< 4 Mb), probably because of heterozygous SNPs on high-density chips that may not exist on a medium-density one. Hence, they proposed that the 50K panel does not have enough sensitivity for the accurate determination of short runs of homozygosity (i.e., < 4 Mb length). In our study, because a medium-density SNP panel was adopted, we outlined ROHs as regions of homozygous genotypes that had at least 4 Mb of length distinguished with a maximum number of one heterozygous SNP.

Long ROHs in an individual can occur due to either inbreeding events or selection pressure (Mastrangelo et al., 2016). Since the Zandi population is subject to selection programs (Ghafouri-Kesbi et al., 2008; Mohammadi et al., 2013), the occurrence or runs of homozygosity in this breed can be attributed to both inbreeding effects and selection pressure. In the ROH analysis of Zandi sheep, around 37.8 % of the detected ROH was relatively long (> 10 Mb) and 62.2 % was shorter than 10 Mb. As reported in the literature, in Barbaresca sheep, 67 % of the detected ROH was short (ranging from 1 to 10 Mb) (Mastrangelo et al., 2017), and in some local breeds of dairy cattle, approximately 23 % of the detected ROH was longer than 10 Mb (Mastrangelo et al., 2016). However, comparing different ROH studies is not a straightforward matter because different researchers use different criteria, especially for the minimum length of a ROH and the minimum number of SNPs included in a ROH.

5 Conclusions

We found that in Zandi sheep, the amount of LD was relatively small between the adjacent SNPs in Illumina Ovine SNP50 BeadChip (0.13±0.19), and it decreased rapidly by increasing the distance between the markers. Therefore, a high-density SNP panel is required for a robust genomic selection and fine mapping of QTLs in this breed. Given the total length of autosomes in sheep (2645.1 Gb) and the average level of LD between SNPs in 5 kb (r²≈0.26) and 15 kb (r²≈0.21), we estimated that approximately 180 000 and 530 000 evenly spaced SNPs are required to increase the r² to 0.20 and 0.25, respectively. Compared with other species, average inbreeding in Zandi sheep was found to be relatively low (0.023). In our study, 7 sheep were not inbred (F_ROH=0), and the inbreeding level of 35 sheep was more than the average level of inbreeding. Thus, optimal mating designs can be beneficial in controlling population inbreeding in Zandi sheep. Although the N_e of the Zandi breed is not critically low, given the sharp decrease in N_e in recent generations, we should be careful to ensure the N_e remains large.

Data availability

The original data are available upon request to the corresponding authors.

Supplement

The supplement related to this article is available online at: https://doi.org/10.5194/aab-62-143-2019-supplement.

Author contributions

SMG participated in the design of the study, conducted the analysis, and was the primary author of the paper. HMS participated in the design of the study. NP and PD participated in data analysis and drafting the paper. MK participated in the design of the study, as well as drafting and reviewing the paper. All authors read and approved the final paper.

Competing interests

The authors declare that they have no conflict of interest.

Acknowledgements

The authors would like to thank the Animal Breeding Centre of Iran (ABCI) for allowing access to the records and animals of the Iranian Zandi sheep breed. The authors are also grateful to the staff of the Zandi Sheep Breeding Station for their participation in the blood sampling. The authors declare that they have received no specific funding for this work.

Review statement

This paper was edited by Steffen Maak and reviewed by Marco Milanesi and one anonymous referee.

References

Al-Mamun, H. A., Clark, S., Kwan, P., and Gondro, C.: Genome-wide linkage disequilibrium and genetic diversity in five populations of Australian domestic sheep, Genet. Sel. Evol., 47, 90, https://doi.org/10.1186/s12711-015-0169-6, 2015.

Barbato, M., Orozco-terWengel, P., Tapio, M., and Bruford, M. W.: SNeP: a tool to estimate trends in recent effective population size trajectories using genome-wide SNP data, Frontiers in genetics, 6, 109, https://doi.org/10.3389/fgene.2015.00109, 2015.

Barrett, J. C., Fry, B., Maller, J., and Daly, M. J.: Haploview: analysis and visualization of LD and haplotype maps, Bioinformatics, 21, 263–265, 2004.

Biegelmeyer, P., Gulias-Gomes, C. C., Caetano, A. R., Steibel, J. P., and Cardoso, F. F.: Linkage disequilibrium, persistence of phase and effective population size estimates in Hereford and Braford cattle, BMC Genet., 17, 32, https://doi.org/10.1186/s12863-016-0339-8, 2016.

Bohmanova, J., Sargolzaei, M., and Schenkel, F. S.: Characteristics of linkage disequilibrium in North American Holsteins, BMC Genomics, 11, 421, https://doi.org/10.1186/1471-2164-11-421, 2010.

Browning, S. R. and Browning, B. L.: Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering, Am. J. Hum. Genet., 81, 1084–1097, 2007.

Carlson, C. S., Eberle, M. A., Kruglyak, L., and Nickerson, D. A.: Mapping complex disease loci in whole-genome association studies, Nature, 429, 446, https://doi.org/10.1038/nature02623, 2004.

Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., and Lee, J. J.: Second-generation PLINK: rising to the challenge of larger and richer datasets, GigaScience, 4, 7, https://doi.org/10.1186/s13742-015-0047-8, 2015.

Corbin, L. J., Liu, A., Bishop, S., and Woolliams, J.: Estimation of historical effective population size using linkage disequilibria with marker data, J. Anim. Breed. Genet., 129, 257–270, 2012.

Edea, Z., Dadi, H., Dessie, T., Lee, S.-H., and Kim, K.-S.: Genome-wide linkage disequilibrium analysis of indigenous cattle breeds of Ethiopia and Korea using different SNP genotyping BeadChips, Genes Genom., 37, 759–765, 2015.

Ferenčaković, M., Sölkner, J., and Curik, I.: Estimating autozygosity from high-throughput information: effects of SNP density and genotyping errors, Genet. Sel. Evol., 45, 42, https://doi.org/10.1186/1297-9686-45-42, 2013.

García-Gámez, E., Sahana, G., Gutiérrez-Gil, B., and Arranz, J.-J.: Linkage disequilibrium and inbreeding estimation in Spanish Churra sheep, BMC Genet., 13, 43, https://doi.org/10.1186/1471-2156-13-43, 2012.

Ghafouri-Kesbi, F., Eskandarinasab, M., and Hassanabadi, A.: Investigation of genetic variability and inbreeding characteristics in a population of Zandi sheep, Can. J. Anim. Sci., 88, 409–417, 2008.

Ghoreishifar, S. M., Moradi-Shahrbabak, H., Moradi-Shahrbabak, M., Nicolazzi, E. L., Williams, J. L., Iamartino, D., and Nejati-Javaremi, A.: Accuracy of imputation of single-nucleotide polymorphism marker genotypes for water buffaloes (Bubalus bubalis) using different reference population sizes and imputation tools, Livest. Sci., 216, 174–182, 2018.

Goddard, M. and Hayes, B.: Using the genomic relationship matrix to predict the accuracy of genomic selection, J. Anim. Breed. Genet., 128, 409–421, 2011.

Grossi, D. A., Jafarikia, M., Brito, L. F., Buzanskas, M. E., Sargolzaei, M., and Schenkel, F. S.: Genetic diversity, extent of linkage disequilibrium and persistence of gametic phase in Canadian pigs, BMC Genet., 18, 6, https://doi.org/10.1186/s12863-017-0473-y, 2017.

Gusev, A., Palamara, P. F., Aponte, G., Zhuang, Z., Darvasi, A., Gregersen, P., and Pe'er, I.: The architecture of long-range haplotypes shared within and across populations, Mol. Biol. Evol., 29, 473–486, 2011.

Hayes, B. and Goddard, M.: Prediction of total genetic value using genome-wide dense marker maps, Genetics, 157, 1819–1829, 2001.

Hayes, B. J., Lewin, H. A., and Goddard, M. E.: The future of livestock breeding: genomic selection for efficiency, reduced emissions intensity, and adaptation, Trends Genet., 29, 206–214, 2013.

Helms, C.: Salting out Procedure for Human DNA extraction, The Donis-Keller Lab-Lab Manual Homepage, 1990.

Hill, W. and Robertson, A.: The effects of inbreeding at loci with heterozygote advantage, Genetics, 60, 615–628, 1968.

Jasielczuk, I., Gurgul, A., Szmatoła, T., Ząbek, T., Pawlina, K., Semik, E., and Bugno-Poniewierska, M.: Linkage disequilibrium and haplotype block structure in Limousin, Simmental and native Polish Red cattle, Livest. Sci., 191, 57–63, 2016.

Kijas, J. W., Lenstra, J. A., Hayes, B., Boitard, S., Neto, L. R. P., San Cristobal, M., Servin, B., McCulloch, R., Whan, V., and Gietzen, K.: Genome-wide analysis of the world's sheep breeds reveals high levels of historic mixture and strong recent selection, PLoS Biol., 10, e1001258, https://doi.org/10.1371/journal.pbio.1001258, 2012.

Lencz, T., Lambert, C., DeRosse, P., Burdick, K. E., Morgan, T. V., Kane, J. M., Kucherlapati, R., and Malhotra, A. K.: Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia, P. Natl. Acad. Sci. USA, 104, 19942–19947, 2007.

Leroy, G.: Inbreeding depression in livestock species: review and meta-analysis, Anim. Genet., 45, 618–628, 2014.

Liu, S., He, S., Chen, L., Li, W., Di, J., and Liu, M.: Estimates of linkage disequilibrium and effective population sizes in Chinese Merino (Xinjiang type) sheep by genome-wide SNPs, Genes Genom., 39, 733–745, 2017.

Marras, G., Gaspa, G., Sorbolini, S., Dimauro, C., Ajmone-Marsan, P., Valentini, A., Williams, J. L., and Macciotta, N. P.: Analysis of runs of homozygosity and their relationship with inbreeding in five cattle breeds farmed in Italy, Anim. Genet., 46, 110–121, 2015.

Mastrangelo, S., Tolone, M., Di Gerlando, R., Fontanesi, L., Sardina, M., and Portolano, B.: Genomic inbreeding estimation in small populations: evaluation of runs of homozygosity in three local dairy cattle breeds, Animal, 10, 746–754, 2016.

Mastrangelo, S., Portolano, B., Di Gerlando, R., Ciampolini, R., Tolone, M., Sardina, M., and Consortium, I. S. G.: Genome-wide analysis in endangered populations: a case study in Barbaresca sheep, Animal, 11, 1107–1116, 2017.

McQuillan, R., Leutenegger, A.-L., Abdel-Rahman, R., Franklin, C. S., Pericic, M., Barac-Lauc, L., Smolej-Narancic, N., Janicijevic, B., Polasek, O., and Tenesa, A.: Runs of homozygosity in European populations, Am. J. Hum. Genet., 83, 359–372, 2008.

Meuwissen, T. H.: Accuracy of breeding values of 'unrelated' individuals predicted by dense SNP genotyping, Genet. Sel. Evol., 41, 35, https://doi.org/10.1186/1297-9686-41-35, 2009.

Mohammadi, K., Nassiri, M. T. B., Rahmatnejad, E., Sheikh, M., Fayazi, J., and Manesh, A. K.: Phenotypic and genetic parameter estimates for reproductive traits in Zandi sheep, Trop. Anim. Health Pro., 45, 671–677, 2013.

Moradi, M. H., Nejati-Javaremi, A., Moradi-Shahrbabak, M., Dodds, K. G., and McEwan, J. C.: Genomic scan of selective sweeps in thin and fat tail sheep breeds for identifying of candidate regions associated with fat deposition, BMC Genet., 13, 10, https://doi.org/10.1186/1471-2156-13-10, 2012.

Moradi, M. H., Farahani, A. H., and Nejati-Javaremi, A.: Genome-wide evaluation of effective population size in some Iranian sheep breeds using linkage disequilibrium information, IJAS, 48, 39–49, https://doi.org/10.22059/IJAS.2017.213736.653464, 2017 (in Farsi).

Nicolazzi, E. L., Caprera, A., Nazzicari, N., Cozzi, P., Strozzi, F., Lawley, C., Pirani, A., Soans, C., Brew, F., and Jorjani, H.: SNPchiMp v. 3: integrating and standardizing single nucleotide polymorphism data for livestock species, BMC Genomics, 16, p. 283, https://doi.org/10.1186/s12864-015-1497-1, 2015.

Notter, D. R.: The importance of genetic diversity in livestock populations of the future, J. Anim. Sci., 77, 61–69, 1999.

Prieur, V., Clarke, S. M., Brito, L. F., McEwan, J. C., Lee, M. A., Brauning, R., Dodds, K. G., and Auvray, B.: Estimation of linkage disequilibrium and effective population size in New Zealand sheep using three different methods to create genetic maps, BMC Genet., 18, 68, https://doi.org/10.1186/s12863-017-0534-2, 2017.

Qanbari, S., Pimentel, E., Tetens, J., Thaller, G., Lichtner, P., Sharifi, A., and Simianer, H.: The pattern of linkage disequilibrium in German Holstein cattle, Anim. Genet., 41, 346–356, 2010.

Rashidi, A., Mokhtari, M. S., Jahanshahi, A. S., and Abadi, M. M.: Genetic parameter estimates of pre-weaning growth traits in Kermani sheep, Small Ruminant Res., 74, 165–171, 2008.

R Core Team: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria 2014, available at: https://www.R-project.org (last access: 16 February 2015), 2014.

Rexroad, C. E. and Vallejo, R. L.: Estimates of linkage disequilibrium and effective population size in rainbow trout, BMC Genet., 10, 83, https://doi.org/10.1186/1471-2156-10-83, 2009.

Scraggs, E., Zanella, R., Wojtowicz, A., Taylor, J., Gaskins, C., Reeves, J., de Avila, J., and Neibergs, H.: Estimation of inbreeding and effective population size of full-blood wagyu cattle registered with the American Wagyu Cattle Association, J. Anim. Breed. Genet., 131, 3–10, 2014.

Sved, J.: Linkage disequilibrium and homozygosity of chromosome segments in finite populations, Theor. Popul. Biol., 2, 125–141, 1971.

VanRaden, P. M.: Efficient methods to compute genomic predictions, J. Dairy Sci., 91, 4414–4423, 2008.

Wang, J.: Estimation of effective population sizes from data on genetic markers, Philos. T. R. Soc. B, 360, 1395–1409, 2005.

Zhao, F., Wang, G., Zeng, T., Wei, C., Zhang, L., Wang, H., Zhang, S., Liu, R., Liu, Z., and Du, L.: Estimations of genomic linkage disequilibrium and effective population sizes in three sheep populations, Livest. Sci., 170, 22–29, 2014.

Articles

Short summary

Analysis of the genomic variation within a breed is an essential requirement for keeping the breed’s integrity and for defining proper breeding objectives. This research aimed to analyze the genomic structure of Iranian Zandi sheep and showed that a more dense marker is required to achieve genetic gain in Zandi sheep using genomic selection. Attention should be given to adopting an optimal mating system in order to avoid the further loss of genetic diversity in this breed.

Linkage disequilibrium and within-breed genetic diversity in Iranian Zandi sheep

2.1 Sample collection and quality control

2.2 LD calculation

2.3 Estimation of Ne

2.4 ROH mapping

3.1 Descriptive statistics

3.2 Extent of genome-wide LD

3.3 Ne

3.4 ROHs and ROH-based inbreeding (FROH)

2.3 Estimation of N_e

3.3 N_e

3.4 ROHs and ROH-based inbreeding (F_ROH)