Assessment of runs of homozygosity islands and estimates of genomic inbreeding in Gyr (Bos indicus) dairy cattle.
ABSTRACT: Runs of homozygosity (ROH) are continuous homozygous segments of the DNA sequence. They have been applied to quantify individual autozygosity and used as a potential inbreeding measure in livestock species. The aim of the present study was (i) to investigate genome-wide autozygosity to identify and characterize ROH patterns in Gyr dairy cattle genome; (ii) identify ROH islands for gene content and enrichment in segments shared by more than 50% of the samples, and (iii) compare estimates of molecular inbreeding calculated from ROH (FROH), genomic relationship matrix approach (FGRM) and based on the observed versus expected number of homozygous genotypes (FHOM), and from pedigree-based coefficient (FPED).ROH were identified in all animals, with an average number of 55.12?±?10.37 segments and a mean length of 3.17 Mb. Short segments (ROH1-2 Mb) were abundant through the genomes, which accounted for 60% of all segments identified, even though the proportion of the genome covered by them was relatively small. The findings obtained in this study suggest that on average 7.01% (175.28 Mb) of the genome of this population is autozygous. Overlapping ROH were evident across the genomes and 14 regions were identified with ROH frequencies exceeding 50% of the whole population. Genes associated with lactation (TRAPPC9), milk yield and composition (IRS2 and ANG), and heat adaptation (HSF1, HSPB1, and HSPE1), were identified. Inbreeding coefficients were estimated through the application of FROH, FGRM, FHOM, and FPED approaches. FPED estimates ranged from 0.00 to 0.327 and FROH from 0.001 to 0.201. Low to moderate correlations were observed between FPED-FROH and FGRM-FROH, with values ranging from -0.11 to 0.51. Low to high correlations were observed between FROH-FHOM and moderate between FPED-FHOM and FGRM-FHOM. Correlations between FROH from different lengths and FPED gradually increased with ROH length.Genes inside ROH islands suggest a strong selection for dairy traits and enrichment for Gyr cattle environmental adaptation. Furthermore, low FPED-FROH correlations for small segments indicate that FPED estimates are not the most suitable method to capture ancient inbreeding. The existence of a moderate correlation between larger ROH indicates that FROH can be used as an alternative to inbreeding estimates in the absence of pedigree records.
Project description:BACKGROUND:The aim of this study was to assess genome-wide autozygosity in a Nellore cattle population and to characterize ROH patterns and autozygosity islands that may have occurred due to selection within its lineages. It attempts also to compare estimates of inbreeding calculated from ROH (FROH), genomic relationship matrix (FGRM), and pedigree-based coefficient (FPED). RESULTS:The average number of ROH per animal was 55.15?±?13.01 with an average size of 3.24 Mb. The Nellore genome is composed mostly by a high number of shorter segments accounting for 78% of all ROH, although the proportion of the genome covered by them was relatively small. The genome autozygosity proportion indicates moderate to high inbreeding levels for classical standards, with an average value of 7.15% (178.70 Mb). The average of FPED and FROH, and their correlations (-?0.05 to 0.26) were low. Estimates of correlation between FGRM-FPED was zero, while the correlation (-?0.01 to -?0.07) between FGRM-FROH decreased as a function of ROH length, except for FROH?>?8Mb (-?0.03). Overall, inbreeding coefficients were not high for the genotyped animals. Autozygosity islands were evident across the genome (n?=?62) and their genomic location did not largely differ within lineages. Enriched terms (p?<?0.01) associated with defense response to bacteria (GO:0042742), immune complex reaction (GO:0045647), pregnancy-associated glycoproteins genes (GO:0030163), and organism growth (GO:0040014) were described within the autozygotic islands. CONCLUSIONS:Low FPED-FROH correlation estimates indicate that FPED is not the most suitable method for capturing ancient inbreeding when the pedigree does not extend back many generations and FROH should be used instead. Enriched terms (p <?0.01) suggest a strong selection for immune response. Non-overlapping islands within the lineages greatly explain the mechanism underlying selection for functionally important traits in Nellore cattle.
Project description:Angler (RVA) and Red-and-White dual-purpose (RDN) cattle were in the past decades crossed with influential Red Holstein (RH) sires. However, genome-wide diversity studies in these breeds are lacking. The objective of the present study was to elucidate the genome-wide diversity and population structure of the three German cattle breeds. Using 40,851 single nucleotide polymorphism markers scored in 337 individuals, runs of homozygosity (ROH) were analysed in each breed. Clustering and a high-resolution network visualisation analyses were performed on an extended dataset that included 11 additional (outgroup) breeds. Genetic diversity levels were high with observed heterozygosity above 0.35 in all three breeds. Only RVA had a recent past effective population size (Ne) estimate above 100 at 5 generations ago. ROH length distribution followed a similar pattern across breeds and the majority of ROH were found in the length class of >5 to 10 Mb. Estimates of average inbreeding calculated from ROH (FROH) were 0.021 (RVA), 0.045 (RDN) and 0.053 (RH). Moderate to high positive correlations were found between FROH and pedigree inbreeding (FPED) and between FROH and inbreeding derived from the excess of homozygosity (FHOM), while the intercept of the regression of FROH on FPED was above zero. The population structure analysis showed strong evidence of admixture between RVA and RH. Introgression of RDN with RH genes was minimally detected and for the first time, the study uncovered Norwegian Red Cattle ancestry in RVA. Highly heterogeneous genetic background was found for RVA and RH and as expected, the breeds of the extended dataset effectively differentiated mostly based on geographical origin, validating our findings. The results of this study confirm the impact of RH sires on RVA and RDN populations. Furthermore, a close monitoring is suggested to curb further reduction of Ne in the breeds.
Project description:Genome-wide runs of homozygosity (ROH) are suitable for understanding population history, calculating genomic inbreeding, deciphering genetic architecture of complex traits and diseases as well as identifying genes linked with agro-economic traits. Autozygosity and ROH islands, genomic regions with elevated ROH frequencies, were characterized in 112 animals of seven Indian native cattle breeds (B. indicus) using BovineHD BeadChip. In total, 4138 ROH were detected. The average number of ROH per animal was maximum in draft breed, Kangayam (63.62 ± 22.71) and minimum in dairy breed, Sahiwal (24.62 ± 11.03). The mean ROH length was maximum in Vechur (6.97 Mb) and minimum in Hariana (4.04 Mb). Kangayam revealed the highest ROH based inbreeding (FROH > 1Mb = 0.113 ± 0.059), whereas Hariana (FROH > 1Mb = 0.042 ± 0.031) and Sahiwal (FROH > 1Mb = 0.043 ± 0.048) showed the lowest. The high standard deviation observed in each breed highlights a considerable variability in autozygosity. Out of the total autozygous segments observed in each breed except Vechur, > 80% were of short length (< 8 Mb) and contributed almost 50% of the genome proportion under ROH. However, in Vechur cattle, long ROH contributed 75% of the genome proportion under ROH. ROH patterns revealed Hariana and Sahiwal breeds as less consanguineous, while recent inbreeding was apparent in Vechur. Maximum autozygosity observed in Kangayam is attributable to both recent and ancient inbreeding. The ROH islands were harbouring higher proportion of QTLs for production traits (20.68% vs. 14.64%; P? 0.05) but lower for reproductive traits (11.49% vs. 15.76%; P? 0.05) in dairy breeds compared to draft breed. In draft cattle, genes associated with resistant to diseases/higher immunity (LYZL1, SVIL, and GPX4) and stress tolerant (CCT4) were identified in ROH islands; while in dairy breeds, for milk production (PTGFR, CSN1S1, CSN2, CSN1S2, and CSN3). Significant difference in ROH islands among large and short statured breeds was observed at chromosome 3 and 5 involving genes like PTGFR and HMGA2 responsible for milk production and stature, respectively. PCA analysis on consensus ROH regions revealed distinct clustering of dairy, draft and short stature cattle breeds.
Project description:The analysis of runs of homozygosity (ROH), using high throughput genomic data, has become a valuable and frequently used methodology to characterize the genomic and inbreeding variation of livestock and wildlife animal populations. However, this methodology has been scarcely used in highly inbred domestic animals. Here, we analyzed and characterized the occurrence of ROH fragments in highly inbred (HI; average pedigree-based inbreeding coefficient FPED = 0.164; 0.103 to 0.306) and outbred Retinta bulls (LI; average FPED = 0.008; 0 to 0.025). We studied the length of the fragments, their abundance, and genome distribution using high-density microarray data. The number of ROH was significantly higher in the HI group, especially for long fragments (>8Mb). In the LI group, the number of ROH continuously decreased with fragment length. Genome-wide distribution of ROH was highly variable between samples. Some chromosomes presented a larger number of fragments (BTA1, BTA19, BTA29), others had longer fragments (BTA4, BTA12, BTA17), while other ones showed an increased ROH accumulation over specific loci (BTA2, BTA7, BTA23, BTA29). Similar differences were observed in the analysis of 12 individuals produced by a similar inbred event (FPED3 = 0.125). The correlation between the fraction of the genome covered by ROH (FROH) and FPED was high (0.79), suggesting that ROH-based estimations are indicative of inbreeding levels. On the other hand, the correlation between FPED and the microsatellite-based inbreeding coefficient (FMIC) was only moderate (r = 0.44), suggesting that STR-based inbreeding estimations should be avoided. Similarly, we found a very low correlation (r = -0.0132) between recombination rate and ROH abundance across the genome. Finally, we performed functional annotation analyses of genome regions with significantly enriched ROH abundance. Results revealed gene clusters related to pregnancy-associated proteins and immune reaction. The same analysis performed for regions enriched with recently formed ROH (> 8 Mb) showed gene clusters related to flagellum assembly. In both cases, the processes were related to male and female reproductive functions, which may partially explain the reduced fertility associated with inbred populations.
Project description:Genetic characterization of African goats is one of the current priorities in the improvement of goats in the continent. This study contributes to the characterization effort by determining the levels and number of generations to common ancestors ("age") associated with inbreeding in African goat breeds and identifies regions that contain copy number variation mistyped as being homozygous. Illumina 50k single nucleotide polymorphism genotype data for 608 goats from 31 breeds were used to compute the level and age of inbreeding at both local (marker) and global levels (FG) using a model-based approach based on a hidden Markov model. Runs of homozygosity (ROH) segments detected using the Viterbi algorithm led to ROH-based inbreeding coefficients for all ROH (FROH) and for ROH longer than 2 Mb (FROH > 2Mb). Some of the genomic regions identified as having ROH are likely to be hemizygous regions (copy number deletions) mistyped as homozygous regions. Although the proportion of these miscalled ROH is small and does not substantially affect estimates of levels of inbreeding for individual animals, the inbreeding metrics were adjusted by removing these regions from the ROH. All the inbreeding metrics varied widely across breeds, with overall means of 0.0408, 0.0370, and 0.0691 and medians of 0.0125, 0.0098, and 0.0366 for FROH, FROH > 2Mb, and FG, respectively. Several breeds (including Menabe and Sofia from Madagascar) had high proportions of recent inbreeding, while Small East African, Ethiopian, and most of the West African breeds (including West African Dwarf) had more ancient inbreeding.
Project description:Inbreeding leaves distinct genomic traces, most notably long genomic tracts that are identical by descent and completely homozygous. These runs of homozygosity (ROH) can contribute to inbreeding depression if they contain deleterious variants that are fully or partially recessive. Several lines of evidence have been used to show that long (> 5 megabase) ROH are disproportionately likely to harbor deleterious variation, but the extent to which long vs. short tracts contribute to autozygosity at loci known to be deleterious and recessive has not been studied. In domestic dogs, nearly 200 mutations are known to cause recessive diseases, most of which can be efficiently assayed using SNP arrays. By examining genome-wide data from over 200,000 markers, including 150 recessive disease variants, we built high-resolution ROH density maps for nearly 2,500 dogs, recording ROH down to 500 kilobases. We observed over 678 homozygous deleterious recessive genotypes in the panel across 29 loci, 90% of which overlapped with ROH inferred by GERMLINE. Although most of these genotypes were contained in ROH over 5 Mb in length, 14% were contained in short (0.5 - 2.5 megabase) tracts, a significant enrichment compared to the genetic background, suggesting that even short tracts are useful for computing inbreeding metrics like the coefficient of inbreeding estimated from ROH (FROH ). In our dataset, FROH differed significantly both within and among dog breeds. All breeds harbored some regions of reduced genetic diversity due to drift or selective sweeps, but the degree of inbreeding and the proportion of inbreeding caused by short vs. long tracts differed between breeds, reflecting their different population histories. Although only available for a few species, large genome-wide datasets including recessive disease variants hold particular promise not only for disentangling the genetic architecture of inbreeding depression, but also evaluating and improving upon current approaches for detecting ROH.
Project description:Runs of homozygosity (ROH) are contiguous homozygous genotype segments in the genome that are present in an individual since the identical haplotypes are inherited from each parent. The aim of this study was to investigate the frequency and distribution of ROH in the genomes of Landrace, Songliao black and Yorkshire pigs. We calculated two types of genome inbreeding coefficients and their correlation, including the inbreeding coefficient based on ROH (FROH) and the inbreeding coefficient based on the difference between the observed and expected number of homozygous genotypes (FHOM). Furthermore, we identified candidate genes in the genomic region most associated with ROH. We identified 21,312 ROH in total. The average number of ROH per individual was 32.99 ± 0.38 and the average length of ROH was 6.40 ± 0.070 Mb in the three breeds. The FROH results showed that Yorkshire pigs exhibited the highest level of inbreeding (0.092 ± 0.0015) and that Landrace pigs exhibited the lowest level of inbreeding (0.073 ± 0.0047). The average correlation between FROH and FHOM was high (0.94) within three breeds. The length of ROH provides insight into the inbreeding history of these three pig breeds. In this study, Songliao black pigs presented a higher frequency and average length of long ROH (>40 Mb) compared with those of Landrace and Yorkshire pigs, which indicated greater inbreeding in recent times. Genes related to reproductive traits (GATM, SPATA46, HSD17B7, VANGL2, DAXX, CPEB1), meat quality traits (NR1I3, APOA2, USF1) and energy conversion (NDUFS2) were identified within genomic regions with a high frequency of ROH. These genes could be used as target genes for further marker-assisted selection and genome selection.
Project description:The Lundehund is an old dog breed with remarkable anatomical features including polydactyly in all four limbs and extraordinary flexibility of the spine. We genotyped 28 Lundehund using the canine Illumina high density beadchip to estimate the effective population size (Ne) and inbreeding coefficients as well as to identify potential regions of positive selection. The decay of linkage disequilibrium was slow with r2=0.95 in 50 kb distance. The last 7-200 generations ago, Ne was at 10-13. An increase of Ne was noted in the very recent generations with a peak value of 19 for Ne at generation 4. The FROH estimated for 50-, 65- and 358-SNP windows were 0.87, 087 and 0.81, respectively. The most likely estimates for FROH after removing identical-by-state segments due to linkage disequilibria were at 0.80-0.81. The extreme loss of heterozygosity has been accumulated through continued inbreeding over 200 generations within a probably closed population with a small effective population size. The mean inbreeding coefficient based on pedigree data for the last 11 generations (FPed=0.10) was strongly biased downwards due to the unknown coancestry of the founders in this pedigree data. The long-range haplotype test identified regions with genes involved in processes of immunity, olfaction, woundhealing and neuronal development as potential targets of selection. The genes QSOX2, BMPR1B and PRRX2 as well as MYOM1 are candidates for selection on the Lundehund characteristics small body size, increased number of digits per paw and extraordinary mobility, respectively. The objectives of the present study were to genotype 28 Lundehund using the canine Illumina high density beadchip (Illumina, San Diego, CA, USA) to estimate the effective population size (Ne) from data on linkage disequilibria (LD). We identified runs of homozygosity (ROH) as regions with a local loss of genetic variation. Inbreeding coefficients were calculated on the basis of pedigree data (FPed) and genotype information (FROH, FIS) to compare the results of these different methods for this population. We applied the long-range haplotype test to search for potential selective sweeps.
Project description:It is well known that inbreeding increases the risk of recessive monogenic diseases, but it is less certain whether it contributes to the etiology of complex diseases such as schizophrenia. One way to estimate the effects of inbreeding is to examine the association between disease diagnosis and genome-wide autozygosity estimated using runs of homozygosity (ROH) in genome-wide single nucleotide polymorphism arrays. Using data for schizophrenia from the Psychiatric Genomics Consortium (n = 21,868), Keller et al. (2012) estimated that the odds of developing schizophrenia increased by approximately 17% for every additional percent of the genome that is autozygous (? = 16.1, CI(?) = [6.93, 25.7], Z = 3.44, p = 0.0006). Here we describe replication results from 22 independent schizophrenia case-control datasets from the Psychiatric Genomics Consortium (n = 39,830). Using the same ROH calling thresholds and procedures as Keller et al. (2012), we were unable to replicate the significant association between ROH burden and schizophrenia in the independent PGC phase II data, although the effect was in the predicted direction, and the combined (original + replication) dataset yielded an attenuated but significant relationship between Froh and schizophrenia (? = 4.86,CI(?) = [0.90,8.83],Z = 2.40,p = 0.02). Since Keller et al. (2012), several studies reported inconsistent association of ROH burden with complex traits, particularly in case-control data. These conflicting results might suggest that the effects of autozygosity are confounded by various factors, such as socioeconomic status, education, urbanicity, and religiosity, which may be associated with both real inbreeding and the outcome measures of interest.
Project description:Inbreeding is often an inevitable outcome of strong directional artificial selection but on average it reduces population fitness with increased frequency of recessive deleterious alleles. Runs of homozygosity (ROH) representing genomic autozygosity that occur from mating between selected and genomically related individuals may be able to reveal the regions affecting fitness. To examine the influence of genomic autozygosity on fitness, we used a genome-wide association test to evaluate potential negative correlations between ROH and daughter pregnancy rate (DPR) or somatic cell score (SCS) in US Jersey cattle. In addition, relationships between changes of local ROH and inbreeding coefficients (F) were assessed to locate genomic regions with increased inbreeding. Despite finding some decreases in fertility associated with incremental increases in F, most emerging local ROH were not significantly associated with DPR or SCS. Furthermore, the analyses of ROH could be approximated with the most frequent haplotype(s), including the associations of ROH and F or traits. The analysis of the most frequent haplotype revealed that associations of ROH and fertility could be accounted for by the additive genetic effect on the trait. Thus, we suggest that a change of autozygosity is more likely to demonstrate footprints of selected haplotypes for production rather than highlight the possible increased local autozygosity of a recessive detrimental allele resulting from the mating between closely related animals in Jersey cattle.