-
Relatedness between numerically small Dutch Red dairy cattle populations and possibilities for multibreed genomic prediction.
Red dairy breeds are a valuable cultural and historical asset, and often a source of unique genetic diversity. However, they have difficulties competing with other, more productive, dairy breeds. Improving competitiveness of Red dairy breeds, by accelerating their genetic improvement using genomic selection, may be a promising strategy to secure their long-term future. For many Red dairy breeds, establishing a sufficiently large breed-specific reference population for genomic prediction is often not possible, but may be overcome by adding individuals from another breed. Relatedness between breeds strongly decides the benefit of adding another breed to the reference population. To prioritize among available breeds, the effective number of chromosome segments (Me) can be used as an indicator of relatedness between individuals from different breeds. The Me is also an important parameter in determining the accuracy of genomic prediction. The Me can be estimated both within a population and between 2 populations or breeds, as the reciprocal of the variance of genomic relationships. We investigated relatedness between 6 Dutch Red cattle breeds, Groningen White Headed (GWH), Dutch Friesian (DF), Meuse-Rhine-Yssel (MRY), Dutch Belted (DB), Deep Red (DR), and Improved Red (IR), focusing primarily on the Me, to predict which of those breeds may benefit from including reference animals of the other breeds. All of these breeds, except MRY, are under high risk of extinction. Our results indicated high variability of Me, especially between Me ranging from ∼3,500 to ∼17,400, indicating different levels of relatedness between the breeds. Two clusters are especially important, one formed by MRY, DR, and IR, and the other comprising DF and DB. Although relatedness between breeds within each of these 2 clusters is high, across-breed genomic prediction is still limited by the current number of genotyped individuals, which for many breeds is low. However, adding MRY individuals would increase the reference population of DR substantially. We estimated that between 11 and 133 individuals from other breeds are needed to achieve accuracy of genomic prediction equivalent to using one additional individual from the same breed. Given the variation in size of the breeds in this study, the benefit of a multibreed reference population is expected to be lower for larger breeds than for the smaller ones.
Marjanovic J
,Hulsegge B
,Calus MPL
《-》
-
Overlap in genomic variation associated with milk fat composition in Holstein Friesian and Dutch native dual-purpose breeds.
The aim of this study was to identify if genomic variations associated with fatty acid (FA) composition are similar between the Holstein-Friesian (HF) and native dual-purpose breeds used in the Dutch dairy industry. Phenotypic and genotypic information were available for the breeds Meuse-Rhine-Yssel (MRY), Dutch Friesian (DF), Groningen White Headed (GWH), and HF. First, the reliability of genomic breeding values of the native Dutch dual-purpose cattle breeds MRY, DF, and GWH was evaluated using single nucleotide polymorphism (SNP) effects estimated in HF, including all SNP or subsets with stronger associations in HF. Second, the genomic variation of the regions associated with FA composition in HF (regions on Bos taurus autosome 5, 14, and 26), were studied in the different breeds. Finally, similarities in genotype and allele frequencies between MRY, DF, GWH, and HF breeds were assessed for specific regions associated with FA composition. On average across the traits, the highest reliabilities of genomic prediction were estimated for GWH (0.158) and DF (0.116) when the 8 to 22 SNP with the strongest association in HF were included. With the same set of SNP, GEBV for MRY were the least reliable (0.022). This indicates that on average only 2 (MRY) to 16% (GWH) of the genomic variation in HF is shared with the native Dutch dual-purpose breeds. The comparison of predicted variances of different regions associated with milk and milk fat composition showed that breeds clearly differed in genomic variation within these regions. Finally, the correlations of allele frequencies between breeds across the 8 to 22 SNP with the strongest association in HF were around 0.8 between the Dutch native dual-purpose breeds, whereas the correlations between the native breeds and HF were clearly lower and around 0.5. There was no consistent relationship between the reliabilities of genomic prediction for a specific breed and the correlation between the allele frequencies of this breed and HF. In conclusion, most of the genomic variation associated with FA composition in the Dutch dual-purpose breeds appears to be breed-specific. Furthermore, the minor allele frequencies of genes having an effect on the milk FA composition in HF were shown to be much smaller in the breeds MRY, DF, and GWH, especially for the MRY breed.
Maurice-Van Eijndhoven MH
,Bovenhuis H
,Veerkamp RF
,Calus MP
... -
《-》
-
Factors affecting accuracy of estimated effective number of chromosome segments for numerically small breeds.
For numerically small breeds, obtaining a sufficiently large breed-specific reference population for genomic prediction is challenging or simply not possible, but may be overcome by adding individuals from another breed. To prioritize among available breeds, the effective number of chromosome segments (Me ) can be used as an indicator of relatedness between individuals from different breeds. The Me is also an important parameter in determining the accuracy of genomic prediction. The Me can be estimated both within a population and between two populations or breeds, as the reciprocal of the variance of genomic relationships. However, the threshold for number of individuals needed to accurately estimate within or between populations Me is currently unknown. It is also unknown if a discrepancy in number of genotyped individuals in two breeds affects the estimates of Me between populations. In this study, we conducted a simulation that mimics current domestic cattle populations in order to investigate how estimated Me is affected by number of genotyped individuals, single-nucleotide polymorphism (SNP) density and pedigree availability. Our results show that a small sample of 10 genotyped individuals may result in substantial over or underestimation of Me . While estimates of within population Me were hardly affected by SNP density, between population Me values were highly dependent on the number of available SNPs, with higher SNP densities being able to detect more independent chromosome segments. When subtracting pedigree from genomic relationships before computing Me , estimates of within population Me were three to four times higher than estimates with genotypes only; however, between Me estimates remained the same. For accurate estimation of within and between population Me , at least 50 individuals should be genotyped per population. Estimates of within Me were highly affected by whether pedigree was used or not. For within Me , even the smallest SNP density (~11k) resulted in accurate representation of family relationships in the population; however, for between Me , many more markers are needed to capture all independent segments.
Marjanovic J
,Calus MPL
《-》
-
Estimation of inbreeding, between-breed genomic relatedness and definition of sub-populations in red-pied cattle breeds.
Currently, enhancing the collaboration between related breeds is of main importance to increase the competitivity and the sustainability of local breeds. One type of collaboration is the development of an across-breed reference population that will allow a better management of local breeds. For this purpose, the genomic relatedness between the local target breed and possible breeds to be included in the reference population should be estimated. In Europe, there are several local red-pied cattle breeds that would benefit from this kind of collaboration. However, how different red-pied cattle breeds from the Benelux are related to each other and can collaborate is still unclear. The objectives of this study were therefore: (1) to estimate the level of inbreeding of the East Belgian Red and White (EBRW), the Red-Pied of the Ösling (RPO) and Dutch red-pied cattle breeds; (2) to determine the genomic relatedness of several red-pied cattle breeds, with a special focus on two endangered breeds: the EBRW and the RPO, and (3) based on the second objective, to detect animals from other breeds that were genomically close enough to be considered as advantageous in the creation of an across-breed reference population of EBRW or RPO. The estimated inbreeding levels based on runs of homozygosity were relatively low for almost all the studied breeds and especially for the EBRW and RPO. This would imply that inbreeding is currently not an issue in these two endangered breeds and that their sustainability is not threatened by their level of inbreeding. The results from the principal component analysis, the phylogenetic tree and the clustering all highlighted that the EBRW and RPO breeds were included in the genomic continuum of the studied red-pied cattle breeds and can be therefore considered as genomically close to Dutch red-pied cattle breeds, highlighting the possibility of a collaboration between these breeds. Especially, EBRW animals were closely related to Deep Red and Improved Red animals while, to a lesser extent, the RPO animals were closely related to the Meuse-Rhine-Yssel breed. Based on these results, we could use distance measures, based either on the principal component analysis or clustering, to detect animals from Dutch breeds that were genomically closest to the EBRW or RPO breeds. This will finally allow the building of an across-breed reference population for EBRW or RPO for further genomic evaluations, considering these genomically closest animals from other breeds.
Wilmot H
,Druet T
,Hulsegge I
,Gengler N
,Calus MPL
... -
《-》
-
Predicting the effect of reference population on the accuracy of within, across, and multibreed genomic prediction.
Genomic prediction is widely used to select candidates for breeding. Size and composition of the reference population are important factors influencing prediction accuracy. In Holstein dairy cattle, large reference populations are used, but this is difficult to achieve in numerically small breeds and for traits that are not routinely recorded. The prediction accuracy is usually estimated using cross-validation, requiring the full data set. It would be useful to have a method to predict the benefit of multibreed reference populations that does not require the availability of the full data set. Our objective was to study the effect of the size and breed composition of the reference population on the accuracy of genomic prediction using genomic BLUP and Bayes R. We also examined the effect of trait heritability and validation breed on prediction accuracy. Using these empirical results, we investigated the use of a formula to predict the effect of the size and composition of the reference population on the accuracy of genomic prediction. Phenotypes were simulated in a data set containing real genotypes of imputed sequence variants for 22,752 dairy bulls and cows, including Holstein, Jersey, Red Holstein, and Australian Red cattle. Different reference populations were constructed, varying in size and composition, to study within-breed, multibreed, and across-breed prediction. Phenotypes were simulated varying in heritability, number of chromosomes, and number of quantitative trait loci. Genomic prediction was carried out using genomic BLUP and Bayes R. We used either the genomic relationship matrix (GRM) to estimate the number of independent chromosomal segments and subsequently to predict accuracy, or the accuracies obtained from single-breed reference populations to predict the accuracies of larger or multibreed reference populations. Using the GRM overestimated the accuracy; this overestimation was likely due to close relationships among some of the reference animals. Consequently, the GRM could not be used to predict the accuracy of genomic prediction reliably. However, a method using the prediction accuracies obtained by cross-validation using a small, single-breed reference population predicted the accuracy using a multibreed reference population well and slightly overestimated the accuracy for a larger reference population of the same breed, but gave a reasonably close estimate of the accuracy for a multibreed reference population. This method could be useful for making decisions regarding the size and composition of the reference population.
van den Berg I
,Meuwissen THE
,MacLeod IM
,Goddard ME
... -
《-》