-
Genomic prediction in French Charolais beef cattle using high-density single nucleotide polymorphism markers.
The objective of the study was to develop a genomic evaluation for French beef cattle breeds and assess accuracy and bias of prediction for different genomic selection strategies. Based on a reference population of 2,682 Charolais bulls and cows, genotyped or imputed to a high-density SNP panel (777K SNP), we tested the influence of different statistical methods, marker densities (50K versus 777K), and training population sizes and structures on the quality of predictions. Four different training sets containing up to 1,979 animals and a unique validation set of 703 young bulls only known on their individual performances were formed. BayesC method had the largest average accuracy compared to genomic BLUP or pedigree-based BLUP. No gain of accuracy was observed when increasing the density of markers from 50K to 777K. For a BayesC model and 777K SNP panels, the accuracy calculated as the correlation between genomic predictions and deregressed EBV (DEBV) divided by the square root of heritability was 0.42 for birth weight, 0.34 for calving ease, 0.45 for weaning weight, 0.52 for muscular development, and 0.27 for skeletal development. Half of the training set constituted animals having only their own performance recorded, whose contribution only represented 5% of the accuracy. Using DEBV as a response brought greater accuracy than using EBV (+5% on average). Considering a residual polygenic component strongly reduced bias for most of the traits. The optimal percentage of polygenic variance varied across traits. Among the methodologies tested to implement genomic selection in the French Charolais beef cattle population, the most accurate and less biased methodology was to analyze DEBV under a BayesC strategy and a residual polygenic component approach. With this approach, a 50K SNP panel performed as well as a 777K panel.
Gunia M
,Saintilan R
,Venot E
,Hozé C
,Fouilloux MN
,Phocas F
... -
《-》
-
Efficiency of multi-breed genomic selection for dairy cattle breeds with different sizes of reference population.
Single-breed genomic selection (GS) based on medium single nucleotide polymorphism (SNP) density (~50,000; 50K) is now routinely implemented in several large cattle breeds. However, building large enough reference populations remains a challenge for many medium or small breeds. The high-density BovineHD BeadChip (HD chip; Illumina Inc., San Diego, CA) containing 777,609 SNP developed in 2010 is characterized by short-distance linkage disequilibrium expected to be maintained across breeds. Therefore, combining reference populations can be envisioned. A population of 1,869 influential ancestors from 3 dairy breeds (Holstein, Montbéliarde, and Normande) was genotyped with the HD chip. Using this sample, 50K genotypes were imputed within breed to high-density genotypes, leading to a large HD reference population. This population was used to develop a multi-breed genomic evaluation. The goal of this paper was to investigate the gain of multi-breed genomic evaluation for a small breed. The advantage of using a large breed (Normande in the present study) to mimic a small breed is the large potential validation population to compare alternative genomic selection approaches more reliably. In the Normande breed, 3 training sets were defined with 1,597, 404, and 198 bulls, and a unique validation set included the 394 youngest bulls. For each training set, estimated breeding values (EBV) were computed using pedigree-based BLUP, single-breed BayesC, or multi-breed BayesC for which the reference population was formed by any of the Normande training data sets and 4,989 Holstein and 1,788 Montbéliarde bulls. Phenotypes were standardized by within-breed genetic standard deviation, the proportion of polygenic variance was set to 30%, and the estimated number of SNP with a nonzero effect was about 7,000. The 2 genomic selection (GS) approaches were performed using either the 50K or HD genotypes. The correlations between EBV and observed daughter yield deviations (DYD) were computed for 6 traits and using the different prediction approaches. Compared with pedigree-based BLUP, the average gain in accuracy with GS in small populations was 0.057 for the single-breed and 0.086 for multi-breed approach. This gain was up to 0.193 and 0.209, respectively, with the large reference population. Improvement of EBV prediction due to the multi-breed evaluation was higher for animals not closely related to the reference population. In the case of a breed with a small reference population size, the increase in correlation due to multi-breed GS was 0.141 for bulls without their sire in reference population compared with 0.016 for bulls with their sire in reference population. These results demonstrate that multi-breed GS can contribute to increase genomic evaluation accuracy in small breeds.
Hozé C
,Fritz S
,Phocas F
,Boichard D
,Ducrocq V
,Croiseau P
... -
《-》
-
Genome-wide association study and prediction of genomic breeding values for fatty-acid composition in Korean Hanwoo cattle using a high-density single-nucleotide polymorphism array.
Genomic selection using high-density single-nucleotide polymorphism (SNP) markers is used in dairy and beef cattle breeds to accurately estimate genomic breeding values and accelerate genetic improvement by enabling selection of animals with high genetic merit. This genome-wide association study (GWAS) aimed to identify genetic variants associated with beef fatty-acid composition (FAC) traits and to evaluate the accuracy of genomic predictions (GPs) for those traits using genomic best linear unbiased prediction (GBLUP), pedigree BLUP (PBLUP), and BayesR models. Samples of the longissimus dorsi muscle of 965 thirty-month-old Hanwoo steers (progeny of 73 proven bulls) were used to investigate 14 FAC traits. Animals were genotyped or imputed using two bovine SNP platforms (50K and 777K), and after quality control, 38,715 (50K) and 633,448 (777K) SNPs were subjected to GWAS and GP study using a cross-validation scheme. SNP-based heritability estimates were moderate to high (0.25 to 0.47) for all studied traits, with some exceptions for polyunsaturated fatty acids. Association analysis revealed that 19 SNPs in BTA19 (98.7 kb) were significantly associated (P < 7.89 × 10-8) with C14:0 and C18:1n-9; these SNPs were in the fatty-acid synthase (FASN) and coiled-coil domain-containing 57 (CCDC57) genes. BayesR analysis revealed that 0.41 to 0.78% of the total SNPs (n = 2,571 to 4,904) explained almost all of the genetic variance; the majority of the SNPs (>99%) had negligible effects, suggesting that the FAC traits were polygenic. Genome partitioning analysis indicated mostly nonlinear and weak correlations between the variance explained by each chromosome and its length, which also reflected the considerable contributions of relatively few genes. The prediction accuracy of breeding values for FAC traits varied from low to high (0.25 to 0.57); the estimates using the GBLUP and BayesR methods were superior to those obtained by the PBLUP method. The BayesR method performed similarly to GBLUP for most of the studied traits but substantially better for those traits that were controlled by SNPs with large effects; this was supported by the GWAS results. In addition, the predictive abilities of the 50K and 777K SNP arrays were almost similar; thus, both are suitable for GP in Hanwoo cattle. In conclusion, this study provides important insight into the genetic architecture and predictive ability of FAC traits in Hanwoo cattle. Our findings could be used in selection and breeding programs to promote production of meat with enhanced nutritional value.
Bhuiyan MSA
,Kim YK
,Kim HJ
,Lee DH
,Lee SH
,Yoon HB
,Lee SH
... -
《-》
-
Accuracy of genomic predictions for feed efficiency traits of beef cattle using 50K and imputed HD genotypes.
The accuracy of genomic predictions can be used to assess the utility of dense marker genotypes for genetic improvement of beef efficiency traits. This study was designed to test the impact of genomic distance between training and validation populations, training population size, statistical methods, and density of genetic markers on prediction accuracy for feed efficiency traits in multibreed and crossbred beef cattle. A total of 6,794 beef cattle data collated from various projects and research herds across Canada were used. Illumina BovineSNP50 (50K) and imputed Axiom Genome-Wide BOS 1 Array (HD) genotypes were available for all animals. The traits studied were DMI, ADG, and residual feed intake (RFI). Four validation groups of 150 animals each, including Angus (AN), Charolais (CH), Angus-Hereford crosses (ANHH), and a Charolais-based composite (TX) were created by considering the genomic distance between pairs of individuals in the validation groups. Each validation group had 7 corresponding training groups of increasing sizes ( = 1,000, 1,999, 2,999, 3,999, 4,999, 5,998, and 6,644), which also represent increasing average genomic distance between pairs of individuals in the training and validations groups. Prediction of genomic estimated breeding values (GEBV) was performed using genomic best linear unbiased prediction (GBLUP) and Bayesian method C (BayesC). The accuracy of genomic predictions was defined as the Pearson's correlation between adjusted phenotype and GEBV (), unless otherwise stated. Using 50K genotypes, the highest average achieved in purebreds (AN, CH) was 0.41 for DMI, 0.34 for ADG, and 0.35 for RFI, whereas in crossbreds (ANHH, TX) it was 0.38 for DMI, 0.21 for ADG, and 0.25 for RFI. Similarly, when imputed HD genotypes were applied in purebreds (AN, CH), the highest average was 0.14 for DMI, 0.15 for ADG, and 0.14 for RFI, whereas in crossbreds (ANHH, TX) it was 0.38 for DMI, 0.22 for ADG, and 0.24 for RFI. The of GBLUP predictions were greatly reduced with increasing genomic average distance compared to those from BayesC predictions. The results indicate that 50K genotypes, used with BayesC, are more effective for predicting GEBV in purebred cattle. Imputed HD genotypes found utility when dealing with composites and crossbreds. Formulation of a fairly large training set for genomic predictions in beef cattle should consider the genomic distance between the training and target populations.
Lu D
,Akanno EC
,Crowley JJ
,Schenkel F
,Li H
,De Pauw M
,Moore SS
,Wang Z
,Li C
,Stothard P
,Plastow G
,Miller SP
,Basarab JA
... -
《-》
-
Extent of linkage disequilibrium, consistency of gametic phase, and imputation accuracy within and across Canadian dairy breeds.
Genomic selection requires a large reference population to accurately estimate single nucleotide polymorphism (SNP) effects. In some Canadian dairy breeds, the available reference populations are not large enough for accurate estimation of SNP effects for traits of interest. If marker phase is highly consistent across multiple breeds, it is theoretically possible to increase the accuracy of genomic prediction for one or all breeds by pooling several breeds into a common reference population. This study investigated the extent of linkage disequilibrium (LD) in 5 major dairy breeds using a 50,000 (50K) SNP panel and 3 of the same breeds using the 777,000 (777K) SNP panel. Correlation of pair-wise SNP phase was also investigated on both panels. The level of LD was measured using the squared correlation of alleles at 2 loci (r(2)), and the consistency of SNP gametic phases was correlated using the signed square root of these values. Because of the high cost of the 777K panel, the accuracy of imputation from lower density marker panels [6,000 (6K) or 50K] was examined both within breed and using a multi-breed reference population in Holstein, Ayrshire, and Guernsey. Imputation was carried out using FImpute V2.2 and Beagle 3.3.2 software. Imputation accuracies were then calculated as both the proportion of correct SNP filled in (concordance rate) and allelic R(2). Computation time was also explored to determine the efficiency of the different algorithms for imputation. Analysis showed that LD values >0.2 were found in all breeds at distances at or shorter than the average adjacent pair-wise distance between SNP on the 50K panel. Correlations of r-values, however, did not reach high levels (<0.9) at these distances. High correlation values of SNP phase between breeds were observed (>0.94) when the average pair-wise distances using the 777K SNP panel were examined. High concordance rate (0.968-0.995) and allelic R(2) (0.946-0.991) were found for all breeds when imputation was carried out with FImpute from 50K to 777K. Imputation accuracy for Guernsey and Ayrshire was slightly lower when using the imputation method in Beagle. Computing time was significantly greater when using Beagle software, with all comparable procedures being 9 to 13 times less efficient, in terms of time, compared with FImpute. These findings suggest that use of a multi-breed reference population might increase prediction accuracy using the 777K SNP panel and that 777K genotypes can be efficiently and effectively imputed using the lower density 50K SNP panel.
Larmer SG
,Sargolzaei M
,Schenkel FS
《-》