A Common Dataset for Genomic Analysis of Livestock Populations
Open Access
- 1 April 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in G3 Genes|Genomes|Genetics
- Vol. 2 (4), 429-435
- https://doi.org/10.1534/g3.111.001453
Abstract
Although common datasets are an important resource for the scientific community and can be used to address important questions, genomic datasets of a meaningful size have not generally been available in livestock species. We describe a pig dataset that PIC (a Genus company) has made available for comparing genomic prediction methods. We also describe genomic evaluation of the data using methods that PIC considers best practice for predicting and validating genomic breeding values, and we discuss the impact of data structure on accuracy. The dataset contains 3534 individuals with high-density genotypes, phenotypes, and estimated breeding values for five traits. Genomic breeding values were calculated using BayesB, with phenotypes and de-regressed breeding values, and using a single-step genomic BLUP approach that combines information from genotyped and un-genotyped animals. The genomic breeding value accuracy increased with increased trait heritability and with increased relationship between training and validation. In nearly all cases, BayesB using de-regressed breeding values outperformed the other approaches, but the single-step evaluation performed only slightly worse. This dataset was useful for comparing methods for genomic prediction using real data. Our results indicate that validation approaches accounting for relatedness between populations can correct for potential overestimation of genomic breeding value accuracies, with implications for genotyping strategies to carry out genomic selection programs.Keywords
This publication has 23 references indexed in Scilit:
- A phasing and imputation method for pedigreed populations that results in a single-stage genomic evaluationGenetics Selection Evolution, 2012
- Mouse genomic variation and its effect on phenotypes and gene regulationNature, 2011
- Different models of genetic variation and their effect on genomic evaluationGenetics Selection Evolution, 2011
- Genome-wide prediction of discrete traits using bayesian regressions and machine learningGenetics Selection Evolution, 2011
- Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic informationGenetics Selection Evolution, 2011
- A map of human genome variation from population-scale sequencingNature, 2010
- Dynamics of long-term genomic selectionGenetics Selection Evolution, 2010
- The impact of genetic relationship information on genomic breeding values in German Holstein cattleGenetics Selection Evolution, 2010
- Deregressing estimated breeding values and weighting information for genomic regression analysesGenetics Selection Evolution, 2009
- The International HapMap ProjectNature, 2003