Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Apr;211(4):1131-1141.
doi: 10.1534/genetics.119.301859.

Complex Trait Prediction from Genome Data: Contrasting EBV in Livestock to PRS in Humans: Genomic Prediction

Affiliations
Review

Complex Trait Prediction from Genome Data: Contrasting EBV in Livestock to PRS in Humans: Genomic Prediction

Naomi R Wray et al. Genetics. 2019 Apr.

Abstract

In this Review, we focus on the similarity of the concepts underlying prediction of estimated breeding values (EBVs) in livestock and polygenic risk scores (PRS) in humans. Our research spans both fields and so we recognize factors that are very obvious for those in one field, but less so for those in the other. Differences in family size between species is the wedge that drives the different viewpoints and approaches. Large family size achievable in nonhuman species accompanied by selection generates a smaller effective population size, increased linkage disequilibrium and a higher average genetic relationship between individuals within a population. In human genetic analyses, we select individuals unrelated in the classical sense (coefficient of relationship <0.05) to estimate heritability captured by common SNPs. In livestock data, all animals within a breed are to some extent "related," and so it is not possible to select unrelated individuals and retain a data set of sufficient size to analyze. These differences directly or indirectly impact the way data analyses are undertaken. In livestock, genetic segregation variance exposed through samplings of parental genomes within families is directly observable and taken for granted. In humans, this genomic variation is under-recognized for its contribution to variation in polygenic risk of common disease, in both those with and without family history of disease. We explore the equation that predicts the expected proportion of variance explained using PRS, and quantify how GWAS sample size is the key factor for maximizing accuracy of prediction in both humans and livestock. Last, we bring together the concepts discussed to address some frequently asked questions.

Keywords: EBV; GenPred; Genomic Prediction; PRS; estimated breeding values; polygenic risk score; segregation variance; within family variance.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Variance explained in out of sample prediction. Using Equation 1, we assume hM2 = 0.3 associated with common variants of frequency >0.1. In humans, the effective number of markers whose effect sizes are estimated is M∼50,000. Discovery sample GWAS of >1 million people are needed to achieve out-of-sample that achieves R2 approaching the upper limit of hM2. The red line bench marks out of sample prediction for M∼10,000, representative of a species with a smaller effective population size, or if, in humans, we achieve statistical methodologies that allow identification of a smaller number of DNA variants associated with the same hM2.
Figure 2
Figure 2
Increase in milk yield in black and white Holstein cattle since 1957. The mean EBV has increased by 3916 kg or 66 kg per cow per year. The phenotypic and genetic SD of milk yield in 1957 were ∼1200 and ∼600 kg. Hence, the genetic contribution to milk yield has increased by ∼6.5 genetic SD since 1957. Source: Council on Dairy Cattle Breeding (https://queries.uscdcb.com/eval/summary/trend.cfm)

Similar articles

Cited by

References

    1. Abraham G., Tye-Din J. A., Bhalala O. G., Kowalczyk A., Zobel J., et al. , 2014. Accurate and robust genomic prediction of celiac disease using statistical learning. PLoS Genet. 10: e1004137 (erratum: PLoS Genet. 10: e1004374) 10.1371/journal.pgen.1004137 - DOI - PMC - PubMed
    1. Anon2019. Chickenomics: how chicken became the rich world’s most popular meat. Economist Print edition, January 19, 2019.
    1. Beaumont R. N., Warrington N. M., Cavadino A., Tyrrell J., Nodzenski M., et al. , 2018. Genome-wide association study of offspring birth weight in 86 577 women identifies five novel loci and highlights maternal genetic effects that are independent of fetal genetics. Hum. Mol. Genet. 27: 742–756. 10.1093/hmg/ddx429 - DOI - PMC - PubMed
    1. Bell J., 2004. Predicting disease using genomics. Nature 429: 453–456. 10.1038/nature02624 - DOI - PubMed
    1. Bovine HapMap Consortium, . Gibbs R. A., Taylor J. F., Van Tassell C. P., Barendse W., et al. , 2009. Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science 324: 528–532. 10.1126/science.1167936 - DOI - PMC - PubMed

Publication types