Skip to main content
ARS Home » Research » Publications at this Location » Publication #251424

Title: Imputation of Missing Genotypes From Sparse to High Density Using Long-Range Phasing

Author
item DAETWYLER, H - Department Of Primary Industries
item Wiggans, George
item HAYES, B - Department Of Primary Industries
item WOLLIAMS, J - Roslin Institute
item GODDARD, M - Department Of Primary Industries

Submitted to: World Congress of Genetics Applied in Livestock Production
Publication Type: Proceedings
Publication Acceptance Date: 6/1/2010
Publication Date: 8/1/2010
Citation: Daetwyler, H.D., Wiggans, G.R., Hayes, B.J., Wolliams, J.A., Goddard, M.E. 2010. Imputation of Missing Genotypes From Sparse to High Density Using Long-Range Phasing. World Congress of Genetics Applied in Livestock Production. Proc. 9th World Congr. Genet. Appl. Livest. Prod., Leipzig, Germany, Aug. 1–6, 4 pp.

Interpretive Summary: High density genotypes can be imputed from low density ones by determining haplotypes and tracing inheritance of long chromosome segments from common ancestors. A long-range phasing algorithm was developed to phase whole chromosomes and simultaneously impute a large number of missing markers. It was tested by imputing markers in sparsely genotyped individuals with many missing genotypes. The reduction in accuracy of genomic evaluation with imperfectly imputed and sparse data was moderate when compared to accuracy from full density. Imputation of missing genotypes from sparse to high density is feasible.

Technical Abstract: Related individuals in a population share long chromosome segments which trace to a common ancestor. We describe a long-range phasing algorithm that makes use of this property to phase whole chromosomes and simultaneously impute a large number of missing markers. We test our method by imputing markers in sparsely genotyped individuals with many missing genotypes. Futhermore, we investigate the reduction in accuracy of genomic evaluation with imperfectly imputed and sparse data. We show that imputation of missing genotypes from sparse to high density is feasible. Our results demonstrate that the reduction in genomic evaluation accuracy from imperfectly imputed marker data is moderate when compared to accuracy from full density.