Skip to main content
ARS Home » Research » Publications at this Location » Publication #265707

Title: A high-resolution cattle CNV map by population-scale genome sequencing

Author
item Bickhart, Derek
item HOU, YALI - University Of Maryland
item Liu, Ge - George
item Schroeder, Steven - Steve
item CAN, ALKAN - University Of Washington
item MARIA, CARDONE - University Of Bari
item MATUKUMALLI, L.K. - George Mason University
item SONG, JIUZHOU - University Of Maryland
item SCHNABEL, ROBERT - University Of Missouri
item VENTURA, MARIO - University Of Bari
item TAYLOR, JEREMY - University Of Missouri
item EICHLER, EVAN - University Of Washington
item Sonstegard, Tad
item Van Tassell, Curtis - Curt
item ILLUMINA BOVINE HD, CONSORTIUM - Collaborator

Submitted to: BARC Poster Day
Publication Type: Abstract Only
Publication Acceptance Date: 3/9/2011
Publication Date: 4/27/2011
Citation: Bickhart, D.M., Hou, Y., Liu, G., Schroeder, S.G., Can, A., Maria, C., Matukumalli, L., Song, J., Schnabel, R., Ventura, M., Taylor, J., Eichler, E., Sonstegard, T.S., Van Tassell, C.P., Illumina Bovine Hd, C. 2011. A high-resolution cattle CNV map by population-scale genome sequencing. BARC Poster Day [abstract]. No. 5.

Interpretive Summary:

Technical Abstract: Copy Number Variations (CNVs) are common genomic structural variations that have been linked to human diseases and phenotypic traits. CNVs represent an important type of genetic variation among cattle breeds and even individual animals; however, only low-resolution maps of cattle CNVs currently exist. We constructed a draft, high-resolution map of cattle CNVs based on whole genome sequencing data from 70 individuals by integrating complementary structural variant (SV) discovery approaches (CGH and high density SNP arrays) and experimental validations (PCR and FISH). Our sequence data comprised a total of ~90× coverage of the reference genome (~ 270 Gb) distributed among 5 taurine and 3 indicine breeds. To identify CNVs from these sequence data, we adopted and fine tuned two well-established computational approaches: read depth (RD; using WSSD) and read pair (RP; using VariationHunter) analyses. Our initial RD results identified 2,789 duplications/insertions and 156 deletions while the RP results revealed multiple deletions. Discovered CNVs had an average length of 20 kb (RD) and represented ~70 megabases of the cattle reference genome. We integrated these new SVs with the CNV call sets made from CGH array and SNP array based methods. To confirm our combined CNV dataset, we created new CGH arrays and performed PCR assays. Our results confirmed previously published CNV regions and validated the new candidate CNVs. These CNVs span multiple annotated cattle genes, which are significantly enriched for specific biological functions such as immunity, lactation, reproduction and rumination. In summary, the individually constructed cattle CNV maps provide a valuable resource for future cattle genomic research.