Skip to main content
ARS Home » Pacific West Area » Albany, California » Western Regional Research Center » Crop Improvement and Genetics Research » Research » Publications at this Location » Publication #386728

Research Project: New Genetic Resources for Breeding Better Wheat and Bioenergy Crops

Location: Crop Improvement and Genetics Research

Title: Optical maps refine the bread wheat Triticum aestivum vc. Chinese Spring genome assembly

Author
item ZHU, TINGTING - University Of California, Davis
item WANG, LE - Clermont Universite, Universite D'Auvergne, Unite De Nutrition Humaine
item RIMBERT, HELENE - Clermont Universite, Universite D'Auvergne, Unite De Nutrition Humaine
item RODRIGUEZ, JUAN - University Of California, Davis
item DEAL, KARIN - University Of California, Davis
item DE OLIVEIRA, ROMAIN - Clermont Universite, Universite D'Auvergne, Unite De Nutrition Humaine
item CHOULET, FREDERIC - Clermont Universite, Universite D'Auvergne, Unite De Nutrition Humaine
item KEEBLE-GAGNERE, GABRIEL - Agribio, Centre For Agribioscience
item TIBBITS, JOSQUIN - Agribio, Centre For Agribioscience
item ROGERS, JANE - International Wheat Genome Sequencing Consortium (IWGSC)
item EVERSOLE, KELLYE - International Wheat Genome Sequencing Consortium (IWGSC)
item APPELS, RUDI - Agribio, Centre For Agribioscience
item Gu, Yong
item MASCHER, MARTIN - Leibniz Institute Of Plant Genetics And Crop Plant Research
item DVORAK, JAN - University Of California, Davis
item LUO, MING-CHENG - University Of California, Davis

Submitted to: The Plant Journal
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 4/19/2021
Publication Date: 4/22/2021
Citation: Zhu, T., Wang, L., Rimbert, H., Rodriguez, J., Deal, K.R., De Oliveira, R., Choulet, F., Keeble-Gagnere, G., Tibbits, J., Rogers, J., Eversole, K., Appels, R., Gu, Y.Q., Mascher, M., Dvorak, J., Luo, M. 2021. Optical maps refine the bread wheat Triticum aestivum vc. Chinese Spring genome assembly. The Plant Journal. 107(1):303-314. https://doi.org/10.1111/tpj.15289.
DOI: https://doi.org/10.1111/tpj.15289

Interpretive Summary: Wheat provides about one-fifth of the calories and proteins consumed by humans and is annually planted on an area larger than any other crop. The improved genome sequence and updated annotation will enhance the utility of the Ae. tauschii genome sequence for wheat research and breeding. The development of genomic resources, including a reference-quality genome sequence, is critical for accelerating genetic improvement of wheat. Due to the large size of wheat genome (16 Gb) and polyploid nature, sequencing wheat genome to generate a reference-quality genome assembly has been challenging. The first reference sequence for bread wheat cv. Chinese Spring (RefSeq v1.0) was published in 2018. Here, we report a new release of the bread wheat reference genome assembly (RefSeq v2.1). RefSeq v2.1 was generated by refining the RefSeq v1.0 using the BioNano optical mapping technology and Pacific Biosciences long reads. This improved wheat genome assembly is a major advance for applied and basic application.

Technical Abstract: Until recently, achieving a reference-quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5-Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair, and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high-confidence genes on pseudomolecules have increased from 105,319 to 105,534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.