Location: Weed and Insect Biology Research
Title: Assembly and analysis of sequence from a spring and winter type Camelina sativa by whole genome PacBio Hifi technologiesAuthor
ONTANO, ANDREW - University Of Georgia | |
Dobrin, Barbara | |
Smith, Timothy - Tim | |
ABERNATHY, BRIAN - University Of Georgia | |
Sthapit Kandel, Jinita | |
SHAIKH, TM - Missouri State University | |
RAHMAN, MUKHLESUR - North Dakota State University | |
Anderson, James | |
Vaughn, Justin | |
Horvath, David |
Submitted to: Industrial Crops and Products
Publication Type: Peer Reviewed Journal Publication Acceptance Date: 7/29/2024 Publication Date: 8/8/2024 Citation: Ontano, A., Dobrin, B.H., Smith, T.P., Abernathy, B., Sthapit Kandel, J., Shaikh, T., Rahman, M., Anderson, J.V., Vaughn, J.N., Horvath, D.P. 2024. Assembly and analysis of sequence from a spring and winter type Camelina sativa by whole genome PacBio Hifi technologies. Industrial Crops and Products. 221:119346. https://doi.org/10.1016/j.indcrop.2024.119346. DOI: https://doi.org/10.1016/j.indcrop.2024.119346 Interpretive Summary: Camelina sativa is a new biofuel crop and potential cash cover-crop suitable for the northern great plains. However, breeding and research of camelina has been hampered by the lack of a high quality genome assembly and lack of genetic tools to study agronomic traits. To solve this problem, new sequencing technologies that are capable of producing contiguous chromosome length sequences were used to sequence two camelina varieties that differed in the ability to withstand freezing temperatures and also in their ability to flower without a winter chilling period. Additionally, these two varieties were crossed and the progeny were sequenced as well. These sequences will be a powerful tool to begin assessing the mechanisms that allow early flowering and freezing tolerance and assist breeders and other scientists to improve many agronomic traits in this new crop. Technical Abstract: Camelina sativa is an emerging oilseed crop with potential for intercropping with more traditional cash crops. Until very recently, the only reference genome was produced by short read technologies and was considerably fragmented. We developed a cross between the spring biotype CO46 and the winter biotype Joelle that showed significant differences in their vernalization requirements and their freezing tolerance after cold acclimation. The resulting offspring were selfed and the F2 lines were advanced by single see descent to the F6:7 stage. To facilitate mapping of traits in this population we performed Long read sequencing using PacBio HiFi technology. Here we report the assembly and annotation of these two parental lines. Both assemblies formed 20 chromosomal units with a genome size of 679,629,983 and 559,281,586 bases for CO46 nd Joelle respectively. Assessment of completeness by BUSCO analysis indicated the CO46 and Joelle genomes were 99.6 and 99.4% complete respectively. Gene annotation however indicated that the Joelle assembly had a small number of missing gene sequences, and this was further shown by the presence of gaps when the assemblies were aligned and compared. Skim-sequencing of the F6:7 recombinant inbred lines resulted in 823,142 markers that mapped to variants identified by comparisons of the parental genome sequences. These sequences provide a valuable resource for breeders seeking to improve the food and industrial attributes of camelina. |