Location: Plant, Soil and Nutrition Research
Title: Genetic variation standardization (rsIDs) enhances Trait-Driven Interoperability and Genomic Function TransferAuthor
WEI, SHARON - Cold Spring Harbor Laboratory | |
TELLO-RUIZ, MARCELA - Cold Spring Harbor Laboratory | |
KUMAR, VIVEK - Cold Spring Harbor Laboratory | |
OLSON, ANDREW - Cold Spring Harbor Laboratory | |
CHOUGULE, KAPEEL - Cold Spring Harbor Laboratory | |
Ware, Doreen |
Submitted to: Meeting Abstract
Publication Type: Abstract Only Publication Acceptance Date: 4/20/2024 Publication Date: N/A Citation: N/A Interpretive Summary: Technical Abstract: The Reference SNP cluster ID (rsID) is a unique identifier for groups of genetic variations (GVs) co-located at specific positions in the genome. It facilitates standardized referencing across databases, studies, and publications, primarily in human research for mutation identification and data integration. However, its use in plant research has been limited due to inadequate support. With the maturation of the European Variation Archive (EVA), hundreds of millions of rsIDs have been assigned to plant genomes, including agriculturally significant ones hosted in Gramene (https://www.gramene.org/) and SorghumBase (https://sorghumbase.org/). This development made GV data aggregation and marker-based breeding more feasible. The plant-centered genomic database and browser Gramene has embraced this by ingesting rsIDs from EVA release 5, linking them to QTL, phenotype, and germplasm data. Currently, four crop genomes have integrated rsIDs: Sorghum (41M), Rice (27M), Maize (47M), and Grape (0.3M). For SorghumBase, we also imported GWAS study results from the Sorghum Association Panel population and associated the phenotypes/traits with rsIDs. As the number of sequenced pan-genomes increases, computationally calling GVs on each accession's genome becomes impractical. Instead, mapping rsIDs from the reference genome to pan-genomes proves more feasible and efficient. Gramene tested this approach by implementing the variation mapping pipeline from EVA, allowing accurate mapping across different assemblies of the same genome and genomes of different accessions from the same or closely related species. This method's successful implementation holds significant potential for breeding initiatives. Gramene's efforts are supported by funding from USDA ARS (8062-21000-041-00D). |