Skip to main content
ARS Home » Northeast Area » Beltsville, Maryland (BARC) » Beltsville Agricultural Research Center » Sustainable Perennial Crops Laboratory » Research » Publications at this Location » Publication #414293

Research Project: Genotypic Characterization of Genetic Resources for Cacao, Coffee, and Other Tropical Perennial Crops Economically Important to the United States

Location: Sustainable Perennial Crops Laboratory

Title: From seed shape to genetics: analyzing medicago spp. seed morphology using gwas and machine learning

Author
item BOTKIN, JACOB - University Of Minnesota
item MEDINA, CESAR - University Of Minnesota
item Park, Sunchung - Sun
item POUDEL, KABITA - University Of Minnesota
item CHA, MINHYEOK - Korea University
item LEE, YOONJUNG - University Of Minnesota
item Prom, Louis
item Curtin, Shaun
item Xu, Zhanyou
item Ahn, Ezekiel

Submitted to: Scientific Reports
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 7/14/2024
Publication Date: 7/30/2024
Citation: Botkin, J.R., Medina, C.A., Park, S., Poudel, K., Cha, M., Lee, Y., Prom, L.K., Curtin, S.J., Xu, Z., Ahn, E.J. 2024. From seed shape to genetics: analyzing medicago spp. seed morphology using gwas and machine learning. Scientific Reports. 14:17588. https://doi.org/10.1038/s41598-024-67790-4.
DOI: https://doi.org/10.1038/s41598-024-67790-4

Interpretive Summary: Alfalfa (Medicago sativa L.) is a perennial forage cash crop with extensive economic and agronomic importance due to its high nutritional value. An essential part of alfalfa research includes the production of high-quality alfalfa seeds, but the fundamental genetic architecture of seed morphology in alfalfa is relatively unexplored. This study investigated the diverse morphological variations in alfalfa seeds, focusing on size, shape, and color. With a combination of robust statistical analysis and machine learning models, candidate genes potentially confer significant morphological differences found in this study, providing valuable insights into the underlying genetic diversity, which can be applied in breeding strategies to develop improved alfalfa cultivars with desirable agronomic traits.

Technical Abstract: Plant seed morphology is shaped by multiple components including genetic, physiological, and ecological components that highly affect yield, quality, and market price. The importance of Alfalfa (Medicago sativa L.) is often emphasized by calling it the "Queen of the forages." Given this great importance, understanding the seed morphology of alfalfa was imperative. To unravel the morphological characteristics and elucidate the genetic bases of seed morphology in alfalfa, we screened 318 lines, including 244 lines of alfalfa and 28 other lines of alfalfa relatives, for 11 seed morphologies associated with size, shape, and color. Statistical analysis was applied to identify potential relationships among the 11 morphological traits. Based on the phenotypic data of M. sativa subsp. sativa, a Genome-wide association study (GWAS) was conducted by using publicly available single nucleotide polymorphisms (SNPs). When mapped to the reference Medicago truncatula genome, multiple candidate genes potentially associated with alfalfa seed morphology were identified (threshold of 1E-04). Genes in proximity to top SNPs associated with seed morphology traits included CPR1, MON1, a PPR protein, and Wun1. Machine learning models were utilized to validate the GWAS results, as well as identify additional marker-trait associations across the whole genome and within coding sequences. Along with the strong correlations observed between the seed morphological traits, these candidate genes will play important roles in understanding the genetic basis of seed morphology in Medicago spp.