Skip to main content
ARS Home » Southeast Area » Raleigh, North Carolina » Soybean and Nitrogen Fixation Research » Research » Publications at this Location » Publication #386761

Research Project: Exploiting Genetic Diversity through Genomics, Plant Physiology, and Plant Breeding to Increase Competitiveness of U.S. Soybeans in Global Markets

Location: Soybean and Nitrogen Fixation Research

Title: Functional annotation of proteins for signaling network inference in non-model species

Author
item VAN DEN BROECK, LISA - North Carolina State University
item BHOSALE, DINESH KIRAN - North Carolina State University
item SONG, KUNCHENG - North Carolina State University
item DE LIMA, CASSIO F.F. - Ghent University
item ASHLEY, MICHAEL - North Carolina State University
item ZHU, TINGTING - Ghent University
item ZHU, SHANSHUO - Ghent University
item VAN DE COTTE, BRIGETTE - Ghent University
item NEYT, PIA - Ghent University
item Ortiz, Anna
item Sikes, Tiffany
item APER, JONAS - Flanders Research Institute For Agriculture
item LOOTENS, PETER - Flanders Research Institute For Agriculture
item Locke, Anna
item DE SMET, IVE - Ghent University
item SOZZANI, ROSANGELA - North Carolina State University

Submitted to: Nature Communications
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 5/30/2023
Publication Date: 8/3/2023
Citation: Van Den Broeck, L., Bhosale, D., Song, K., De Lima, C., Ashley, M., Zhu, T., Zhu, S., Van De Cotte, B., Neyt, P., Ortiz, A.C., Sikes, T.R., Aper, J., Lootens, P., Locke, A.M., De Smet, I., Sozzani, R. 2023. Functional annotation of proteins for signaling network inference in non-model species. Nature Communications. 14:4654. https://doi.org/10.1038/s41467-023-40365-z.
DOI: https://doi.org/10.1038/s41467-023-40365-z

Interpretive Summary: A new neural network algorithm called PF-NET was developed to classify proteins, and this new algorithm requires less prior information than other commonly used algorithms. PF-NET identified phosphatase proteins in a model plant species that have been experimentally validated but were not correctly identified by the older, more commonly used protein classification algorithm. PF-NET classified the soybean kinase and phosphatase protein families, which are important for stress signaling. These protein classifications were then used to help determine the protein signaling network that regulates cold stress responses in soybean seedlings. We identified important protein regulators of soybean temperature responses, which are important targets for future experiments and could be candidates for improving soybean temperature stress tolerance.

Technical Abstract: Molecular biology aims to understand the molecular basis of cellular responses, unravel dynamic regulatory networks, and model complex biological systems. However, these studies remain challenging in non-model species as a result of poor functional annotation of regulatory proteins, like kinases or phosphatases. To overcome this limitation, we developed a multi-layer neural network that annotates proteins by determining functionality directly from the protein sequence. We annotated the kinases and phosphatases in the non-model species, Glycine max (soybean), achieving a prediction sensitivity of up to 97%. To demonstrate the applicability, we used our functional annotations in combination with Bayesian network principles to predict signaling cascades using time series phosphoproteomics. We shed light on phosphorylation cascades in soybean seedlings upon cold treatment and identified Glyma.10G173000 (TOI5) and Glyma.19G007300 (TOT3) as predicted key temperature response regulators in soybean. Importantly, the network inference does not rely upon known upstream kinases, kinase motifs, or protein interaction data, enabling de novo identification of kinase-substrate interactions. In addition to high accuracy and strong generalization, we showed that our functional prediction neural network is scalable to other model and non-model species, including Oryza sativa (rice), Zea mays (maize), Sorghum bicolor (sorghum), and Triticum aestivum (wheat). Taking together, we demonstrated a data-driven systems biology approach for non-model species leveraging our predicted upstream kinases and phosphatases.