Skip to main content
ARS Home » Plains Area » Fort Collins, Colorado » Center for Agricultural Resources Research » Water Management and Systems Research » Research » Publications at this Location » Publication #407525

Research Project: Improving Crop Performance and Precision Irrigation Management in Semi-Arid Regions through Data-Driven Research, AI, and Integrated Models

Location: Water Management and Systems Research

Title: Prediction of chlorophyll-a as an index of harmful algal blooms using machine learning models

Author
item BUSARI, IBRAHIM - Clemson University
item SAHOO, DEB - Clemson University
item Harmel, Daren
item HAGGARD, BRIAN - University Of Arkansas

Submitted to: American Society of Agricultural and Biological Engineers
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 1/12/2024
Publication Date: 3/1/2024
Citation: Busari, I., Sahoo, D., Harmel, R.D., Haggard, B. 2024. Prediction of chlorophyll-a as an index of harmful algal blooms using machine learning models. American Society of Agricultural and Biological Engineers. 2(2):53-61. https://doi.org/10.13031/jnrae.15812.
DOI: https://doi.org/10.13031/jnrae.15812

Interpretive Summary: The complex dynamics of freshwater harmful algal blooms (HABs) necessitate proactive monitoring approaches to mitigate their impacts. The rapid breakthrough in computing prowess and statistical advances is triggering the development of data-driven techniques such as machine learning (ML) models. ML models have been shown in different fields to be instrumental in finding patterns in data usable for explaining relationships in observed data. The findings reveal the potential with ML models, thereby shedding light on crucial factors that necessitate careful deliberation by researchers and policymakers in determining the most suitable approaches to embrace for monitoring HABs.

Technical Abstract: The complex dynamics of freshwater harmful algal blooms (HABs) necessitate proactive monitoring approaches to mitigate their impacts. The rapid breakthrough in computing prowess and statistical advances is triggering the development of data-driven techniques such as machine learning (ML) models. ML models have been shown in different fields to be instrumental in finding patterns in data usable for explaining relationships in observed data. The selected models for this study are regression tree, random forest (RF), multilayer perceptron (MLP), support vector regression (SVR), long short-term memory (LSTM), and gated recurrent unit (GRU) models, with the last two models developed to consider the temporal sequence of obtained water quality datasets. The results showed varying performance with different data splitting ratios, although the RF model outperformed the SVR, MLP and regression tree models. LSTM model predicted temporal dynamics of chlorophyll-a better than GRU, although with more runtime, and showed the potential for developing real-time monitoring of HABs and development of early warning systems. The findings reveal the robustness of the chosen ML models, thereby shedding light on crucial factors that necessitate careful deliberation by researchers and policymakers in determining the most suitable approaches to embrace for monitoring HABs.