Sub-monthly time scale forecasting of harmful algal blooms intensity in Lake Erie using remote sensing and machine learning

Sci Total Environ. 2023 Nov 20:900:165781. doi: 10.1016/j.scitotenv.2023.165781. Epub 2023 Jul 25.

Abstract

Harmful algal blooms of cyanobacteria (CyanoHAB) have emerged as a serious environmental concern in large and small water bodies including many inland lakes. The growth dynamics of CyanoHAB can be chaotic at very short timescales but predictable at coarser timescales. In Lake Erie, cyanobacteria blooms occur in the spring-summer months, which, at annual timescale, are controlled by the total spring phosphorus (TP) load into the lake. This study aimed to forecast CyanoHAB cell count at sub-monthly (e.g., 10-day) timescales. Satellite-derived cyanobacterial index (CI) was used as a surrogate measure of CyanoHAB cell count. CI was related to the in-situ measured chlorophyll-a and phycocyanin concentrations and Microcystis biovolume in the lake. Using available data on environmental and lake hydrodynamics as predictor variables, four statistical models including LASSO (Least Absolute Shrinkage and Selection Operator), artificial neural network (ANN), random forest (RF), and an ensemble average of the three models (EA) were developed to forecast CI at 10-, 20- and 30-day lead times. The best predictions were obtained by using the RF and EA algorithms. It was found that CyanoHAB growth dynamics, even at sub-monthly timescales, are determined by coarser timescale variables. Meteorological, hydrological, and water quality variations at sub-monthly timescales exert lesser control over CyanoHAB growth dynamics. Nutrients discharged into the lake from rivers other than the Maumee River were also important in explaining the variations in CI. Surprisingly, to forecast CyanoHAB cell count, average solar radiation at 30 to 60 days lags were found to be more important than the average solar radiation at 0 to 30 days lag. Other important variables were TP discharged into the lake during the previous 10 years, TP and TKN discharged into the lake during the previous 120 days, the average water level at 10-day lag and 60-day lag.

Keywords: Forecasting; Freshwater lakes; Harmful algal blooms; Lake Erie; Machine learning.

MeSH terms

  • Chlorophyll A
  • Cyanobacteria*
  • Harmful Algal Bloom*
  • Lakes / microbiology
  • Machine Learning
  • Phosphorus
  • Remote Sensing Technology

Substances

  • Chlorophyll A
  • Phosphorus