St. Marys river is a key waterway extending from Brush Point in the southeast corner of Lake Superior to the northwest section of Lake Huron. The massive Soo Locks and dredged channels in the river support navigation activities including commercial shipping. This navigational lock system is closed annually from late January to late March due to the development of ice cover. However, a notable year-to-year variability in ice condition exists during the transition periods posing a challenge to safe and effective planning of shipping and icebreaking operations around the region. Consequently, it is significant to find a way to predict the ice coverage in order to help the shipping community plan safe and effective operations. Also, ice cover is important to determine climate and hydrological patterns in the Great Lakes.
Long short-term memory (LSTM) - a kind of Neural Network widely used for predicting the time series data.
LSTM is a special kind of RNN. LSTM is explicitly designed to avoid the long-term dependency problem in standard RNN.
Remembering information for long periods is practically their default behavior.
LSTM has a chain-like structure, with repeating modules. But the repeating module has a different structure.
Instead of having a single neural network layer, there are four, interacting in a very special way, which is shown in the figure left.
From left to right, there are three gates called forget gate, input gate, and output gate, respectively. These three gates will decide how much information will be discarded or maintained through this long chain.
With gates this special kind of structure, LSTM can store long-term information in its cell state and performs well in long time series prediction.
Source: https://colah.github.io/posts/2015-08-Understanding-LSTMs/
XGBoost is a decision-tree-based ensemble Machine Learning algorithm that uses a gradient boosting framework. Ensemble learning consists of a collection of predictors which are multiple models to provide better prediction accuracy. Boosting is a process where weak learners are modified to become better.
In Boosting technique the errors made by previous models are tried to be corrected by succeeding models by adding some weights to the models. Models are added sequentially until no further improvements can be made.Gradient boosting is a supervised learning algorithm that attempts to accurately predict a target variable by combining an ensemble of estimates from a set of simpler and weaker models.
Source: https://dzone.com/articles/xgboost-a-deep-dive-into-boosting
Training set (1995 - 2010), Validation set (2011 - 2015) and Test set (2016 - 2021).
Divided 3 Ice phases in winter - Freezing Phase (Nov.01-Jan.14), Stable Phase (Jan 15–Mar 25) and Melting Phase (Mar. 26–May 10)
RMSE: standard deviation of the residuals
MAE: average of all absolute errors
Ice-on/off date: If consecutive 3 days that have ice cover more than 10% and less than 10%, it will be chosen as Ice on and Ice off date respectively.
Baselline is calculated by taking the average of the ice concentration for all previous years . By comparing our prediction result with baseline, we can evaluate whether our models do perform better than simply taking the average(which is the method people normally use)
MAE | RMSE | ||||||
LSTM | XGBoost | Baseline | LSTM | XGBoost | Baseline | ||
7 days forecast model | freezing phase | 10.20 | 10.65 | 11.75 | 15.42 | 15.53 | 17.35 |
stable phase | 5.91 | 5.04 | 6.3 | 8.86 | 8.01 | 11.37 | |
melting phase | 9.91 | 7.80 | 23.93 | 14.75 | 11.42 | 28.83 | |
whole season | 9.91 | 7.75 | 12.72 | 14.75 | 12.09 | 19.21 | |
30 days forecast model | freezing phase | 14.93 | 14.32 | 11.75 | 21.82 | 20.40 | 17.35 |
stable phase | 8.86 | 9.52 | 6.3 | 13.13 | 13.83 | 11.37 | |
melting phase | 33.33 | 19.32 | 23.93 | 38.56 | 28.09 | 28.83 | |
whole season | 33.33 | 13.73 | 12.72 | 38.56 | 20.58 | 19.21 | |
Graduate Student at the School for Environment and Sustainability, University of Michigan
Email: sdavedu@umich.edu
Graduate Student at the School for Environment and Sustainability, University of Michigan
Email: liulian@umich.edu
Assistant Research Scientist
Cooperative Institute for Great Lakes Research in School for Environment and Sustainability,
UM Climate & Space Sciences and Engineering (joint appointment)
Email: ayumif@umich.edu
Research Computing Specialist
Cooperative Institute for Great Lakes Research in School for Environment and Sustainability
Email: hghu@umich.edu
Associate Professor
UM Climate & Space Sciences and Engineering
Email: cjablono@umich.edu
Chief at Integrated Physical & Ecological Modeling & Forecasting Branch
NOAA Great Lakes Environmental Research Laboratory
Email: philip.chu@noaa.gov