Quantifying Hourly Suspended Sediment Load Using Data Mining Models: Case Study of a Glacierized Andean Catchment in Chile

Khosravi, Khabat and Mao, Luca and Kisi, Ozgur and Yaseen, Zaher and Shahid, Shamsuddin (2018) Quantifying Hourly Suspended Sediment Load Using Data Mining Models: Case Study of a Glacierized Andean Catchment in Chile. Journal of Hydrology . ISSN 0022-1694

Full content URL: https://www.sciencedirect.com/science/article/pii/...

Documents
Quantifying Hourly Suspended Sediment Load Using Data Mining Models: Case Study of a Glacierized Andean Catchment in Chile
[img]
[Download]
[img] Microsoft Word
Revised manuscript SST.docx - Whole Document
Available under License Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International.

134kB
Item Type:Article
Item Status:Live Archive

Abstract

Suspended sediment has significant effects on reservoir storage capacity, the operation of hydraulic structures and river morphology. Hence, modeling suspended sediment loads (SSL) in rivers contributes to various water resource management and river engineering. An evaluation of stand-alone data mining models (i.e., reduced error pruning tree (REPT), M5P and instance-based learning (IBK)) and hybrid models, (i.e., bagging-M5P, random committee-REPT (RC-REPT) and random subspace-REPT (RS-REPT)) for predicting SSL resulting from glacial melting at an Andean catchment in Chile has been conducted in this study. The best input combinations are constructed based on Pearson correlation coefficient (PCC) of hourly SSL time series data with water discharge (Q), water temperature (T) and electrical conductivity (C) for different time lags. Seventy percent of the available data (one year of hourly data) is used to calibrate the models (dataset training) and the remaining 30% is used for model evaluation (dataset testing). The performances of the models are evaluated using several quantitative and graphical criteria, including coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), Nash-Sutcliffe efficiency (NSE), percentage of bias (PBIAS), the ratio of RMSE to the standard deviation of observation (RSR), a Taylor diagram and a boxplot. All the models performed well in predicting SSL. However, the Friedman and Wilcoxon signed rank tests revealed that predicted SSL significantly differed for different models except between IBK (or M5P) and REPT. The hybrid models performed better than individual models. The bagging-M5P had the best predictive capability while the REPT had the poorest.

Additional Information:The final published version of this article is available online at https://www.sciencedirect.com/science/article/pii/S0022169418307765#ak005
Keywords:Suspended sediment load, ; Data mining models, Glacierized Andes Catchment, Hourly time scale
Subjects:F Physical Sciences > F820 Geomorphology
Divisions:College of Science > School of Geography
ID Code:33669
Deposited On:11 Oct 2018 11:54

Repository Staff Only: item control page