File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Uncertainties of machine learning in predicting the hydrological responses of lid practices

TitleUncertainties of machine learning in predicting the hydrological responses of lid practices
Authors
KeywordsLow impact development
Machine learning
Stormwater management
Sustainable drainage system
Uncertainty analysis
Issue Date2020
Citation
22nd Congress of the International Association for Hydro-Environment Engineering and Research-Asia Pacific Division, IAHR-APD 2020: "Creating Resilience to Water-Related Challenges", 2020 How to Cite?
AbstractLow impact development (LID) practices, such as green roofs and bioretention cells, are regarded as environmentally-friendly alternatives to the conventional drainage infrastructures. It is essential to accurately predict the hydrological responses of LID practices for assessing and optimizing LID designs. However, the accuracy of the commonly used process-based hydrological models is sometimes affected by model assumptions and the availability of field measurements for calibration. Machine learning methods can potentially avoid these issues by directly modeling the correlation between the input (e.g., rainfall time series) and the response (e.g., outflow hydrograph) of a system. However, considerable uncertainties are involved when training machine learning models. As a case study, the correlation between rainfall time series and outflow rates in an LID site in the U.S. is modeled using 11 commonly used machine learning models, including random forest, k-nearest neighbors, and gradient boosting machine. These models are trained on high temporal resolution data using formal machine learning procedures, which include feature engineering, pre-processing, model tuning, and resampling. Different methods are used in the training procedures for assessing the involved uncertainties. For example, in feature engineering, the original high-resolution time series is transformed into different sets of features (e.g., mean and peak rainfall intensity in the past two hours) which are used as input to machine learning models, and different types of transformations are used to preprocess these features. The results show that some machine learning models can achieve comparable or better prediction accuracy when compared to process-based models, and performance of different machine learning models can vary significantly. The feature engineering and the resampling procedures are found to have significant impacts on the quality of the trained models. Evaluating multiple machine learning models and using various methods in model training are crucial for assessing the uncertainties involved in machine learning.
Persistent Identifierhttp://hdl.handle.net/10722/335863

 

DC FieldValueLanguage
dc.contributor.authorYang, Yang-
dc.contributor.authorChui, Ting Fong May-
dc.date.accessioned2023-12-28T08:49:19Z-
dc.date.available2023-12-28T08:49:19Z-
dc.date.issued2020-
dc.identifier.citation22nd Congress of the International Association for Hydro-Environment Engineering and Research-Asia Pacific Division, IAHR-APD 2020: "Creating Resilience to Water-Related Challenges", 2020-
dc.identifier.urihttp://hdl.handle.net/10722/335863-
dc.description.abstractLow impact development (LID) practices, such as green roofs and bioretention cells, are regarded as environmentally-friendly alternatives to the conventional drainage infrastructures. It is essential to accurately predict the hydrological responses of LID practices for assessing and optimizing LID designs. However, the accuracy of the commonly used process-based hydrological models is sometimes affected by model assumptions and the availability of field measurements for calibration. Machine learning methods can potentially avoid these issues by directly modeling the correlation between the input (e.g., rainfall time series) and the response (e.g., outflow hydrograph) of a system. However, considerable uncertainties are involved when training machine learning models. As a case study, the correlation between rainfall time series and outflow rates in an LID site in the U.S. is modeled using 11 commonly used machine learning models, including random forest, k-nearest neighbors, and gradient boosting machine. These models are trained on high temporal resolution data using formal machine learning procedures, which include feature engineering, pre-processing, model tuning, and resampling. Different methods are used in the training procedures for assessing the involved uncertainties. For example, in feature engineering, the original high-resolution time series is transformed into different sets of features (e.g., mean and peak rainfall intensity in the past two hours) which are used as input to machine learning models, and different types of transformations are used to preprocess these features. The results show that some machine learning models can achieve comparable or better prediction accuracy when compared to process-based models, and performance of different machine learning models can vary significantly. The feature engineering and the resampling procedures are found to have significant impacts on the quality of the trained models. Evaluating multiple machine learning models and using various methods in model training are crucial for assessing the uncertainties involved in machine learning.-
dc.languageeng-
dc.relation.ispartof22nd Congress of the International Association for Hydro-Environment Engineering and Research-Asia Pacific Division, IAHR-APD 2020: "Creating Resilience to Water-Related Challenges"-
dc.subjectLow impact development-
dc.subjectMachine learning-
dc.subjectStormwater management-
dc.subjectSustainable drainage system-
dc.subjectUncertainty analysis-
dc.titleUncertainties of machine learning in predicting the hydrological responses of lid practices-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.scopuseid_2-s2.0-85104789808-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats