File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: A gradient boost approach for predicting near-road ultrafine particle concentrations using detailed traffic characterization

TitleA gradient boost approach for predicting near-road ultrafine particle concentrations using detailed traffic characterization
Authors
KeywordsCross-validation
K-means clustering
Local traffic
Machine learning
Short-term fixed monitoring
Issue Date2020
Citation
Environmental Pollution, 2020, v. 265, article no. 114777 How to Cite?
AbstractThis study investigates the influence of meteorology, land use, built environment, and traffic characteristics on near-road ultrafine particle (UFP) concentrations. To achieve this objective, minute-level UFP concentrations were measured at various locations along a major arterial road in the Greater Toronto Area (GTA) between February and May 2019. Each location was visited five times, at least once in the morning, mid-day, and afternoon. Each visit lasted for 30 min, resulting in 2.5 h of minute-level data collected at each location. Local traffic information, including vehicle class and turning movements, were processed using computer vision techniques. The number of fast-food restaurants, cafes, trees, traffic signals, and building footprint, were found to have positive impacts on the mean UFP, while distance to the closest major road was negatively associated with UFP. We employed the Extreme Gradient Boosting (XGBoost) method to develop prediction models for UFP concentrations. The Shapley additive explanation (SHAP) measures were used to capture the influence of each feature on model output. The model results demonstrated that minute-level counts of local traffic from different directions had significant impacts on near-road UFP concentrations, model performance was robust under random cross-validation as coefficients of determination (R2) ranged from 0.63 to 0.69, but it revealed weaknesses when data at specific locations were eliminated from the training dataset. This result indicates that proper cross-validation techniques should be developed to better evaluate machine learning models for air quality predictions.
Persistent Identifierhttp://hdl.handle.net/10722/346783
ISSN
2023 Impact Factor: 7.6
2023 SCImago Journal Rankings: 2.132

 

DC FieldValueLanguage
dc.contributor.authorXu, Junshi-
dc.contributor.authorWang, An-
dc.contributor.authorSchmidt, Nicole-
dc.contributor.authorAdams, Matthew-
dc.contributor.authorHatzopoulou, Marianne-
dc.date.accessioned2024-09-17T04:13:15Z-
dc.date.available2024-09-17T04:13:15Z-
dc.date.issued2020-
dc.identifier.citationEnvironmental Pollution, 2020, v. 265, article no. 114777-
dc.identifier.issn0269-7491-
dc.identifier.urihttp://hdl.handle.net/10722/346783-
dc.description.abstractThis study investigates the influence of meteorology, land use, built environment, and traffic characteristics on near-road ultrafine particle (UFP) concentrations. To achieve this objective, minute-level UFP concentrations were measured at various locations along a major arterial road in the Greater Toronto Area (GTA) between February and May 2019. Each location was visited five times, at least once in the morning, mid-day, and afternoon. Each visit lasted for 30 min, resulting in 2.5 h of minute-level data collected at each location. Local traffic information, including vehicle class and turning movements, were processed using computer vision techniques. The number of fast-food restaurants, cafes, trees, traffic signals, and building footprint, were found to have positive impacts on the mean UFP, while distance to the closest major road was negatively associated with UFP. We employed the Extreme Gradient Boosting (XGBoost) method to develop prediction models for UFP concentrations. The Shapley additive explanation (SHAP) measures were used to capture the influence of each feature on model output. The model results demonstrated that minute-level counts of local traffic from different directions had significant impacts on near-road UFP concentrations, model performance was robust under random cross-validation as coefficients of determination (R2) ranged from 0.63 to 0.69, but it revealed weaknesses when data at specific locations were eliminated from the training dataset. This result indicates that proper cross-validation techniques should be developed to better evaluate machine learning models for air quality predictions.-
dc.languageeng-
dc.relation.ispartofEnvironmental Pollution-
dc.subjectCross-validation-
dc.subjectK-means clustering-
dc.subjectLocal traffic-
dc.subjectMachine learning-
dc.subjectShort-term fixed monitoring-
dc.titleA gradient boost approach for predicting near-road ultrafine particle concentrations using detailed traffic characterization-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1016/j.envpol.2020.114777-
dc.identifier.pmid32540592-
dc.identifier.scopuseid_2-s2.0-85086433299-
dc.identifier.volume265-
dc.identifier.spagearticle no. 114777-
dc.identifier.epagearticle no. 114777-
dc.identifier.eissn1873-6424-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats