File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TBDATA.2025.3533882
- Scopus: eid_2-s2.0-85216839757
- Find via

Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: CTDI: CNN-Transformer-Based Spatial-Temporal Missing Air Pollution Data Imputation
| Title | CTDI: CNN-Transformer-Based Spatial-Temporal Missing Air Pollution Data Imputation |
|---|---|
| Authors | |
| Keywords | Air pollution data imputation data recovery deep learning missing data spatial-temporal transformer |
| Issue Date | 1-Oct-2025 |
| Publisher | Institute of Electrical and Electronics Engineers |
| Citation | IEEE Transactions on Big Data, 2025, v. 11, n. 5, p. 2443-2456 How to Cite? |
| Abstract | Accurate and comprehensive air pollution data is essential for understanding and addressing environmental challenges. Missing data can impair accurate analysis and decision-making. This study presents a novel approach, named CNN-Transformer-based Spatial-Temporal Data Imputation (CTDI), for imputing missing air pollution data. Data pre-processing incorporates observed air pollution data and related urban data to produce 24-hour period tensors as input samples. 1-by-1 CNN layers capture the interaction between different types of input data. Deep learning transformer architecture is employed in a spatial-temporal (S-T) transformer module to capture long-range dependencies and extract complex relationships in both spatial and temporal dimensions. Hong Kong air pollution data is statistically analyzed and used to evaluate CTDI in its recovery of generated and actual patterns of missing data. Experimental results show that CTDI consistently outperforms existing imputation methods across all evaluated scenarios, including cases with higher rates of missing data, thereby demonstrating its robustness and effectiveness in enhancing air quality monitoring. Additionally, ablation experiments reveal that each component significantly contributes to the model's performance, with the temporal transformer proving particularly crucial under varying rates of missing data. |
| Persistent Identifier | http://hdl.handle.net/10722/368345 |
| ISSN | 2023 Impact Factor: 7.5 2023 SCImago Journal Rankings: 1.821 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Yu, Yangwen | - |
| dc.contributor.author | Li, Victor O.K. | - |
| dc.contributor.author | Lam, Jacqueline C.K. | - |
| dc.contributor.author | Chan, Kelvin | - |
| dc.contributor.author | Zhang, Qi | - |
| dc.date.accessioned | 2025-12-31T00:35:09Z | - |
| dc.date.available | 2025-12-31T00:35:09Z | - |
| dc.date.issued | 2025-10-01 | - |
| dc.identifier.citation | IEEE Transactions on Big Data, 2025, v. 11, n. 5, p. 2443-2456 | - |
| dc.identifier.issn | 2332-7790 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/368345 | - |
| dc.description.abstract | Accurate and comprehensive air pollution data is essential for understanding and addressing environmental challenges. Missing data can impair accurate analysis and decision-making. This study presents a novel approach, named CNN-Transformer-based Spatial-Temporal Data Imputation (CTDI), for imputing missing air pollution data. Data pre-processing incorporates observed air pollution data and related urban data to produce 24-hour period tensors as input samples. 1-by-1 CNN layers capture the interaction between different types of input data. Deep learning transformer architecture is employed in a spatial-temporal (S-T) transformer module to capture long-range dependencies and extract complex relationships in both spatial and temporal dimensions. Hong Kong air pollution data is statistically analyzed and used to evaluate CTDI in its recovery of generated and actual patterns of missing data. Experimental results show that CTDI consistently outperforms existing imputation methods across all evaluated scenarios, including cases with higher rates of missing data, thereby demonstrating its robustness and effectiveness in enhancing air quality monitoring. Additionally, ablation experiments reveal that each component significantly contributes to the model's performance, with the temporal transformer proving particularly crucial under varying rates of missing data. | - |
| dc.language | eng | - |
| dc.publisher | Institute of Electrical and Electronics Engineers | - |
| dc.relation.ispartof | IEEE Transactions on Big Data | - |
| dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
| dc.subject | Air pollution | - |
| dc.subject | data imputation | - |
| dc.subject | data recovery | - |
| dc.subject | deep learning | - |
| dc.subject | missing data | - |
| dc.subject | spatial-temporal | - |
| dc.subject | transformer | - |
| dc.title | CTDI: CNN-Transformer-Based Spatial-Temporal Missing Air Pollution Data Imputation | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1109/TBDATA.2025.3533882 | - |
| dc.identifier.scopus | eid_2-s2.0-85216839757 | - |
| dc.identifier.volume | 11 | - |
| dc.identifier.issue | 5 | - |
| dc.identifier.spage | 2443 | - |
| dc.identifier.epage | 2456 | - |
| dc.identifier.eissn | 2332-7790 | - |
| dc.identifier.issnl | 2332-7790 | - |
