File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: STVANet: A spatio-temporal visual attention framework with large kernel attention mechanism for citywide traffic dynamics prediction

TitleSTVANet: A spatio-temporal visual attention framework with large kernel attention mechanism for citywide traffic dynamics prediction
Authors
Keywords2D ConvNets
Deep learning
Large kernel attention
Spatio-temporal data
Squeeze-and-Excitation mechanism
Traffic information
Issue Date15-Nov-2024
PublisherElsevier
Citation
Expert Systems with Applications, 2024, v. 254 How to Cite?
Abstract

Enhancing the efficiency and safety of the Intelligent Transportation System requires effective modeling and prediction of citywide traffic dynamics. Most studies employ convolutional neural networks (CNNs) with a 3D convolutional structure or spatio-temporal models with self-attention mechanisms to capture the spatio-temporal information of traffic distribution. Although 3D CNNs excel at capturing local contextual information, they are computationally complex due to the large number of parameters and cannot capture long-range dependence. By contrast, although self-attention mechanisms originally designed to address challenges in natural language processing can capture long-range dependence, their application to 2D image structures requires breaking down the inherent 2D context into a 1D sequence, increasing the computational complexity and neglecting the adaptability between local contextual information and channels. Accordingly, we propose a spatio-temporal visual attention neural network (STVANet), a novel spatio-temporal visual attention 2D CNN, which integrates a unique visual attention module with a large kernel attention (LKA) mechanism, a squeeze-and-excitation (SE) mechanism and a feedforward component to capture long-range dependence and channel information in urban traffic data while preserving the 2D image structure. LKA-based spatio-temporal attention networks extract spatial and temporal features from weekly, daily, and recent hourly periods, and aggregate them with weighted consideration of external features to make predictions. Evaluation of real-world datasets demonstrates STVANet’s superiority over baseline models, showcasing its potential in citywide traffic prediction.


Persistent Identifierhttp://hdl.handle.net/10722/344383
ISSN
2023 Impact Factor: 7.5
2023 SCImago Journal Rankings: 1.875

 

DC FieldValueLanguage
dc.contributor.authorYang, Hongtai-
dc.contributor.authorJiang, Junbo-
dc.contributor.authorZhao, Zhan-
dc.contributor.authorPan, Renbin-
dc.contributor.authorTao, Siyu-
dc.date.accessioned2024-07-24T13:51:09Z-
dc.date.available2024-07-24T13:51:09Z-
dc.date.issued2024-11-15-
dc.identifier.citationExpert Systems with Applications, 2024, v. 254-
dc.identifier.issn0957-4174-
dc.identifier.urihttp://hdl.handle.net/10722/344383-
dc.description.abstract<p>Enhancing the efficiency and safety of the Intelligent Transportation System requires effective modeling and prediction of citywide traffic dynamics. Most studies employ convolutional neural networks (CNNs) with a 3D convolutional structure or spatio-temporal models with self-attention mechanisms to capture the spatio-temporal information of traffic distribution. Although 3D CNNs excel at capturing local contextual information, they are computationally complex due to the large number of parameters and cannot capture long-range dependence. By contrast, although self-attention mechanisms originally designed to address challenges in natural language processing can capture long-range dependence, their application to 2D image structures requires breaking down the inherent 2D context into a 1D sequence, increasing the computational complexity and neglecting the adaptability between local contextual information and channels. Accordingly, we propose a spatio-temporal visual attention neural network (STVANet), a novel spatio-temporal visual attention 2D CNN, which integrates a unique visual attention module with a large kernel attention (LKA) mechanism, a squeeze-and-excitation (SE) mechanism and a feedforward component to capture long-range dependence and channel information in urban traffic data while preserving the 2D image structure. LKA-based spatio-temporal attention networks extract spatial and temporal features from weekly, daily, and recent hourly periods, and aggregate them with weighted consideration of external features to make predictions. Evaluation of real-world datasets demonstrates STVANet’s superiority over baseline models, showcasing its potential in citywide traffic prediction.<br></p>-
dc.languageeng-
dc.publisherElsevier-
dc.relation.ispartofExpert Systems with Applications-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject2D ConvNets-
dc.subjectDeep learning-
dc.subjectLarge kernel attention-
dc.subjectSpatio-temporal data-
dc.subjectSqueeze-and-Excitation mechanism-
dc.subjectTraffic information-
dc.titleSTVANet: A spatio-temporal visual attention framework with large kernel attention mechanism for citywide traffic dynamics prediction -
dc.typeArticle-
dc.identifier.doi10.1016/j.eswa.2024.124466-
dc.identifier.scopuseid_2-s2.0-85195818018-
dc.identifier.volume254-
dc.identifier.eissn1873-6793-
dc.identifier.issnl0957-4174-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats