File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: GeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition

TitleGeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition
Authors
Keywords3D action recognition
Feature extraction
Finite element analysis
Geometry
point cloud
Point cloud compression
Task analysis
Three-dimensional displays
transformer
Transformers
Issue Date2022
Citation
IEEE Transactions on Multimedia, 2022 How to Cite?
AbstractIn this work, we propose a new end-to-end optimized two-stream framework called GeometryMotion-Transformer (GMT) for 3D action recognition. We first observe that the existing 3D action recognition approaches cannot well extract motion representations from point cloud sequences. Specifically, when extracting motion representations, the existing approaches do not explicitly consider one-to-one correspondence among frames. Besides, the existing methods only extract the single-scale motion representations, which cannot well model the complex motion patterns of moving objects in point cloud sequences. To address these issues, we first propose the feature extraction module (FEM) to generate one-to-one correspondence among frames without using the voxelization process, and explicitly extract both geometry and multi-scale motion representations from raw point clouds. Moreover, we also observe the existing two-stream 3D action recognition approaches simply concatenate or add the geometry and motion features, which cannot well exploit the relationship between two-steam features. To this end, we also propose an improved transformer-based feature fusion module (FFM) to effectively fuse the two-stream features. Based on the proposed FEM and FFM, we build our GMT for 3D action recognition. Extensive experimental results on four benchmark datasets demonstrate the effectiveness of our backbone GMT.
Persistent Identifierhttp://hdl.handle.net/10722/322003
ISSN
2021 Impact Factor: 8.182
2020 SCImago Journal Rankings: 1.218
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLiu, Jiaheng-
dc.contributor.authorGuo, Jinyang-
dc.contributor.authorXu, Dong-
dc.date.accessioned2022-11-03T02:22:56Z-
dc.date.available2022-11-03T02:22:56Z-
dc.date.issued2022-
dc.identifier.citationIEEE Transactions on Multimedia, 2022-
dc.identifier.issn1520-9210-
dc.identifier.urihttp://hdl.handle.net/10722/322003-
dc.description.abstractIn this work, we propose a new end-to-end optimized two-stream framework called GeometryMotion-Transformer (GMT) for 3D action recognition. We first observe that the existing 3D action recognition approaches cannot well extract motion representations from point cloud sequences. Specifically, when extracting motion representations, the existing approaches do not explicitly consider one-to-one correspondence among frames. Besides, the existing methods only extract the <italic>single-scale</italic> motion representations, which cannot well model the complex motion patterns of moving objects in point cloud sequences. To address these issues, we first propose the feature extraction module (FEM) to generate one-to-one correspondence among frames without using the voxelization process, and explicitly extract both geometry and <italic>multi-scale</italic> motion representations from raw point clouds. Moreover, we also observe the existing two-stream 3D action recognition approaches simply concatenate or add the geometry and motion features, which cannot well exploit the relationship between two-steam features. To this end, we also propose an improved transformer-based feature fusion module (FFM) to effectively fuse the two-stream features. Based on the proposed FEM and FFM, we build our GMT for 3D action recognition. Extensive experimental results on four benchmark datasets demonstrate the effectiveness of our backbone GMT.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Multimedia-
dc.subject3D action recognition-
dc.subjectFeature extraction-
dc.subjectFinite element analysis-
dc.subjectGeometry-
dc.subjectpoint cloud-
dc.subjectPoint cloud compression-
dc.subjectTask analysis-
dc.subjectThree-dimensional displays-
dc.subjecttransformer-
dc.subjectTransformers-
dc.titleGeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TMM.2022.3198011-
dc.identifier.scopuseid_2-s2.0-85135970280-
dc.identifier.eissn1941-0077-
dc.identifier.isiWOS:001098831500001-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats