File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Point Transformer

TitlePoint Transformer
Authors
Issue Date2021
Citation
Proceedings of the IEEE International Conference on Computer Vision, 2021, p. 16239-16248 How to Cite?
AbstractSelf-attention networks have revolutionized natural language processing and are making impressive strides in image analysis tasks such as image classification and object detection. Inspired by this success, we investigate the application of self-attention networks to 3D point cloud processing. We design self-attention layers for point clouds and use these to construct self-attention networks for tasks such as semantic scene segmentation, object part segmentation, and object classification. Our Point Transformer design improves upon prior work across domains and tasks. For example, on the challenging S3DIS dataset for large-scale semantic scene segmentation, the Point Transformer attains an mIoU of 70.4% on Area 5, outperforming the strongest prior model by 3.3 absolute percentage points and crossing the 70% mIoU threshold for the first time.
Persistent Identifierhttp://hdl.handle.net/10722/333516
ISSN
2020 SCImago Journal Rankings: 4.133

 

DC FieldValueLanguage
dc.contributor.authorZhao, Hengshuang-
dc.contributor.authorJiang, Li-
dc.contributor.authorJia, Jiaya-
dc.contributor.authorTorr, Philip-
dc.contributor.authorKoltun, Vladlen-
dc.date.accessioned2023-10-06T05:20:06Z-
dc.date.available2023-10-06T05:20:06Z-
dc.date.issued2021-
dc.identifier.citationProceedings of the IEEE International Conference on Computer Vision, 2021, p. 16239-16248-
dc.identifier.issn1550-5499-
dc.identifier.urihttp://hdl.handle.net/10722/333516-
dc.description.abstractSelf-attention networks have revolutionized natural language processing and are making impressive strides in image analysis tasks such as image classification and object detection. Inspired by this success, we investigate the application of self-attention networks to 3D point cloud processing. We design self-attention layers for point clouds and use these to construct self-attention networks for tasks such as semantic scene segmentation, object part segmentation, and object classification. Our Point Transformer design improves upon prior work across domains and tasks. For example, on the challenging S3DIS dataset for large-scale semantic scene segmentation, the Point Transformer attains an mIoU of 70.4% on Area 5, outperforming the strongest prior model by 3.3 absolute percentage points and crossing the 70% mIoU threshold for the first time.-
dc.languageeng-
dc.relation.ispartofProceedings of the IEEE International Conference on Computer Vision-
dc.titlePoint Transformer-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/ICCV48922.2021.01595-
dc.identifier.scopuseid_2-s2.0-85117295748-
dc.identifier.spage16239-
dc.identifier.epage16248-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats