Anisotropic convolutional networks for 3D semantic scene completion

Li, Jie; Han, Kai; Wang, Peng; Liu, Yu; Yuan, Xia

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/CVPR42600.2020.00341
Scopus: eid_2-s2.0-85094162301
WOS: WOS:000620679503060

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Statistics & Actuarial Science: Conference papers

Conference Paper: Anisotropic convolutional networks for 3D semantic scene completion

Title	Anisotropic convolutional networks for 3D semantic scene completion
Authors	Li, Jie; Han, Kai; Wang, Peng; Liu, Yu; Yuan, Xia
Issue Date	2020
Citation	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020, p. 3348-3356. DOI: http://dx.doi.org/10.1109/CVPR42600.2020.00341
Abstract	As a voxel-wise labeling task, semantic scene completion (SSC) tries to simultaneously infer the occupancy and semantic labels for a scene from a single depth and/or RGB image. The key challenge for SSC is how to effectively take advantage of the 3D context to model various objects or stuff with severe variations in shape, layout and visibility. To handle such variations, we propose a novel module called anisotropic convolution, which offers flexibility and modeling power that competing methods, such as standard 3D convolution and some of its variants, cannot provide. In contrast to the standard 3D convolution, which is limited to a fixed 3D receptive field, our module is capable of modeling the dimensional anisotropy voxel-wise. The basic idea is to enable an anisotropic 3D receptive field by decomposing a 3D convolution into three consecutive 1D convolutions, with the kernel size of each 1D convolution adaptively determined on the fly. By stacking multiple such anisotropic convolution modules, the voxel-wise modeling capability can be further enhanced while maintaining a controllable number of model parameters. Extensive experiments on two SSC benchmarks, NYU-Depth-v2 and NYUCAD, show the superior performance of the proposed method. Our code is available at https://waterljwant.github.io/SSC/.
Persistent Identifier	http://hdl.handle.net/10722/311497
ISSN	1063-6919
2023 SCImago Journal Rankings	10.331
ISI Accession Number ID	WOS:000620679503060
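
The decomposition described in the abstract can be illustrated with a small sketch. It is not the authors' implementation (their code is at the URL given in the abstract). It assumes one plausible way of making the kernel size adaptive: along each axis, 1D convolutions with several candidate kernel sizes are computed and blended per voxel with softmax weights. The function names, the nested-list volume layout and the per-voxel `scores` input are all hypothetical and chosen for illustration only; in a trained network the kernels and scores would be learned. Note that three 1D kernels of size k hold 3k weights, against k^3 for one full 3D kernel.

```python
import math


def conv1d_along_axis(vol, kernel, axis):
    """Zero-padded 1D cross-correlation of a 3D nested list along one axis."""
    dims = (len(vol), len(vol[0]), len(vol[0][0]))
    radius = len(kernel) // 2
    out = [[[0.0] * dims[2] for _ in range(dims[1])] for _ in range(dims[0])]
    for z in range(dims[0]):
        for y in range(dims[1]):
            for x in range(dims[2]):
                acc = 0.0
                for k, weight in enumerate(kernel):
                    pos = [z, y, x]
                    pos[axis] += k - radius
                    # Positions outside the volume count as zero.
                    if 0 <= pos[axis] < dims[axis]:
                        acc += weight * vol[pos[0]][pos[1]][pos[2]]
                out[z][y][x] = acc
    return out


def softmax(scores):
    """Numerically stable softmax over a short list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def anisotropic_step(vol, kernels, scores, axis):
    """Blend candidate 1D convolutions per voxel (assumed selection scheme).

    scores[z][y][x] holds one score per candidate kernel; a larger score
    gives that kernel size more influence at that voxel.
    """
    candidates = [conv1d_along_axis(vol, k, axis) for k in kernels]
    dims = (len(vol), len(vol[0]), len(vol[0][0]))
    out = [[[0.0] * dims[2] for _ in range(dims[1])] for _ in range(dims[0])]
    for z in range(dims[0]):
        for y in range(dims[1]):
            for x in range(dims[2]):
                weights = softmax(scores[z][y][x])
                out[z][y][x] = sum(
                    w * c[z][y][x] for w, c in zip(weights, candidates)
                )
    return out


def anisotropic_conv(vol, kernels, scores_per_axis):
    """Apply three consecutive 1D steps, one per axis of the volume."""
    for axis in range(3):
        vol = anisotropic_step(vol, kernels, scores_per_axis[axis], axis)
    return vol
```

Because the scores can differ per voxel and per axis, one voxel may effectively see a wide kernel along one axis and a narrow one along another, which is the sense in which the receptive field is anisotropic in this sketch.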

DC Field	Value	Language
dc.contributor.author	Li, Jie	-
dc.contributor.author	Han, Kai	-
dc.contributor.author	Wang, Peng	-
dc.contributor.author	Liu, Yu	-
dc.contributor.author	Yuan, Xia	-
dc.date.accessioned	2022-03-22T11:54:05Z	-
dc.date.available	2022-03-22T11:54:05Z	-
dc.date.issued	2020	-
dc.identifier.citation	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020, p. 3348-3356	-
dc.identifier.issn	1063-6919	-
dc.identifier.uri	http://hdl.handle.net/10722/311497	-
dc.description.abstract	As a voxel-wise labeling task, semantic scene completion (SSC) tries to simultaneously infer the occupancy and semantic labels for a scene from a single depth and/or RGB image. The key challenge for SSC is how to effectively take advantage of the 3D context to model various objects or stuff with severe variations in shape, layout and visibility. To handle such variations, we propose a novel module called anisotropic convolution, which offers flexibility and modeling power that competing methods, such as standard 3D convolution and some of its variants, cannot provide. In contrast to the standard 3D convolution, which is limited to a fixed 3D receptive field, our module is capable of modeling the dimensional anisotropy voxel-wise. The basic idea is to enable an anisotropic 3D receptive field by decomposing a 3D convolution into three consecutive 1D convolutions, with the kernel size of each 1D convolution adaptively determined on the fly. By stacking multiple such anisotropic convolution modules, the voxel-wise modeling capability can be further enhanced while maintaining a controllable number of model parameters. Extensive experiments on two SSC benchmarks, NYU-Depth-v2 and NYUCAD, show the superior performance of the proposed method. Our code is available at https://waterljwant.github.io/SSC/.	-
dc.language	eng	-
dc.relation.ispartof	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition	-
dc.title	Anisotropic convolutional networks for 3D semantic scene completion	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/CVPR42600.2020.00341	-
dc.identifier.scopus	eid_2-s2.0-85094162301	-
dc.identifier.spage	3348	-
dc.identifier.epage	3356	-
dc.identifier.isi	WOS:000620679503060	-
