File Download
There are no files associated with this item.
Links for fulltext (May Require Subscription)
- Publisher Website: https://doi.org/10.1109/CVPR52729.2023.02052
- Scopus: eid_2-s2.0-85172231843
Citations:
- Scopus: 0

Appears in Collections:
Conference Paper: PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes
Title | PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes |
---|---|
Authors | Wang, Ruoyu; Yu, Zehao; Gao, Shenghua |
Keywords | 3D from multi-view and sensors |
Issue Date | 2023 |
Citation | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023, v. 2023-June, p. 21425-21434 |
Abstract | Depth representations based on multiple near frontal-parallel planes have demonstrated impressive results in self-supervised monocular depth estimation (MDE). However, such a representation causes discontinuities on the ground, which is perpendicular to the frontal-parallel planes, and this is detrimental to identifying drivable space in autonomous driving. In this paper, we propose PlaneDepth, a novel orthogonal-planes-based representation comprising vertical planes and ground planes. PlaneDepth estimates the depth distribution of an input image using a Laplacian Mixture Model defined over the orthogonal planes. These planes are used to synthesize a reference view that provides the self-supervision signal. Further, we find that the widely used resizing and cropping data augmentation breaks the orthogonality assumptions, leading to inferior plane predictions. We address this problem by explicitly constructing the resizing and cropping transformation to rectify the predefined planes and the predicted camera pose. Moreover, we propose an augmented self-distillation loss, supervised with a bilateral occlusion mask, to boost the robustness of the orthogonal-planes representation under occlusions. Thanks to the orthogonal-planes representation, we can extract the ground plane in an unsupervised manner, which is important for autonomous driving. Extensive experiments on the KITTI dataset demonstrate the effectiveness and efficiency of our method. The code is available at https://github.com/svip-lab/PlaneDepth. (An illustrative sketch of this representation follows the table below.) |
Persistent Identifier | http://hdl.handle.net/10722/345354 |
ISSN | 1063-6919 (2023 SCImago Journal Rankings: 10.331) |
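The abstract describes depth as a mixture over predefined orthogonal plane hypotheses: frontal-parallel (vertical) planes, which induce a constant depth per plane, and ground planes, whose induced depth grows toward the horizon row. The sketch below is a minimal, hypothetical NumPy illustration of that idea only, not the authors' implementation: the helper names (`plane_depth_maps`, `expected_depth`), the camera parameters, and the plane lists are assumptions, and the Laplacian mixture likelihood, view synthesis, and self-distillation loss are omitted. See the paper and the linked repository for the actual method.

```python
import numpy as np

def plane_depth_maps(height, width, fy, cy, frontal_disparities, ground_heights):
    """Per-pixel depth induced by each hypothesis plane (illustrative only).

    Frontal-parallel planes give a constant depth per plane; ground planes
    give a depth that increases toward the horizon row under a simple
    pinhole model with camera height h and intrinsics (fy, cy).
    """
    v = np.tile(np.arange(height, dtype=np.float64)[:, None], (1, width))  # pixel row index
    maps = [np.full((height, width), 1.0 / d) for d in frontal_disparities]  # depth = 1 / disparity
    for h in ground_heights:
        below_horizon = np.clip(v - cy, 1e-3, None)  # avoid division by zero at the horizon
        maps.append(fy * h / below_horizon)
    return np.stack(maps)  # shape (K, H, W)

def expected_depth(plane_logits, depth_maps):
    """Mixture-weighted depth: a softmax over plane logits gives per-pixel weights."""
    w = np.exp(plane_logits - plane_logits.max(axis=0, keepdims=True))
    w /= w.sum(axis=0, keepdims=True)
    return (w * depth_maps).sum(axis=0)  # shape (H, W)

# Toy usage with made-up intrinsics and plane hypotheses (not the paper's settings).
H, W = 192, 640
maps = plane_depth_maps(H, W, fy=720.0, cy=96.0,
                        frontal_disparities=[0.02, 0.05, 0.1, 0.2],
                        ground_heights=[1.5, 1.6])
logits = np.random.randn(maps.shape[0], H, W)  # stand-in for network predictions
depth = expected_depth(logits, maps)
```

In the paper the per-pixel weights come from a Laplacian Mixture Model predicted by the network, and the planes are also used to warp and synthesize a reference view for the photometric self-supervision signal; the expectation above only shows how a mixture over orthogonal plane hypotheses yields a dense depth map.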
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wang, Ruoyu | - |
dc.contributor.author | Yu, Zehao | - |
dc.contributor.author | Gao, Shenghua | - |
dc.date.accessioned | 2024-08-15T09:26:49Z | - |
dc.date.available | 2024-08-15T09:26:49Z | - |
dc.date.issued | 2023 | - |
dc.identifier.citation | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023, v. 2023-June, p. 21425-21434 | - |
dc.identifier.issn | 1063-6919 | - |
dc.identifier.uri | http://hdl.handle.net/10722/345354 | - |
dc.description.abstract | Depth representations based on multiple near frontal-parallel planes have demonstrated impressive results in self-supervised monocular depth estimation (MDE). However, such a representation causes discontinuities on the ground, which is perpendicular to the frontal-parallel planes, and this is detrimental to identifying drivable space in autonomous driving. In this paper, we propose PlaneDepth, a novel orthogonal-planes-based representation comprising vertical planes and ground planes. PlaneDepth estimates the depth distribution of an input image using a Laplacian Mixture Model defined over the orthogonal planes. These planes are used to synthesize a reference view that provides the self-supervision signal. Further, we find that the widely used resizing and cropping data augmentation breaks the orthogonality assumptions, leading to inferior plane predictions. We address this problem by explicitly constructing the resizing and cropping transformation to rectify the predefined planes and the predicted camera pose. Moreover, we propose an augmented self-distillation loss, supervised with a bilateral occlusion mask, to boost the robustness of the orthogonal-planes representation under occlusions. Thanks to the orthogonal-planes representation, we can extract the ground plane in an unsupervised manner, which is important for autonomous driving. Extensive experiments on the KITTI dataset demonstrate the effectiveness and efficiency of our method. The code is available at https://github.com/svip-lab/PlaneDepth. | -
dc.language | eng | - |
dc.relation.ispartof | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition | - |
dc.subject | 3D from multi-view and sensors | - |
dc.title | PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/CVPR52729.2023.02052 | - |
dc.identifier.scopus | eid_2-s2.0-85172231843 | - |
dc.identifier.volume | 2023-June | - |
dc.identifier.spage | 21425 | - |
dc.identifier.epage | 21434 | - |