iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements

Zou, Zuhao; Zheng, Chunran; Yuan, Chongjian; Zhou, Shunbo; Xue, Kaiwen; Zhang, Fu

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/LRA.2024.3483032
Scopus: eid_2-s2.0-85207338911
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Mechanical Engineering: Journal/Magazine Articles

Article: iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements

Title	iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements
Authors	Zou, Zuhao Zheng, Chunran Yuan, Chongjian Zhou, Shunbo Xue, Kaiwen Zhang, Fu
Keywords	descriptor LiDAR SLAM loop detection multimodal Place recognition visual SLAM
Issue Date	1-Dec-2024
Publisher	Institute of Electrical and Electronics Engineers
Citation	IEEE Robotics and Automation Letters, 2024, v. 9, n. 12, p. 10858-10865 How to Cite? DOI: http://dx.doi.org/10.1109/LRA.2024.3483032
Abstract	In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods.
Persistent Identifier	http://hdl.handle.net/10722/354543
ISSN	2377-3766 2023 Impact Factor: 4.6 2023 SCImago Journal Rankings: 2.119

DC Field	Value	Language
dc.contributor.author	Zou, Zuhao	-
dc.contributor.author	Zheng, Chunran	-
dc.contributor.author	Yuan, Chongjian	-
dc.contributor.author	Zhou, Shunbo	-
dc.contributor.author	Xue, Kaiwen	-
dc.contributor.author	Zhang, Fu	-
dc.date.accessioned	2025-02-13T00:35:14Z	-
dc.date.available	2025-02-13T00:35:14Z	-
dc.date.issued	2024-12-01	-
dc.identifier.citation	IEEE Robotics and Automation Letters, 2024, v. 9, n. 12, p. 10858-10865	-
dc.identifier.issn	2377-3766	-
dc.identifier.uri	http://hdl.handle.net/10722/354543	-
dc.description.abstract	<p>In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods.<br></p>	-
dc.language	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers	-
dc.relation.ispartof	IEEE Robotics and Automation Letters	-
dc.subject	descriptor	-
dc.subject	LiDAR SLAM	-
dc.subject	loop detection	-
dc.subject	multimodal	-
dc.subject	Place recognition	-
dc.subject	visual SLAM	-
dc.title	iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements	-
dc.type	Article	-
dc.identifier.doi	10.1109/LRA.2024.3483032	-
dc.identifier.scopus	eid_2-s2.0-85207338911	-
dc.identifier.volume	9	-
dc.identifier.issue	12	-
dc.identifier.spage	10858	-
dc.identifier.epage	10865	-
dc.identifier.eissn	2377-3766	-
dc.identifier.issnl	2377-3766	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats