File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/LRA.2024.3483032
- Scopus: eid_2-s2.0-85207338911
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements
Title | iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements |
---|---|
Authors | |
Keywords | descriptor LiDAR SLAM loop detection multimodal Place recognition visual SLAM |
Issue Date | 1-Dec-2024 |
Publisher | Institute of Electrical and Electronics Engineers |
Citation | IEEE Robotics and Automation Letters, 2024, v. 9, n. 12, p. 10858-10865 How to Cite? |
Abstract | In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods. |
Persistent Identifier | http://hdl.handle.net/10722/354543 |
ISSN | 2023 Impact Factor: 4.6 2023 SCImago Journal Rankings: 2.119 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zou, Zuhao | - |
dc.contributor.author | Zheng, Chunran | - |
dc.contributor.author | Yuan, Chongjian | - |
dc.contributor.author | Zhou, Shunbo | - |
dc.contributor.author | Xue, Kaiwen | - |
dc.contributor.author | Zhang, Fu | - |
dc.date.accessioned | 2025-02-13T00:35:14Z | - |
dc.date.available | 2025-02-13T00:35:14Z | - |
dc.date.issued | 2024-12-01 | - |
dc.identifier.citation | IEEE Robotics and Automation Letters, 2024, v. 9, n. 12, p. 10858-10865 | - |
dc.identifier.issn | 2377-3766 | - |
dc.identifier.uri | http://hdl.handle.net/10722/354543 | - |
dc.description.abstract | <p>In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods.<br></p> | - |
dc.language | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers | - |
dc.relation.ispartof | IEEE Robotics and Automation Letters | - |
dc.subject | descriptor | - |
dc.subject | LiDAR SLAM | - |
dc.subject | loop detection | - |
dc.subject | multimodal | - |
dc.subject | Place recognition | - |
dc.subject | visual SLAM | - |
dc.title | iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/LRA.2024.3483032 | - |
dc.identifier.scopus | eid_2-s2.0-85207338911 | - |
dc.identifier.volume | 9 | - |
dc.identifier.issue | 12 | - |
dc.identifier.spage | 10858 | - |
dc.identifier.epage | 10865 | - |
dc.identifier.eissn | 2377-3766 | - |
dc.identifier.issnl | 2377-3766 | - |