File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/LRA.2024.3518076
- Scopus: eid_2-s2.0-86000383385
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model
| Title | OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model |
|---|---|
| Authors | |
| Keywords | 3D semantic occupancy prediction autonomous navigation Deep learning for visual perception |
| Issue Date | 1-Jan-2025 |
| Publisher | Institute of Electrical and Electronics Engineers |
| Citation | IEEE Robotics and Automation Letters, 2025, v. 10, n. 2, p. 1066-1073 How to Cite? |
| Abstract | Air-ground robots (AGRs) are widely used in surveillance and disaster response due to their exceptional mobility and versatility (i.e., flying and driving). Current AGR navigation systems perform well in static occlusion-prone environments (e.g., indoors) by using 3D semantic occupancy networks to predict occlusions for complete local mapping and then computing Euclidean Signed Distance Field (ESDF) for path planning. However, these systems face challenges in dynamic scenes (e.g., crowds) due to limitations in perception networks' low prediction accuracy and path planners' high computation overhead. In this paper, we propose Omega, which contains OccMamba with an Efficient AGR-plAnner to address the above-mentioned problems. OccMamba adopts a novel architecture that separates semantic and occupancy prediction into independent branches, incorporating the mamba module to efficiently extract semantic and geometric features in 3D environments. This ensures the network can learn long-distance dependencies and improve prediction accuracy. Features are then combined within the Bird's Eye View (BEV) space to minimise computational overhead during feature fusion. The resulting semantic occupancy map is integrated into the local map, providing occlusion awareness of the dynamic environment. Our AGR-Planner utilizes this local map and employs Kinodynamic A∗ search and gradient-based trajectory optimization for ESDF-free and energy-efficient planning. Experiments demonstrate that OccMamba outperforms the state-of-the-art 3D semantic occupancy network with 25.0% mIoU. End-to-end navigation experiments in dynamic scenes verify OMEGA's efficiency, achieving a 96% average planning success rate. |
| Persistent Identifier | http://hdl.handle.net/10722/361921 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Wang, Junming | - |
| dc.contributor.author | Guan, Xiuxian | - |
| dc.contributor.author | Sun, Zekai | - |
| dc.contributor.author | Shen, Tianxiang | - |
| dc.contributor.author | Huang, Dong | - |
| dc.contributor.author | Liu, Fangming | - |
| dc.contributor.author | Cui, Heming | - |
| dc.date.accessioned | 2025-09-17T00:32:03Z | - |
| dc.date.available | 2025-09-17T00:32:03Z | - |
| dc.date.issued | 2025-01-01 | - |
| dc.identifier.citation | IEEE Robotics and Automation Letters, 2025, v. 10, n. 2, p. 1066-1073 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/361921 | - |
| dc.description.abstract | <p>Air-ground robots (AGRs) are widely used in surveillance and disaster response due to their exceptional mobility and versatility (i.e., flying and driving). Current AGR navigation systems perform well in static occlusion-prone environments (e.g., indoors) by using 3D semantic occupancy networks to predict occlusions for complete local mapping and then computing Euclidean Signed Distance Field (ESDF) for path planning. However, these systems face challenges in dynamic scenes (e.g., crowds) due to limitations in perception networks' low prediction accuracy and path planners' high computation overhead. In this paper, we propose Omega, which contains OccMamba with an Efficient AGR-plAnner to address the above-mentioned problems. OccMamba adopts a novel architecture that separates semantic and occupancy prediction into independent branches, incorporating the mamba module to efficiently extract semantic and geometric features in 3D environments. This ensures the network can learn long-distance dependencies and improve prediction accuracy. Features are then combined within the Bird's Eye View (BEV) space to minimise computational overhead during feature fusion. The resulting semantic occupancy map is integrated into the local map, providing occlusion awareness of the dynamic environment. Our AGR-Planner utilizes this local map and employs Kinodynamic A∗ search and gradient-based trajectory optimization for ESDF-free and energy-efficient planning. Experiments demonstrate that OccMamba outperforms the state-of-the-art 3D semantic occupancy network with 25.0% mIoU. End-to-end navigation experiments in dynamic scenes verify OMEGA's efficiency, achieving a 96% average planning success rate.</p> | - |
| dc.language | eng | - |
| dc.publisher | Institute of Electrical and Electronics Engineers | - |
| dc.relation.ispartof | IEEE Robotics and Automation Letters | - |
| dc.subject | 3D semantic occupancy prediction | - |
| dc.subject | autonomous navigation | - |
| dc.subject | Deep learning for visual perception | - |
| dc.title | OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1109/LRA.2024.3518076 | - |
| dc.identifier.scopus | eid_2-s2.0-86000383385 | - |
| dc.identifier.volume | 10 | - |
| dc.identifier.issue | 2 | - |
| dc.identifier.spage | 1066 | - |
| dc.identifier.epage | 1073 | - |
| dc.identifier.eissn | 2377-3766 | - |
| dc.identifier.issnl | 2377-3766 | - |
