OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model

Wang, Junming; Guan, Xiuxian; Sun, Zekai; Shen, Tianxiang; Huang, Dong; Liu, Fangming; Cui, Heming

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/LRA.2024.3518076
Scopus: eid_2-s2.0-86000383385

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model

Title	OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model
Authors	Wang, Junming Guan, Xiuxian Sun, Zekai Shen, Tianxiang Huang, Dong Liu, Fangming Cui, Heming
Keywords	3D semantic occupancy prediction autonomous navigation Deep learning for visual perception
Issue Date	1-Jan-2025
Publisher	Institute of Electrical and Electronics Engineers
Citation	IEEE Robotics and Automation Letters, 2025, v. 10, n. 2, p. 1066-1073 How to Cite? DOI: http://dx.doi.org/10.1109/LRA.2024.3518076
Abstract	Air-ground robots (AGRs) are widely used in surveillance and disaster response due to their exceptional mobility and versatility (i.e., flying and driving). Current AGR navigation systems perform well in static occlusion-prone environments (e.g., indoors) by using 3D semantic occupancy networks to predict occlusions for complete local mapping and then computing Euclidean Signed Distance Field (ESDF) for path planning. However, these systems face challenges in dynamic scenes (e.g., crowds) due to limitations in perception networks' low prediction accuracy and path planners' high computation overhead. In this paper, we propose Omega, which contains OccMamba with an Efficient AGR-plAnner to address the above-mentioned problems. OccMamba adopts a novel architecture that separates semantic and occupancy prediction into independent branches, incorporating the mamba module to efficiently extract semantic and geometric features in 3D environments. This ensures the network can learn long-distance dependencies and improve prediction accuracy. Features are then combined within the Bird's Eye View (BEV) space to minimise computational overhead during feature fusion. The resulting semantic occupancy map is integrated into the local map, providing occlusion awareness of the dynamic environment. Our AGR-Planner utilizes this local map and employs Kinodynamic A∗ search and gradient-based trajectory optimization for ESDF-free and energy-efficient planning. Experiments demonstrate that OccMamba outperforms the state-of-the-art 3D semantic occupancy network with 25.0% mIoU. End-to-end navigation experiments in dynamic scenes verify OMEGA's efficiency, achieving a 96% average planning success rate.
Persistent Identifier	http://hdl.handle.net/10722/361921

DC Field	Value	Language
dc.contributor.author	Wang, Junming	-
dc.contributor.author	Guan, Xiuxian	-
dc.contributor.author	Sun, Zekai	-
dc.contributor.author	Shen, Tianxiang	-
dc.contributor.author	Huang, Dong	-
dc.contributor.author	Liu, Fangming	-
dc.contributor.author	Cui, Heming	-
dc.date.accessioned	2025-09-17T00:32:03Z	-
dc.date.available	2025-09-17T00:32:03Z	-
dc.date.issued	2025-01-01	-
dc.identifier.citation	IEEE Robotics and Automation Letters, 2025, v. 10, n. 2, p. 1066-1073	-
dc.identifier.uri	http://hdl.handle.net/10722/361921	-
dc.description.abstract	<p>Air-ground robots (AGRs) are widely used in surveillance and disaster response due to their exceptional mobility and versatility (i.e., flying and driving). Current AGR navigation systems perform well in static occlusion-prone environments (e.g., indoors) by using 3D semantic occupancy networks to predict occlusions for complete local mapping and then computing Euclidean Signed Distance Field (ESDF) for path planning. However, these systems face challenges in dynamic scenes (e.g., crowds) due to limitations in perception networks' low prediction accuracy and path planners' high computation overhead. In this paper, we propose Omega, which contains OccMamba with an Efficient AGR-plAnner to address the above-mentioned problems. OccMamba adopts a novel architecture that separates semantic and occupancy prediction into independent branches, incorporating the mamba module to efficiently extract semantic and geometric features in 3D environments. This ensures the network can learn long-distance dependencies and improve prediction accuracy. Features are then combined within the Bird's Eye View (BEV) space to minimise computational overhead during feature fusion. The resulting semantic occupancy map is integrated into the local map, providing occlusion awareness of the dynamic environment. Our AGR-Planner utilizes this local map and employs Kinodynamic A∗ search and gradient-based trajectory optimization for ESDF-free and energy-efficient planning. Experiments demonstrate that OccMamba outperforms the state-of-the-art 3D semantic occupancy network with 25.0% mIoU. End-to-end navigation experiments in dynamic scenes verify OMEGA's efficiency, achieving a 96% average planning success rate.</p>	-
dc.language	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers	-
dc.relation.ispartof	IEEE Robotics and Automation Letters	-
dc.subject	3D semantic occupancy prediction	-
dc.subject	autonomous navigation	-
dc.subject	Deep learning for visual perception	-
dc.title	OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model	-
dc.type	Article	-
dc.identifier.doi	10.1109/LRA.2024.3518076	-
dc.identifier.scopus	eid_2-s2.0-86000383385	-
dc.identifier.volume	10	-
dc.identifier.issue	2	-
dc.identifier.spage	1066	-
dc.identifier.epage	1073	-
dc.identifier.eissn	2377-3766	-
dc.identifier.issnl	2377-3766	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robots in Dynamic Environments via State Space Model

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats