File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Reinforcement Learning for Autonomous Underwater Vehicles via Data-Informed Domain Randomization

TitleReinforcement Learning for Autonomous Underwater Vehicles via Data-Informed Domain Randomization
Authors
Keywordsautonomous underwater vehicles
domain randomization
reinforcement learning
uncertainty attenuation
Issue Date29-Jan-2023
PublisherMDPI
Citation
Applied Sciences, 2023, v. 13, n. 3 How to Cite?
Abstract

Autonomous Underwater Vehicles (AUVs) or underwater vehicle-manipulator systems often have large model uncertainties from degenerated or damaged thrusters, varying payloads, disturbances from currents, etc. Other constraints, such as input dead zones and saturations, make the feedback controllers difficult to tune online. Model-free Reinforcement Learning (RL) has been applied to control AUVs, but most results were validated through numerical simulations. The trained controllers often perform unsatisfactorily on real AUVs; this is because the distributions of the AUV dynamics in numerical simulations and those of real AUVs are mismatched. This paper presents a model-free RL via Data-informed Domain Randomization (DDR) for controlling AUVs, where the mismatches between the trajectory data from numerical simulations and the real AUV were minimized by adjusting the parameters in the simulated AUVs. The DDR strategy extends the existing adaptive domain randomization technique by aggregating an input network to learn mappings between control signals across domains, enabling the controller to adapt to sudden changes in dynamics. The proposed RL via DDR was tested on the problems of AUV pose regulation through extensive numerical simulations and experiments in a lab tank with an underwater positioning system. These results have demonstrated the effectiveness of RL-DDR for transferring trained controllers to AUVs with different dynamics.


Persistent Identifierhttp://hdl.handle.net/10722/338597
ISSN
2023 Impact Factor: 2.5
2023 SCImago Journal Rankings: 0.508
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLu, Wenjie-
dc.contributor.authorCheng, Kai-
dc.contributor.authorHu, Manman-
dc.date.accessioned2024-03-11T10:30:05Z-
dc.date.available2024-03-11T10:30:05Z-
dc.date.issued2023-01-29-
dc.identifier.citationApplied Sciences, 2023, v. 13, n. 3-
dc.identifier.issn2076-3417-
dc.identifier.urihttp://hdl.handle.net/10722/338597-
dc.description.abstract<p>Autonomous Underwater Vehicles (AUVs) or underwater vehicle-manipulator systems often have large model uncertainties from degenerated or damaged thrusters, varying payloads, disturbances from currents, etc. Other constraints, such as input dead zones and saturations, make the feedback controllers difficult to tune online. Model-free Reinforcement Learning (RL) has been applied to control AUVs, but most results were validated through numerical simulations. The trained controllers often perform unsatisfactorily on real AUVs; this is because the distributions of the AUV dynamics in numerical simulations and those of real AUVs are mismatched. This paper presents a model-free RL via Data-informed Domain Randomization (DDR) for controlling AUVs, where the mismatches between the trajectory data from numerical simulations and the real AUV were minimized by adjusting the parameters in the simulated AUVs. The DDR strategy extends the existing adaptive domain randomization technique by aggregating an input network to learn mappings between control signals across domains, enabling the controller to adapt to sudden changes in dynamics. The proposed RL via DDR was tested on the problems of AUV pose regulation through extensive numerical simulations and experiments in a lab tank with an underwater positioning system. These results have demonstrated the effectiveness of RL-DDR for transferring trained controllers to AUVs with different dynamics.<br></p>-
dc.languageeng-
dc.publisherMDPI-
dc.relation.ispartofApplied Sciences-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectautonomous underwater vehicles-
dc.subjectdomain randomization-
dc.subjectreinforcement learning-
dc.subjectuncertainty attenuation-
dc.titleReinforcement Learning for Autonomous Underwater Vehicles via Data-Informed Domain Randomization-
dc.typeArticle-
dc.identifier.doi10.3390/app13031723-
dc.identifier.scopuseid_2-s2.0-85147905087-
dc.identifier.volume13-
dc.identifier.issue3-
dc.identifier.eissn2076-3417-
dc.identifier.isiWOS:000929228700001-
dc.identifier.issnl2076-3417-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats