Conference Paper: AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
Title | AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners |
---|---|
Authors | Liang, Zhixuan; Mu, Yao; Ding, Mingyu; Ni, Fei; Tomizuka, Masayoshi; Luo, Ping |
Issue Date | 23-Jul-2023 |
Abstract | Diffusion models have demonstrated their powerful generative capability in many tasks and have great potential to serve as a paradigm for offline reinforcement learning. However, the quality of the diffusion model is limited by the insufficient diversity of training data, which hinders planning performance and generalizability to new tasks. This paper introduces AdaptDiffuser, an evolutionary planning method with diffusion that can self-evolve to improve the diffusion model, and hence the planner, not only on seen tasks but also on unseen tasks. AdaptDiffuser generates rich synthetic expert data for goal-conditioned tasks using guidance from reward gradients, then selects high-quality data via a discriminator to finetune the diffusion model, which improves its generalization to unseen tasks. Empirical experiments on two benchmark environments and two carefully designed unseen tasks in the KUKA industrial robot arm and Maze2D environments demonstrate the effectiveness of AdaptDiffuser. For example, AdaptDiffuser not only outperforms the prior art, Diffuser, by 20.8% on Maze2D and 7.5% on MuJoCo locomotion, but also adapts better to new tasks, e.g., KUKA pick-and-place, by 27.9% without requiring additional expert data. More visualization results and demo videos can be found on our project page. A minimal sketch of this generate-filter-finetune loop is given after the table. |
Persistent Identifier | http://hdl.handle.net/10722/340273 |
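The abstract describes an iterative generate-filter-finetune loop: trajectories are sampled from the diffusion model under reward-gradient guidance, only high-quality samples are kept via a discriminator, and the model is finetuned on the kept data. Below is a minimal, self-contained Python sketch of that loop. It is illustrative only: `reward_fn`, `sample_guided`, `select_high_quality`, `finetune`, and the toy 2-D goal-reaching "model" are hypothetical stand-ins, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)


def reward_fn(traj: np.ndarray) -> float:
    # Toy goal-conditioned reward: negative distance of the final state to a goal at the origin.
    return -float(np.linalg.norm(traj[-1]))


def sample_guided(model_mean: np.ndarray, n: int, guidance_scale: float = 0.1) -> list[np.ndarray]:
    # Stand-in for reward-gradient-guided diffusion sampling: perturb the model's
    # mean trajectory, then nudge the final state toward the goal.
    samples = []
    for _ in range(n):
        traj = model_mean + rng.normal(scale=0.5, size=model_mean.shape)
        traj[-1] = traj[-1] - guidance_scale * traj[-1]  # crude "guidance" step toward higher reward
        samples.append(traj)
    return samples


def select_high_quality(trajs: list[np.ndarray], threshold: float) -> list[np.ndarray]:
    # The paper uses a discriminator; a simple reward threshold plays that role here.
    return [t for t in trajs if reward_fn(t) >= threshold]


def finetune(model_mean: np.ndarray, data: list[np.ndarray], lr: float = 0.5) -> np.ndarray:
    # Stand-in for finetuning the diffusion model: move the toy "model" toward the selected data.
    if not data:
        return model_mean
    return (1.0 - lr) * model_mean + lr * np.mean(data, axis=0)


# Self-evolving loop: generate -> select -> finetune, repeated.
model_mean = rng.normal(size=(16, 2))  # toy "model": a mean trajectory of 16 states in 2-D
for it in range(5):
    candidates = sample_guided(model_mean, n=64)
    kept = select_high_quality(candidates, threshold=-1.0)
    model_mean = finetune(model_mean, kept)
    mean_reward = float(np.mean([reward_fn(t) for t in candidates]))
    print(f"iter {it}: kept {len(kept)}/64 candidates, mean reward {mean_reward:.3f}")
```

In the actual method, the sampler is a guided diffusion model over state-action trajectories and the selection step uses a learned discriminator; the threshold above only mimics that filtering.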
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Liang, Zhixuan | - |
dc.contributor.author | Mu, Yao | - |
dc.contributor.author | Ding, Mingyu | - |
dc.contributor.author | Ni, Fei | - |
dc.contributor.author | Tomizuka, Masayoshi | - |
dc.contributor.author | Luo, Ping | - |
dc.date.accessioned | 2024-03-11T10:42:56Z | - |
dc.date.available | 2024-03-11T10:42:56Z | - |
dc.date.issued | 2023-07-23 | - |
dc.identifier.uri | http://hdl.handle.net/10722/340273 | - |
dc.description.abstract | Diffusion models have demonstrated their powerful generative capability in many tasks and have great potential to serve as a paradigm for offline reinforcement learning. However, the quality of the diffusion model is limited by the insufficient diversity of training data, which hinders planning performance and generalizability to new tasks. This paper introduces AdaptDiffuser, an evolutionary planning method with diffusion that can self-evolve to improve the diffusion model, and hence the planner, not only on seen tasks but also on unseen tasks. AdaptDiffuser generates rich synthetic expert data for goal-conditioned tasks using guidance from reward gradients, then selects high-quality data via a discriminator to finetune the diffusion model, which improves its generalization to unseen tasks. Empirical experiments on two benchmark environments and two carefully designed unseen tasks in the KUKA industrial robot arm and Maze2D environments demonstrate the effectiveness of AdaptDiffuser. For example, AdaptDiffuser not only outperforms the prior art, Diffuser, by 20.8% on Maze2D and 7.5% on MuJoCo locomotion, but also adapts better to new tasks, e.g., KUKA pick-and-place, by 27.9% without requiring additional expert data. More visualization results and demo videos can be found on our project page. | -
dc.language | eng | - |
dc.relation.ispartof | International Conference on Machine Learning (23/07/2023-29/07/2023, Honolulu, Hawaii) | - |
dc.title | AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | - |
dc.type | Conference_Paper | - |
dc.description.nature | published_or_final_version | - |