Ctrlformer: Learning transferable state  representation for visual control via transformer

Mu, Y; Chen, S; Ding, M; Chen , J; Chen, R; Luo, P

File Download

There are no files associated with this item.

Supplementary

Citations:
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Ctrlformer: Learning transferable state representation for visual control via transformer

Title	Ctrlformer: Learning transferable state representation for visual control via transformer
Authors	Mu, Y Chen, S Ding, M Chen , J Chen, R Luo, P
Issue Date	2022
Publisher	International Conference on Machine Learning.
Citation	39th International Conference on Machine Learning (ICML) (Hybrid), Baltimore MD, USA, July 17-23, 2022 How to Cite?
Abstract	Transformer has achieved great successes in learning vision and language representation, which is general across various downstream tasks. In visual control, learning transferable state representation that can transfer between different control tasks is important to reduce the training sample size. However, porting Transformer to sample-efficient visual control remains a challenging and unsolved problem. To this end, we propose a novel Control Transformer (CtrlFormer), possessing many appealing benefits that prior arts do not have. Firstly, CtrlFormer jointly learns self-attention mechanisms between visual tokens and policy tokens among different control tasks, where multitask representation can be learned and transferred without catastrophic forgetting. Secondly, we carefully design a contrastive reinforcement learning paradigm to train CtrlFormer, enabling it to achieve high sample efficiency, which is important in control problems. For example, in the DMControl benchmark, unlike recent advanced methods that failed by producing a zero score in the 'Cartpole' task after transfer learning with 100k samples, CtrlFormer can achieve a state-of-the-art score with only 100k samples while maintaining the performance of previous tasks.
Persistent Identifier	http://hdl.handle.net/10722/315545

DC Field	Value	Language
dc.contributor.author	Mu, Y	-
dc.contributor.author	Chen, S	-
dc.contributor.author	Ding, M	-
dc.contributor.author	Chen , J	-
dc.contributor.author	Chen, R	-
dc.contributor.author	Luo, P	-
dc.date.accessioned	2022-08-19T08:59:53Z	-
dc.date.available	2022-08-19T08:59:53Z	-
dc.date.issued	2022	-
dc.identifier.citation	39th International Conference on Machine Learning (ICML) (Hybrid), Baltimore MD, USA, July 17-23, 2022	-
dc.identifier.uri	http://hdl.handle.net/10722/315545	-
dc.description.abstract	Transformer has achieved great successes in learning vision and language representation, which is general across various downstream tasks. In visual control, learning transferable state representation that can transfer between different control tasks is important to reduce the training sample size. However, porting Transformer to sample-efficient visual control remains a challenging and unsolved problem. To this end, we propose a novel Control Transformer (CtrlFormer), possessing many appealing benefits that prior arts do not have. Firstly, CtrlFormer jointly learns self-attention mechanisms between visual tokens and policy tokens among different control tasks, where multitask representation can be learned and transferred without catastrophic forgetting. Secondly, we carefully design a contrastive reinforcement learning paradigm to train CtrlFormer, enabling it to achieve high sample efficiency, which is important in control problems. For example, in the DMControl benchmark, unlike recent advanced methods that failed by producing a zero score in the 'Cartpole' task after transfer learning with 100k samples, CtrlFormer can achieve a state-of-the-art score with only 100k samples while maintaining the performance of previous tasks.	-
dc.language	eng	-
dc.publisher	International Conference on Machine Learning.	-
dc.title	Ctrlformer: Learning transferable state representation for visual control via transformer	-
dc.type	Conference_Paper	-
dc.identifier.email	Luo, P: pluo@hku.hk	-
dc.identifier.authority	Luo, P=rp02575	-
dc.identifier.hkuros	335573	-
dc.publisher.place	United States	-

File Download

Supplementary

Conference Paper: Ctrlformer: Learning transferable state representation for visual control via transformer

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats