GRADIENTS AS FEATURES FOR DEEP REPRESENTATION LEARNING

Mu, Fangzhou; Liang, Yingyu; Li, Yin

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Scopus: eid_2-s2.0-85101113031

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- HKU Musketeers Foundation Institute of Data Science: Conference papers

Conference Paper: GRADIENTS AS FEATURES FOR DEEP REPRESENTATION LEARNING

Title	GRADIENTS AS FEATURES FOR DEEP REPRESENTATION LEARNING
Authors	Mu, Fangzhou Liang, Yingyu Li, Yin
Issue Date	2020
Citation	8th International Conference on Learning Representations, ICLR 2020, 2020 How to Cite?
Abstract	We address the challenging problem of deep representation learning - the efficient adaption of a pre-trained deep network to different tasks. Specifically, we propose to explore gradient-based features. These features are gradients of the model parameters with respect to a task-specific loss given an input sample. Our key innovation is the design of a linear model that incorporates both gradient and activation of the pre-trained network. We demonstrate that our model provides a local linear approximation to an underlying deep model, and discuss important theoretical insights. Moreover, we present an efficient algorithm for the training and inference of our model without computing the actual gradients. Our method is evaluated across a number of representation-learning tasks on several datasets and using different network architectures. Strong results are obtained in all settings, and are well-aligned with our theoretical insights.
Persistent Identifier	http://hdl.handle.net/10722/341298

DC Field	Value	Language
dc.contributor.author	Mu, Fangzhou	-
dc.contributor.author	Liang, Yingyu	-
dc.contributor.author	Li, Yin	-
dc.date.accessioned	2024-03-13T08:41:43Z	-
dc.date.available	2024-03-13T08:41:43Z	-
dc.date.issued	2020	-
dc.identifier.citation	8th International Conference on Learning Representations, ICLR 2020, 2020	-
dc.identifier.uri	http://hdl.handle.net/10722/341298	-
dc.description.abstract	We address the challenging problem of deep representation learning - the efficient adaption of a pre-trained deep network to different tasks. Specifically, we propose to explore gradient-based features. These features are gradients of the model parameters with respect to a task-specific loss given an input sample. Our key innovation is the design of a linear model that incorporates both gradient and activation of the pre-trained network. We demonstrate that our model provides a local linear approximation to an underlying deep model, and discuss important theoretical insights. Moreover, we present an efficient algorithm for the training and inference of our model without computing the actual gradients. Our method is evaluated across a number of representation-learning tasks on several datasets and using different network architectures. Strong results are obtained in all settings, and are well-aligned with our theoretical insights.	-
dc.language	eng	-
dc.relation.ispartof	8th International Conference on Learning Representations, ICLR 2020	-
dc.title	GRADIENTS AS FEATURES FOR DEEP REPRESENTATION LEARNING	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.scopus	eid_2-s2.0-85101113031	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: GRADIENTS AS FEATURES FOR DEEP REPRESENTATION LEARNING

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats