File Download

There are no files associated with this item.

Conference Paper: Transductive Zero-Shot Learning with Visual Structure Constraint

Title: Transductive Zero-Shot Learning with Visual Structure Constraint
Authors: Wan, Z; Chen, D; Li, Y; Yan, X; Zhang, J; Yu, Y; Liao, J
Issue Date: 2019
Publisher: Morgan Kaufmann Publishers
Citation: 33rd Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, December 8-14, 2019. In Advances in Neural Information Processing Systems (NeurIPS), v. 32, p. 9972-9982
Abstract: To recognize objects of the unseen classes, most existing Zero-Shot Learning (ZSL) methods first learn a compatible projection function between the common semantic space and the visual space based on the data of source seen classes, and then directly apply it to the target unseen classes. However, in real scenarios, the data distribution between the source and target domain might not match well, causing the well-known domain shift problem. Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i.e., alleviate the above domain shift problem). Specifically, three different strategies (symmetric Chamfer distance, bipartite matching distance, and Wasserstein distance) are adopted to align the projected unseen semantic centers and visual cluster centers of test instances. We also propose a new training strategy to handle the real cases where many unrelated images exist in the test dataset, which is not considered in previous methods. Experiments on many widely used datasets demonstrate that the proposed visual structure constraint can bring substantial performance gains consistently and achieve state-of-the-art results.
Persistent Identifier: http://hdl.handle.net/10722/316287
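
Of the three alignment strategies named in the abstract, the symmetric Chamfer distance is the simplest to illustrate: each projected semantic center is pulled toward its nearest visual cluster center, and vice versa. A minimal NumPy sketch follows (function and variable names are ours for illustration, not taken from the paper, which may use squared distances or additional weighting):

```python
import numpy as np

def symmetric_chamfer(semantic_centers, visual_centers):
    """Symmetric Chamfer distance between two sets of class centers.

    semantic_centers: (n, d) projected unseen-class semantic centers.
    visual_centers:   (m, d) cluster centers of test visual features
                      (e.g. obtained by k-means).
    """
    # Pairwise Euclidean distances, shape (n, m).
    dists = np.linalg.norm(
        semantic_centers[:, None, :] - visual_centers[None, :, :], axis=-1
    )
    # Nearest-neighbour distance from each set to the other, averaged and summed.
    return dists.min(axis=1).mean() + dists.min(axis=0).mean()

# Identical center sets align perfectly, so the distance is zero.
a = np.array([[0.0, 0.0], [1.0, 1.0]])
print(symmetric_chamfer(a, a))  # -> 0.0
```

Minimizing such a set-to-set distance during training constrains the projection so that projected semantic centers land near the visual cluster structure of the (unlabeled) test instances, which is what makes the method transductive.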


DC Field | Value | Language
dc.contributor.author | Wan, Z | -
dc.contributor.author | Chen, D | -
dc.contributor.author | Li, Y | -
dc.contributor.author | Yan, X | -
dc.contributor.author | Zhang, J | -
dc.contributor.author | Yu, Y | -
dc.contributor.author | Liao, J | -
dc.date.accessioned | 2022-09-02T06:08:49Z | -
dc.date.available | 2022-09-02T06:08:49Z | -
dc.date.issued | 2019 | -
dc.identifier.citation | 33rd Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, December 8-14, 2019. In Advances in Neural Information Processing Systems (NeurIPS), v. 32, p. 9972-9982 | -
dc.identifier.uri | http://hdl.handle.net/10722/316287 | -
dc.description.abstract | To recognize objects of the unseen classes, most existing Zero-Shot Learning (ZSL) methods first learn a compatible projection function between the common semantic space and the visual space based on the data of source seen classes, and then directly apply it to the target unseen classes. However, in real scenarios, the data distribution between the source and target domain might not match well, causing the well-known domain shift problem. Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i.e., alleviate the above domain shift problem). Specifically, three different strategies (symmetric Chamfer distance, bipartite matching distance, and Wasserstein distance) are adopted to align the projected unseen semantic centers and visual cluster centers of test instances. We also propose a new training strategy to handle the real cases where many unrelated images exist in the test dataset, which is not considered in previous methods. Experiments on many widely used datasets demonstrate that the proposed visual structure constraint can bring substantial performance gains consistently and achieve state-of-the-art results. | -
dc.language | eng | -
dc.publisher | Morgan Kaufmann Publishers | -
dc.relation.ispartof | Advances in Neural Information Processing Systems (NeurIPS) | -
dc.title | Transductive Zero-Shot Learning with Visual Structure Constraint | -
dc.type | Conference_Paper | -
dc.identifier.email | Yu, Y: yzyu@cs.hku.hk | -
dc.identifier.authority | Yu, Y=rp01415 | -
dc.identifier.hkuros | 336352 | -
dc.identifier.volume | 32 | -
dc.identifier.spage | 9972 | -
dc.identifier.epage | 9982 | -
dc.publisher.place | United States | -
