Conference Paper: Exploring Self-attention for Image Recognition

Title: Exploring Self-attention for Image Recognition
Authors: Zhao, Hengshuang; Jia, Jiaya; Koltun, Vladlen
Issue Date: 2020
Citation: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020, p. 10073-10082
Abstract: Recent work has shown that self-attention can serve as a basic building block for image recognition models. We explore variations of self-attention and assess their effectiveness for image recognition. We consider two forms of self-attention. One is pairwise self-attention, which generalizes standard dot-product attention and is fundamentally a set operator. The other is patchwise self-attention, which is strictly more powerful than convolution. Our pairwise self-attention networks match or outperform their convolutional counterparts, and the patchwise models substantially outperform the convolutional baselines. We also conduct experiments that probe the robustness of learned representations and conclude that self-attention networks may have significant benefits in terms of robustness and generalization.
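
The two attention forms named in the abstract can be made concrete with a small sketch. The PyTorch code below is a minimal illustration under stated assumptions, not the paper's implementation: the class names, the choice of subtraction as the pairwise relation, and the single-footprint input shapes are assumptions for clarity. The pairwise form computes a weight for each pair (i, j) from the features x_i and x_j alone (hence a set operator), while the patchwise form computes all weights from the whole patch at once (hence strictly more expressive than a convolution with the same footprint).

```python
import torch
import torch.nn as nn


class PairwiseSelfAttention(nn.Module):
    """Pairwise form: the weight for pair (i, j) depends only on x_i and x_j,
    so the aggregation over a footprint is a set operator."""

    def __init__(self, dim):
        super().__init__()
        # gamma maps the pairwise relation delta(x_i, x_j) to per-channel weights
        self.gamma = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(), nn.Linear(dim, dim))
        self.beta = nn.Linear(dim, dim)  # value transform

    def forward(self, x):
        # x: (n, dim) feature vectors drawn from one local footprint
        delta = x.unsqueeze(1) - x.unsqueeze(0)            # subtraction relation, (n, n, dim)
        alpha = torch.softmax(self.gamma(delta), dim=1)    # normalize over neighbors j
        return (alpha * self.beta(x).unsqueeze(0)).sum(1)  # aggregated features, (n, dim)


class PatchwiseSelfAttention(nn.Module):
    """Patchwise form: the weights for a position are computed from the whole
    patch at once, so the operator is not a set operator and can express
    strictly more than a convolution with the same footprint."""

    def __init__(self, dim, patch_size):
        super().__init__()
        self.patch_size = patch_size
        self.gamma = nn.Linear(dim * patch_size, dim * patch_size)
        self.beta = nn.Linear(dim, dim)

    def forward(self, patch):
        # patch: (patch_size, dim) -- all features in one local footprint
        w = self.gamma(patch.reshape(-1)).view(self.patch_size, -1)
        w = torch.softmax(w, dim=0)               # normalize over patch positions
        return (w * self.beta(patch)).sum(dim=0)  # aggregated feature, (dim,)


if __name__ == "__main__":
    x = torch.randn(9, 32)                         # a 3x3 footprint of 32-d features
    print(PairwiseSelfAttention(32)(x).shape)      # torch.Size([9, 32])
    print(PatchwiseSelfAttention(32, 9)(x).shape)  # torch.Size([32])
```

In the paper's networks these operators are applied over local footprints across a full feature map; the sketch above processes a single footprint to keep the contrast between the two forms visible.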
Persistent Identifier: http://hdl.handle.net/10722/303697
ISSN: 1063-6919
2020 SCImago Journal Rankings: 4.658


DC Field | Value | Language
dc.contributor.author | Zhao, Hengshuang | -
dc.contributor.author | Jia, Jiaya | -
dc.contributor.author | Koltun, Vladlen | -
dc.date.accessioned | 2021-09-15T08:25:50Z | -
dc.date.available | 2021-09-15T08:25:50Z | -
dc.date.issued | 2020 | -
dc.identifier.citation | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020, p. 10073-10082 | -
dc.identifier.issn | 1063-6919 | -
dc.identifier.uri | http://hdl.handle.net/10722/303697 | -
dc.description.abstract | Recent work has shown that self-attention can serve as a basic building block for image recognition models. We explore variations of self-attention and assess their effectiveness for image recognition. We consider two forms of self-attention. One is pairwise self-attention, which generalizes standard dot-product attention and is fundamentally a set operator. The other is patchwise self-attention, which is strictly more powerful than convolution. Our pairwise self-attention networks match or outperform their convolutional counterparts, and the patchwise models substantially outperform the convolutional baselines. We also conduct experiments that probe the robustness of learned representations and conclude that self-attention networks may have significant benefits in terms of robustness and generalization. | -
dc.language | eng | -
dc.relation.ispartof | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition | -
dc.title | Exploring Self-attention for Image Recognition | -
dc.type | Conference_Paper | -
dc.description.nature | link_to_subscribed_fulltext | -
dc.identifier.doi | 10.1109/CVPR42600.2020.01009 | -
dc.identifier.scopus | eid_2-s2.0-85090597566 | -
dc.identifier.spage | 10073 | -
dc.identifier.epage | 10082 | -
