Learning Spatial Attention for Face Super-Resolution

Chen, C; Gong, D; Wang, H; Li, Z; Wong, KYK

File Download

Content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/TIP.2020.3043093
Scopus: eid_2-s2.0-85098119837
PMID: 33315560
WOS: WOS:000603026100002
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: Learning Spatial Attention for Face Super-Resolution

Title	Learning Spatial Attention for Face Super-Resolution
Authors	Chen, C Gong, D Wang, H Li, Z Wong, KYK
Keywords	Face super-resolution spatial attention generative adversarial networks
Issue Date	2021
Publisher	Institute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=83
Citation	IEEE Transactions on Image Processing, 2021, v. 30, p. 1219-1231 How to Cite? DOI: http://dx.doi.org/10.1109/TIP.2020.3043093
Abstract	General image super-resolution techniques have difficulties in recovering detailed face structures when applying to low resolution face images. Recent deep learning based methods tailored for face images have achieved improved performance by jointly trained with additional task such as face parsing and landmark prediction. However, multi-task learning requires extra manually labeled data. Besides, most of the existing works can only generate relatively low resolution face images (e.g., 128 × 128), and their applications are therefore limited. In this paper, we introduce a novel SPatial Attention Residual Network (SPARNet) built on our newly proposed Face Attention Units (FAUs) for face super-resolution. Specifically, we introduce a spatial attention mechanism to the vanilla residual blocks. This enables the convolutional layers to adaptively bootstrap features related to the key face structures and pay less attention to those less feature-rich regions. This makes the training more effective and efficient as the key face structures only account for a very small portion of the face image. Visualization of the attention maps shows that our spatial attention network can capture the key face structures well even for very low resolution faces (e.g., 16×16). Quantitative comparisons on various kinds of metrics (including PSNR, SSIM, identity similarity, and landmark detection) demonstrate the superiority of our method over current state-of-the-arts. We further extend SPARNet with multi-scale discriminators, named as SPARNetHD, to produce high resolution results (i.e., 512×512). We show that SPARNetHD trained with synthetic data can not only produce high quality and high resolution outputs for synthetically degraded face images, but also show good generalization ability to real world low quality face images. Codes are available at https://github.com/chaofengc/Face-SPARNet.
Persistent Identifier	http://hdl.handle.net/10722/301191
ISSN	1057-7149 2023 Impact Factor: 10.8 2023 SCImago Journal Rankings: 3.556
ISI Accession Number ID	WOS:000603026100002

DC Field	Value	Language
dc.contributor.author	Chen, C	-
dc.contributor.author	Gong, D	-
dc.contributor.author	Wang, H	-
dc.contributor.author	Li, Z	-
dc.contributor.author	Wong, KYK	-
dc.date.accessioned	2021-07-27T08:07:28Z	-
dc.date.available	2021-07-27T08:07:28Z	-
dc.date.issued	2021	-
dc.identifier.citation	IEEE Transactions on Image Processing, 2021, v. 30, p. 1219-1231	-
dc.identifier.issn	1057-7149	-
dc.identifier.uri	http://hdl.handle.net/10722/301191	-
dc.description.abstract	General image super-resolution techniques have difficulties in recovering detailed face structures when applying to low resolution face images. Recent deep learning based methods tailored for face images have achieved improved performance by jointly trained with additional task such as face parsing and landmark prediction. However, multi-task learning requires extra manually labeled data. Besides, most of the existing works can only generate relatively low resolution face images (e.g., 128 × 128), and their applications are therefore limited. In this paper, we introduce a novel SPatial Attention Residual Network (SPARNet) built on our newly proposed Face Attention Units (FAUs) for face super-resolution. Specifically, we introduce a spatial attention mechanism to the vanilla residual blocks. This enables the convolutional layers to adaptively bootstrap features related to the key face structures and pay less attention to those less feature-rich regions. This makes the training more effective and efficient as the key face structures only account for a very small portion of the face image. Visualization of the attention maps shows that our spatial attention network can capture the key face structures well even for very low resolution faces (e.g., 16×16). Quantitative comparisons on various kinds of metrics (including PSNR, SSIM, identity similarity, and landmark detection) demonstrate the superiority of our method over current state-of-the-arts. We further extend SPARNet with multi-scale discriminators, named as SPARNetHD, to produce high resolution results (i.e., 512×512). We show that SPARNetHD trained with synthetic data can not only produce high quality and high resolution outputs for synthetically degraded face images, but also show good generalization ability to real world low quality face images. Codes are available at https://github.com/chaofengc/Face-SPARNet.	-
dc.language	eng	-
dc.publisher	Institute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=83	-
dc.relation.ispartof	IEEE Transactions on Image Processing	-
dc.rights	IEEE Transactions on Image Processing. Copyright © Institute of Electrical and Electronics Engineers.	-
dc.rights	©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	-
dc.subject	Face super-resolution	-
dc.subject	spatial attention	-
dc.subject	generative adversarial networks	-
dc.title	Learning Spatial Attention for Face Super-Resolution	-
dc.type	Article	-
dc.identifier.email	Wong, KYK: kykwong@cs.hku.hk	-
dc.identifier.authority	Wong, KYK=rp01393	-
dc.description.nature	postprint	-
dc.identifier.doi	10.1109/TIP.2020.3043093	-
dc.identifier.pmid	33315560	-
dc.identifier.scopus	eid_2-s2.0-85098119837	-
dc.identifier.hkuros	323623	-
dc.identifier.hkuros	323470	-
dc.identifier.volume	30	-
dc.identifier.spage	1219	-
dc.identifier.epage	1231	-
dc.identifier.isi	WOS:000603026100002	-
dc.publisher.place	United States	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Learning Spatial Attention for Face Super-Resolution

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats