Article: FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations

Title: FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations
Authors: Tao, Chaofan; Lin, Rui; Chen, Quan; Zhang, Zhaoyang; Luo, Ping; Wong, Ngai
Issue Date: 21-Oct-2022
Publisher: Institute of Electrical and Electronics Engineers
Citation: IEEE Transactions on Neural Networks and Learning Systems, 2022, v. 13
Abstract

Learning low-bitwidth convolutional neural networks (CNNs) is challenging because performance may drop significantly after quantization. Prior art often quantizes the network weights by carefully tuning hyperparameters such as nonuniform step size and layerwise bitwidths, which is complicated because the full- and low-precision representations have large discrepancies. This work presents a novel quantization pipeline, named frequency-aware transformation (FAT), that offers three benefits: 1) instead of designing complicated quantizers, FAT learns to transform network weights in the frequency domain to remove redundant information before quantization, making them amenable to training in low bitwidth with simple quantizers; 2) FAT readily embeds CNNs in low bitwidths using standard quantizers without tedious hyperparameter tuning, and theoretical analyses show that FAT minimizes the quantization errors in both uniform and nonuniform quantization; and 3) FAT can be easily plugged into various CNN architectures. Using FAT with a simple uniform or logarithmic quantizer achieves state-of-the-art performance at different bitwidths on various model architectures. FAT thus provides a novel frequency-based perspective for model quantization.
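
The general idea of transforming weights in the frequency domain before applying a simple quantizer can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract says FAT learns its transform, whereas the sketch below uses a fixed, hand-picked mask on a one-dimensional weight vector. The function names (`dft`, `idft`, `frequency_transform`, `uniform_quantize`), the mask values, and the symmetric uniform quantizer are all assumptions made for illustration.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform of a sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse discrete Fourier transform."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                for k in range(n)) / n
            for t in range(n)]


def frequency_transform(weights, mask):
    """Scale each frequency component by mask[k], then map back to weights.

    Assumption: the mask is fixed and symmetric (mask[k] == mask[n - k]),
    so the result is real up to rounding. A learned mask would replace it.
    """
    spectrum = dft(weights)
    filtered = [m * c for m, c in zip(mask, spectrum)]
    return [v.real for v in idft(filtered)]


def uniform_quantize(weights, bits):
    """Symmetric uniform quantizer with 2**(bits - 1) - 1 levels per sign."""
    levels = 2 ** (bits - 1) - 1
    scale = max(abs(w) for w in weights) or 1.0
    return [round(w / scale * levels) / levels * scale for w in weights]


# Hypothetical weights and mask, chosen only to exercise the functions.
weights = [0.9, -0.4, 0.1, 0.35, -0.8, 0.05, 0.6, -0.2]
mask = [1.0, 1.0, 0.5, 0.0, 0.0, 0.0, 0.5, 1.0]

transformed = frequency_transform(weights, mask)
quantized = uniform_quantize(transformed, bits=4)
print(quantized)
```

The sketch keeps the quantizer deliberately simple, in line with the abstract's claim that the transform, rather than the quantizer, carries the complexity. In an actual training setup the mask would be a trainable parameter and the rounding step would need a gradient approximation, neither of which is shown here.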


Persistent Identifier: http://hdl.handle.net/10722/339470
ISSN: 2162-237X
2023 Impact Factor: 10.2
2023 SCImago Journal Rankings: 4.170

DC Field | Value | Language
dc.contributor.author | Tao, Chaofan | -
dc.contributor.author | Lin, Rui | -
dc.contributor.author | Chen, Quan | -
dc.contributor.author | Zhang, Zhaoyang | -
dc.contributor.author | Luo, Ping | -
dc.contributor.author | Wong, Ngai | -
dc.date.accessioned | 2024-03-11T10:36:54Z | -
dc.date.available | 2024-03-11T10:36:54Z | -
dc.date.issued | 2022-10-21 | -
dc.identifier.citation | IEEE Transactions on Neural Networks and Learning Systems, 2022, v. 13 | -
dc.identifier.issn | 2162-237X | -
dc.identifier.uri | http://hdl.handle.net/10722/339470 | -
dc.language | eng | -
dc.publisher | Institute of Electrical and Electronics Engineers | -
dc.relation.ispartof | IEEE Transactions on Neural Networks and Learning Systems | -
dc.title | FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations | -
dc.type | Article | -
dc.identifier.doi | 10.1109/TNNLS.2022.3190607 | -
dc.identifier.volume | 13 | -
dc.identifier.eissn | 2162-2388 | -
dc.identifier.issnl | 2162-237X | -
