File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Appears in Collections:
Conference Paper: Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution
Title | Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution |
---|---|
Authors | |
Issue Date | 2021 |
Publisher | ML Research Press. The Journal's web site is located at http://proceedings.mlr.press/ |
Citation | The 38th International Conference on Machine Learning (ICML), Virtual Conference, 18-24 July 2021. In Proceedings of Machine Learning Research (PMLR), v. 139: Proceedings of ICML 2021, p. 12546-12556 How to Cite? |
Abstract | Model quantization is challenging due to many tedious hyper-parameters such as precision (bitwidth), dynamic range (minimum and maximum discrete values) and stepsize (interval between discrete values). Unlike prior arts that carefully tune these values, we present a fully differentiable approach to learn all of them, named Differentiable Dynamic Quantization (DDQ), which has several benefits. (1) DDQ is able to quantize challenging lightweight architectures like MobileNets, where different layers prefer different quantization parameters. (2) DDQ is hardware-friendly and can be easily implemented using low-precision matrix-vector multiplication, making it capable in many hardware such as ARM. (3) Extensive experiments show that DDQ outperforms prior arts on many networks and benchmarks, especially when models are already efficient and compact. e.g., DDQ is the first approach that achieves lossless 4-bit quantization for MobileNetV2 on ImageNet. |
Description | Applications (CV and NLP) Session |
Persistent Identifier | http://hdl.handle.net/10722/301433 |
ISSN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhang, Z | - |
dc.contributor.author | Shao, W | - |
dc.contributor.author | Gu, J | - |
dc.contributor.author | Wang, X | - |
dc.contributor.author | Luo, P | - |
dc.date.accessioned | 2021-07-27T08:11:00Z | - |
dc.date.available | 2021-07-27T08:11:00Z | - |
dc.date.issued | 2021 | - |
dc.identifier.citation | The 38th International Conference on Machine Learning (ICML), Virtual Conference, 18-24 July 2021. In Proceedings of Machine Learning Research (PMLR), v. 139: Proceedings of ICML 2021, p. 12546-12556 | - |
dc.identifier.issn | 2640-3498 | - |
dc.identifier.uri | http://hdl.handle.net/10722/301433 | - |
dc.description | Applications (CV and NLP) Session | - |
dc.description.abstract | Model quantization is challenging due to many tedious hyper-parameters such as precision (bitwidth), dynamic range (minimum and maximum discrete values) and stepsize (interval between discrete values). Unlike prior arts that carefully tune these values, we present a fully differentiable approach to learn all of them, named Differentiable Dynamic Quantization (DDQ), which has several benefits. (1) DDQ is able to quantize challenging lightweight architectures like MobileNets, where different layers prefer different quantization parameters. (2) DDQ is hardware-friendly and can be easily implemented using low-precision matrix-vector multiplication, making it capable in many hardware such as ARM. (3) Extensive experiments show that DDQ outperforms prior arts on many networks and benchmarks, especially when models are already efficient and compact. e.g., DDQ is the first approach that achieves lossless 4-bit quantization for MobileNetV2 on ImageNet. | - |
dc.language | eng | - |
dc.publisher | ML Research Press. The Journal's web site is located at http://proceedings.mlr.press/ | - |
dc.relation.ispartof | Proceedings of Machine Learning Research (PMLR) | - |
dc.relation.ispartof | The 38th International Conference on Machine Learning (ICML), 2021 | - |
dc.title | Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution | - |
dc.type | Conference_Paper | - |
dc.identifier.email | Luo, P: pluo@hku.hk | - |
dc.identifier.authority | Luo, P=rp02575 | - |
dc.identifier.hkuros | 323759 | - |
dc.identifier.volume | 139: Proceedings of ICML 2021 | - |
dc.identifier.spage | 12546 | - |
dc.identifier.epage | 12556 | - |
dc.publisher.place | United States | - |