Showing results 1 to 5 of 5
| Title | Author(s) | Issue Date | |
|---|---|---|---|
| 2024 | | | |
| DyBit: Dynamic Bit-Precision Numbers for Efficient Quantized Neural Network Inference. Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | | 13-Dec-2023 | |
| FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations. Journal: IEEE Transactions on Neural Networks and Learning Systems | | 21-Oct-2022 | |
| ODG-Q: Robust Quantization via Online Domain Generalization. Proceeding/Conference: 26th International Conference on Pattern Recognition, ICPR2022 (21/08/2022-25/08/2022, Montreal, Quebec) | | 21-Aug-2022 | |
| Structured Pruning for Efficient Generative Pre-trained Language Models. Proceeding/Conference: Annual Meeting of the Association for Computational Linguistics - ACL 2023 (09/07/2023-14/07/2023, Toronto, Canada) | | 9-Jul-2023 | |
