File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/WACV57701.2024.00140
- Scopus: eid_2-s2.0-85191996148
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Adaptive Deep Neural Network Inference Optimization with EENet
Title | Adaptive Deep Neural Network Inference Optimization with EENet |
---|---|
Authors | |
Keywords | Algorithms Image recognition and understanding |
Issue Date | 2024 |
Citation | Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024, 2024, p. 1362-1371 How to Cite? |
Abstract | Well-trained deep neural networks (DNNs) treat all test samples equally during prediction. Adaptive DNN inference with early exiting leverages the observation that some test examples can be easier to predict than others. This paper presents EENet, a novel early-exiting scheduling framework for multi-exit DNN models. Instead of having every sample go through all DNN layers during prediction, EENet learns an early exit scheduler, which can intelligently terminate the inference earlier for certain predictions, which the model has high confidence of early exit. As opposed to previous early-exiting solutions with heuristics-based methods, our EENet framework optimizes an early-exiting policy to maximize model accuracy while satisfying the given per-sample average inference budget. Extensive experiments are conducted on four computer vision datasets (CIFAR-10, CIFAR-100, ImageNet, Cityscapes) and two NLP datasets (SST-2, AgNews). The results demonstrate that the adaptive inference by EENet can outperform the representative existing early exit techniques. We also perform a detailed visualization analysis of the comparison results to interpret the benefits of EENet. |
Persistent Identifier | http://hdl.handle.net/10722/343495 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ilhan, Fatih | - |
dc.contributor.author | Chow, Ka Ho | - |
dc.contributor.author | Hu, Sihao | - |
dc.contributor.author | Huang, Tiansheng | - |
dc.contributor.author | Tekin, Selim | - |
dc.contributor.author | Wei, Wenqi | - |
dc.contributor.author | Wu, Yanzhao | - |
dc.contributor.author | Lee, Myungjin | - |
dc.contributor.author | Kompella, Ramana | - |
dc.contributor.author | Latapie, Hugo | - |
dc.contributor.author | Liu, Gaowen | - |
dc.contributor.author | Liu, Ling | - |
dc.date.accessioned | 2024-05-10T09:08:34Z | - |
dc.date.available | 2024-05-10T09:08:34Z | - |
dc.date.issued | 2024 | - |
dc.identifier.citation | Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024, 2024, p. 1362-1371 | - |
dc.identifier.uri | http://hdl.handle.net/10722/343495 | - |
dc.description.abstract | Well-trained deep neural networks (DNNs) treat all test samples equally during prediction. Adaptive DNN inference with early exiting leverages the observation that some test examples can be easier to predict than others. This paper presents EENet, a novel early-exiting scheduling framework for multi-exit DNN models. Instead of having every sample go through all DNN layers during prediction, EENet learns an early exit scheduler, which can intelligently terminate the inference earlier for certain predictions, which the model has high confidence of early exit. As opposed to previous early-exiting solutions with heuristics-based methods, our EENet framework optimizes an early-exiting policy to maximize model accuracy while satisfying the given per-sample average inference budget. Extensive experiments are conducted on four computer vision datasets (CIFAR-10, CIFAR-100, ImageNet, Cityscapes) and two NLP datasets (SST-2, AgNews). The results demonstrate that the adaptive inference by EENet can outperform the representative existing early exit techniques. We also perform a detailed visualization analysis of the comparison results to interpret the benefits of EENet. | - |
dc.language | eng | - |
dc.relation.ispartof | Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 | - |
dc.subject | Algorithms | - |
dc.subject | Image recognition and understanding | - |
dc.title | Adaptive Deep Neural Network Inference Optimization with EENet | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/WACV57701.2024.00140 | - |
dc.identifier.scopus | eid_2-s2.0-85191996148 | - |
dc.identifier.spage | 1362 | - |
dc.identifier.epage | 1371 | - |