Adaptive Deep Neural Network Inference Optimization with EENet

Ilhan, Fatih; Chow, Ka Ho; Hu, Sihao; Huang, Tiansheng; Tekin, Selim; Wei, Wenqi; Wu, Yanzhao; Lee, Myungjin; Kompella, Ramana; Latapie, Hugo; Liu, Gaowen; Liu, Ling

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/WACV57701.2024.00140
Scopus: eid_2-s2.0-85191996148

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Adaptive Deep Neural Network Inference Optimization with EENet

Title	Adaptive Deep Neural Network Inference Optimization with EENet
Authors	Ilhan, Fatih Chow, Ka Ho Hu, Sihao Huang, Tiansheng Tekin, Selim Wei, Wenqi Wu, Yanzhao Lee, Myungjin Kompella, Ramana Latapie, Hugo Liu, Gaowen Liu, Ling
Keywords	Algorithms Image recognition and understanding
Issue Date	2024
Citation	Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024, 2024, p. 1362-1371 How to Cite? DOI: http://dx.doi.org/10.1109/WACV57701.2024.00140
Abstract	Well-trained deep neural networks (DNNs) treat all test samples equally during prediction. Adaptive DNN inference with early exiting leverages the observation that some test examples can be easier to predict than others. This paper presents EENet, a novel early-exiting scheduling framework for multi-exit DNN models. Instead of having every sample go through all DNN layers during prediction, EENet learns an early exit scheduler, which can intelligently terminate the inference earlier for certain predictions, which the model has high confidence of early exit. As opposed to previous early-exiting solutions with heuristics-based methods, our EENet framework optimizes an early-exiting policy to maximize model accuracy while satisfying the given per-sample average inference budget. Extensive experiments are conducted on four computer vision datasets (CIFAR-10, CIFAR-100, ImageNet, Cityscapes) and two NLP datasets (SST-2, AgNews). The results demonstrate that the adaptive inference by EENet can outperform the representative existing early exit techniques. We also perform a detailed visualization analysis of the comparison results to interpret the benefits of EENet.
Persistent Identifier	http://hdl.handle.net/10722/343495

DC Field	Value	Language
dc.contributor.author	Ilhan, Fatih	-
dc.contributor.author	Chow, Ka Ho	-
dc.contributor.author	Hu, Sihao	-
dc.contributor.author	Huang, Tiansheng	-
dc.contributor.author	Tekin, Selim	-
dc.contributor.author	Wei, Wenqi	-
dc.contributor.author	Wu, Yanzhao	-
dc.contributor.author	Lee, Myungjin	-
dc.contributor.author	Kompella, Ramana	-
dc.contributor.author	Latapie, Hugo	-
dc.contributor.author	Liu, Gaowen	-
dc.contributor.author	Liu, Ling	-
dc.date.accessioned	2024-05-10T09:08:34Z	-
dc.date.available	2024-05-10T09:08:34Z	-
dc.date.issued	2024	-
dc.identifier.citation	Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024, 2024, p. 1362-1371	-
dc.identifier.uri	http://hdl.handle.net/10722/343495	-
dc.description.abstract	Well-trained deep neural networks (DNNs) treat all test samples equally during prediction. Adaptive DNN inference with early exiting leverages the observation that some test examples can be easier to predict than others. This paper presents EENet, a novel early-exiting scheduling framework for multi-exit DNN models. Instead of having every sample go through all DNN layers during prediction, EENet learns an early exit scheduler, which can intelligently terminate the inference earlier for certain predictions, which the model has high confidence of early exit. As opposed to previous early-exiting solutions with heuristics-based methods, our EENet framework optimizes an early-exiting policy to maximize model accuracy while satisfying the given per-sample average inference budget. Extensive experiments are conducted on four computer vision datasets (CIFAR-10, CIFAR-100, ImageNet, Cityscapes) and two NLP datasets (SST-2, AgNews). The results demonstrate that the adaptive inference by EENet can outperform the representative existing early exit techniques. We also perform a detailed visualization analysis of the comparison results to interpret the benefits of EENet.	-
dc.language	eng	-
dc.relation.ispartof	Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024	-
dc.subject	Algorithms	-
dc.subject	Image recognition and understanding	-
dc.title	Adaptive Deep Neural Network Inference Optimization with EENet	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1109/WACV57701.2024.00140	-
dc.identifier.scopus	eid_2-s2.0-85191996148	-
dc.identifier.spage	1362	-
dc.identifier.epage	1371	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Adaptive Deep Neural Network Inference Optimization with EENet

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats