File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TWC.2020.3021177
- Scopus: eid_2-s2.0-85097742947
- WOS: WOS:000597156900027
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Joint Parameter-and-Bandwidth Allocation for Improving the Efficiency of Partitioned Edge Learning
Title | Joint Parameter-and-Bandwidth Allocation for Improving the Efficiency of Partitioned Edge Learning |
---|---|
Authors | |
Keywords | Computational modeling Resource management Servers Channel allocation Load modeling |
Issue Date | 2020 |
Publisher | Institute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7693 |
Citation | IEEE Transactions on Wireless Communications, 2020, v. 19 n. 12, p. 8272-8286 How to Cite? |
Abstract | To leverage data and computation capabilities of mobile devices, machine learning algorithms are deployed at the network edge for training artificial intelligence (AI) models, resulting in the new paradigm of edge learning. In this paper, we consider the framework of partitioned edge learning for iteratively training a large-scale model using many resource-constrained devices (called workers). To this end, in each iteration, the model is dynamically partitioned into parametric blocks, which are downloaded to worker groups for updating using data subsets. Then, the local updates are uploaded to and cascaded by the server for updating a global model. To reduce resource usage by minimizing the total learning-and-communication latency, this work focuses on the novel joint design of parameter (computation load) allocation and bandwidth allocation (for downloading and uploading). Two design approaches are adopted. First, a practical sequential approach, called partially integrated parameter-and-bandwidth allocation (PABA), yields two schemes, namely bandwidth aware parameter allocation and parameter aware bandwidth allocation. The former minimizes the load for the slowest (in computing) of worker groups, each training a same parametric block. The latter allocates the largest bandwidth to the worker being the latency bottleneck. Second, PABA are jointly optimized. Despite it being a nonconvex problem, an efficient and optimal solution algorithm is derived by intelligently nesting a bisection search and solving a convex problem. Experimental results using real data demonstrate that integrating PABA can substantially improve the performance of partitioned edge learning in terms of latency (by e.g., 46%) and accuracy (by e.g., 4% given the latency of 100 seconds). |
Persistent Identifier | http://hdl.handle.net/10722/295894 |
ISSN | 2023 Impact Factor: 8.9 2023 SCImago Journal Rankings: 5.371 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | WEN, D | - |
dc.contributor.author | Bennis, M | - |
dc.contributor.author | Huang, K | - |
dc.date.accessioned | 2021-02-08T08:15:32Z | - |
dc.date.available | 2021-02-08T08:15:32Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | IEEE Transactions on Wireless Communications, 2020, v. 19 n. 12, p. 8272-8286 | - |
dc.identifier.issn | 1536-1276 | - |
dc.identifier.uri | http://hdl.handle.net/10722/295894 | - |
dc.description.abstract | To leverage data and computation capabilities of mobile devices, machine learning algorithms are deployed at the network edge for training artificial intelligence (AI) models, resulting in the new paradigm of edge learning. In this paper, we consider the framework of partitioned edge learning for iteratively training a large-scale model using many resource-constrained devices (called workers). To this end, in each iteration, the model is dynamically partitioned into parametric blocks, which are downloaded to worker groups for updating using data subsets. Then, the local updates are uploaded to and cascaded by the server for updating a global model. To reduce resource usage by minimizing the total learning-and-communication latency, this work focuses on the novel joint design of parameter (computation load) allocation and bandwidth allocation (for downloading and uploading). Two design approaches are adopted. First, a practical sequential approach, called partially integrated parameter-and-bandwidth allocation (PABA), yields two schemes, namely bandwidth aware parameter allocation and parameter aware bandwidth allocation. The former minimizes the load for the slowest (in computing) of worker groups, each training a same parametric block. The latter allocates the largest bandwidth to the worker being the latency bottleneck. Second, PABA are jointly optimized. Despite it being a nonconvex problem, an efficient and optimal solution algorithm is derived by intelligently nesting a bisection search and solving a convex problem. Experimental results using real data demonstrate that integrating PABA can substantially improve the performance of partitioned edge learning in terms of latency (by e.g., 46%) and accuracy (by e.g., 4% given the latency of 100 seconds). | - |
dc.language | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7693 | - |
dc.relation.ispartof | IEEE Transactions on Wireless Communications | - |
dc.rights | IEEE Transactions on Wireless Communications. Copyright © Institute of Electrical and Electronics Engineers. | - |
dc.rights | ©20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | - |
dc.subject | Computational modeling | - |
dc.subject | Resource management | - |
dc.subject | Servers | - |
dc.subject | Channel allocation | - |
dc.subject | Load modeling | - |
dc.title | Joint Parameter-and-Bandwidth Allocation for Improving the Efficiency of Partitioned Edge Learning | - |
dc.type | Article | - |
dc.identifier.email | Huang, K: huangkb@eee.hku.hk | - |
dc.identifier.authority | Huang, K=rp01875 | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/TWC.2020.3021177 | - |
dc.identifier.scopus | eid_2-s2.0-85097742947 | - |
dc.identifier.hkuros | 321246 | - |
dc.identifier.volume | 19 | - |
dc.identifier.issue | 12 | - |
dc.identifier.spage | 8272 | - |
dc.identifier.epage | 8286 | - |
dc.identifier.isi | WOS:000597156900027 | - |
dc.publisher.place | United States | - |