File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: Multiple access, resource management, and load balancing for edge intelligence systems
Title | Multiple access, resource management, and load balancing for edge intelligence systems |
---|---|
Authors | |
Advisors | Advisor(s):Huang, K |
Issue Date | 2021 |
Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
Citation | Wen, D. [文鼎柱 ]. (2021). Multiple access, resource management, and load balancing for edge intelligence systems. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
Abstract | Given the advancements of mobile communication systems and Internet-of-Things (IoT), billions of mobile devices will be deployed in wireless networks. The enormous amount of data generated by the massive number of devices, together with their computation capacities, can be used to enable low-latency artificial intelligence (AI) applications at the network edge, giving rise to a new research area, called edge intelligence. The new design goals give rise to new challenges. On one hand, fast wireless data aggregation (WDA) from a large number of devices should be realized to execute many real-time tasks, such as auto-driving, public safety surveillance, and traffic control. As a result, the conventional orthogonal multiple access techniques is insufficient due to the limited wireless resources. On the other hand, distributed learning algorithms are deployed at the network edge for training AI models by using distributed data and computation resources therein, called edge learning. This inter-disciplinary area integrates computer science and wireless communications. Consequently, the traditional wireless techniques, targeting throughput maximization or energy minimization, are no longer optimal, as the design goal shifts towards enhancing the learning performance, i.e., training latency and testing accuracy. In this dissertation, the above two issues are addressed.
First, a new multiple access technique is proposed for massive multi-input-multi-output (MIMO) systems with clustered IoT sensors, called MIMO AirComp, which directly receives a functional value of the distributed data transmitted from multiple devices instead of decoding each data stream. As a result, it can support the simultaneous transmission of a large number of devices by using the waveform superposition property of a wireless channel. Specifically, in the proposed MIMO AirComp framework, a vector-valued function can be calculated at the receiver with low latency and low complexity by designing a two-tier receive beamforming matrix.
Next, the novel framework of partitioned edge learning (PARTEL) is proposed. It supports the efficient training of large-scale models e.g., large-scale logistic regression or deep neural networks (DNNs) with millions of parameters, by coordinating the edge server and many resource-constrained devices. In such a framework, the global model is partitioned into several parametric blocks in each training iteration. Each block is assigned to one worker (group) for updating. Then, updates of all blocks are uploaded to the server for updating the global model, completing one iteration. The PARTEL framework is first implemented at narrowband wireless systems. Optimal scheme of joint parameter and bandwidth allocation is proposed under the target of minimum training latency. Then, it is further implemented at broadband wireless systems using orthogonal frequency division multiplexing (OFDM), which is more challenging as it involves the joint allocation of subcarriers, transmit power, and computation power. To solve this optimization problem, a practical solution approach is proposed. In both systems, experimental results using real datasets show that the proposed schemes can substantially reduce the training latency. |
Degree | Doctor of Philosophy |
Subject | Mobile communication systems Edge computing |
Dept/Program | Electrical and Electronic Engineering |
Persistent Identifier | http://hdl.handle.net/10722/308603 |
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Huang, K | - |
dc.contributor.author | Wen, Dingzhu | - |
dc.contributor.author | 文鼎柱 | - |
dc.date.accessioned | 2021-12-06T01:03:58Z | - |
dc.date.available | 2021-12-06T01:03:58Z | - |
dc.date.issued | 2021 | - |
dc.identifier.citation | Wen, D. [文鼎柱 ]. (2021). Multiple access, resource management, and load balancing for edge intelligence systems. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
dc.identifier.uri | http://hdl.handle.net/10722/308603 | - |
dc.description.abstract | Given the advancements of mobile communication systems and Internet-of-Things (IoT), billions of mobile devices will be deployed in wireless networks. The enormous amount of data generated by the massive number of devices, together with their computation capacities, can be used to enable low-latency artificial intelligence (AI) applications at the network edge, giving rise to a new research area, called edge intelligence. The new design goals give rise to new challenges. On one hand, fast wireless data aggregation (WDA) from a large number of devices should be realized to execute many real-time tasks, such as auto-driving, public safety surveillance, and traffic control. As a result, the conventional orthogonal multiple access techniques is insufficient due to the limited wireless resources. On the other hand, distributed learning algorithms are deployed at the network edge for training AI models by using distributed data and computation resources therein, called edge learning. This inter-disciplinary area integrates computer science and wireless communications. Consequently, the traditional wireless techniques, targeting throughput maximization or energy minimization, are no longer optimal, as the design goal shifts towards enhancing the learning performance, i.e., training latency and testing accuracy. In this dissertation, the above two issues are addressed. First, a new multiple access technique is proposed for massive multi-input-multi-output (MIMO) systems with clustered IoT sensors, called MIMO AirComp, which directly receives a functional value of the distributed data transmitted from multiple devices instead of decoding each data stream. As a result, it can support the simultaneous transmission of a large number of devices by using the waveform superposition property of a wireless channel. Specifically, in the proposed MIMO AirComp framework, a vector-valued function can be calculated at the receiver with low latency and low complexity by designing a two-tier receive beamforming matrix. Next, the novel framework of partitioned edge learning (PARTEL) is proposed. It supports the efficient training of large-scale models e.g., large-scale logistic regression or deep neural networks (DNNs) with millions of parameters, by coordinating the edge server and many resource-constrained devices. In such a framework, the global model is partitioned into several parametric blocks in each training iteration. Each block is assigned to one worker (group) for updating. Then, updates of all blocks are uploaded to the server for updating the global model, completing one iteration. The PARTEL framework is first implemented at narrowband wireless systems. Optimal scheme of joint parameter and bandwidth allocation is proposed under the target of minimum training latency. Then, it is further implemented at broadband wireless systems using orthogonal frequency division multiplexing (OFDM), which is more challenging as it involves the joint allocation of subcarriers, transmit power, and computation power. To solve this optimization problem, a practical solution approach is proposed. In both systems, experimental results using real datasets show that the proposed schemes can substantially reduce the training latency. | - |
dc.language | eng | - |
dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
dc.subject.lcsh | Mobile communication systems | - |
dc.subject.lcsh | Edge computing | - |
dc.title | Multiple access, resource management, and load balancing for edge intelligence systems | - |
dc.type | PG_Thesis | - |
dc.description.thesisname | Doctor of Philosophy | - |
dc.description.thesislevel | Doctoral | - |
dc.description.thesisdiscipline | Electrical and Electronic Engineering | - |
dc.description.nature | published_or_final_version | - |
dc.date.hkucongregation | 2021 | - |
dc.identifier.mmsid | 991044448906603414 | - |