Article: A Reliable Reinforcement Learning for Resource Allocation in Uplink NOMA-URLLC Networks

Title: A Reliable Reinforcement Learning for Resource Allocation in Uplink NOMA-URLLC Networks
Authors: Ahsan, Waleed; Yi, Wenqiang; Liu, Yuanwei; Nallanathan, Arumugam
Keywords: Deep SARSA-λ learning; non-orthogonal multiple access; power allocation; ultra-reliable low-latency communication; user clustering
Issue Date: 2022
Citation: IEEE Transactions on Wireless Communications, 2022, v. 21, n. 8, p. 5989-6002
Abstract: In this paper, we propose a deep state-action-reward-state-action (SARSA) λ learning approach for optimising uplink resource allocation in non-orthogonal multiple access (NOMA) aided ultra-reliable low-latency communication (URLLC). To reduce the mean decoding error probability in time-varying network environments, this work designs a reliable learning algorithm that provides long-term resource allocation, where the reward feedback is based on the instantaneous network performance. With the aid of the proposed algorithm, this paper addresses three main challenges of reliable resource sharing in NOMA-URLLC networks: 1) user clustering; 2) an instantaneous feedback system; and 3) optimal resource allocation. All of these designs interact with the considered communication environment. Lastly, we compare the performance of the proposed algorithm with conventional Q-learning and SARSA algorithms. The simulation outcomes show that: 1) compared with traditional Q-learning algorithms, the proposed solution converges within 200 episodes while achieving a long-term mean error as low as 10⁻²; 2) NOMA-assisted URLLC outperforms traditional OMA systems in terms of decoding error probability; and 3) the proposed feedback system is efficient for the long-term learning process.
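
Because the full text is subscription-only, the abstract above is the only description of the method in this record. As a rough, generic illustration of the technique the title names, the sketch below implements plain tabular SARSA(λ) with accumulating eligibility traces in Python. It is a minimal sketch under stated assumptions: the paper's actual contribution is a deep SARSA-λ network for NOMA-URLLC user clustering and power allocation, and the state/action spaces, stand-in reward, and hyperparameters here are illustrative assumptions, not values from the paper.

    import numpy as np

    # Tabular SARSA(lambda) with accumulating eligibility traces.
    # Toy sizes: the real problem's states (channel conditions) and actions
    # (cluster/power choices) are far richer; these are assumptions only.
    N_STATES, N_ACTIONS = 16, 4
    ALPHA, GAMMA, LAM, EPS = 0.1, 0.95, 0.9, 0.1   # illustrative hyperparameters

    Q = np.zeros((N_STATES, N_ACTIONS))   # action-value estimates
    E = np.zeros_like(Q)                  # eligibility traces

    rng = np.random.default_rng(0)

    def epsilon_greedy(state):
        """Behaviour policy: explore with probability EPS, else act greedily."""
        if rng.random() < EPS:
            return int(rng.integers(N_ACTIONS))
        return int(np.argmax(Q[state]))

    def env_step(state, action):
        """Stand-in environment. In the paper the reward reflects the
        instantaneous decoding error of the chosen allocation; here it is
        random noise, purely so the loop runs."""
        return rng.standard_normal(), int(rng.integers(N_STATES))

    for episode in range(200):            # abstract reports convergence within ~200 episodes
        E[:] = 0.0                        # reset traces at the start of each episode
        s = int(rng.integers(N_STATES))
        a = epsilon_greedy(s)
        for _ in range(100):              # arbitrary episode length
            r, s2 = env_step(s, a)
            a2 = epsilon_greedy(s2)       # on-policy: pick the next action first
            delta = r + GAMMA * Q[s2, a2] - Q[s, a]   # TD error
            E[s, a] += 1.0                # accumulate trace for the visited pair
            Q += ALPHA * delta * E        # propagate TD error along all traces
            E *= GAMMA * LAM              # decay traces
            s, a = s2, a2

The on-policy update (the next action is selected before the TD error is computed) is what separates SARSA from the Q-learning baseline the abstract compares against, and the λ-decayed traces spread each instantaneous reward over recently visited state-action pairs, mirroring the abstract's instantaneous-feedback design for long-term allocation.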
Persistent Identifier: http://hdl.handle.net/10722/349689
ISSN: 1536-1276
2023 Impact Factor: 8.9
2023 SCImago Journal Rankings: 5.371


DC Field                      Value
dc.contributor.author         Ahsan, Waleed
dc.contributor.author         Yi, Wenqiang
dc.contributor.author         Liu, Yuanwei
dc.contributor.author         Nallanathan, Arumugam
dc.date.accessioned           2024-10-17T07:00:09Z
dc.date.available             2024-10-17T07:00:09Z
dc.date.issued                2022
dc.identifier.citation        IEEE Transactions on Wireless Communications, 2022, v. 21, n. 8, p. 5989-6002
dc.identifier.issn            1536-1276
dc.identifier.uri             http://hdl.handle.net/10722/349689
dc.description.abstract       (as in the Abstract above)
dc.language                   eng
dc.relation.ispartof          IEEE Transactions on Wireless Communications
dc.subject                    Deep SARSA-λ learning
dc.subject                    non-orthogonal multiple access
dc.subject                    power allocation
dc.subject                    ultra-reliable low-latency communication
dc.subject                    user clustering
dc.title                      A Reliable Reinforcement Learning for Resource Allocation in Uplink NOMA-URLLC Networks
dc.type                       Article
dc.description.nature         link_to_subscribed_fulltext
dc.identifier.doi             10.1109/TWC.2022.3144618
dc.identifier.scopus          eid_2-s2.0-85124229416
dc.identifier.volume          21
dc.identifier.issue           8
dc.identifier.spage           5989
dc.identifier.epage           6002
dc.identifier.eissn           1558-2248
