File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Graph-Embedded Multi-Agent Learning for Smart Reconfigurable THz MIMO-NOMA Networks

TitleGraph-Embedded Multi-Agent Learning for Smart Reconfigurable THz MIMO-NOMA Networks
Authors
KeywordsDistributed optimization
MADRL
MIMO-NOMA
Reconfigurable intelligent surface
THz
Issue Date2022
Citation
IEEE Journal on Selected Areas in Communications, 2022, v. 40, n. 1, p. 259-275 How to Cite?
AbstractWith the accelerated development of immersive applications and the explosive increment of internet-of-things (IoT) terminals, 6G would introduce terahertz (THz) massive multiple-input multiple-output non-orthogonal multiple access (MIMO-NOMA) technologies to meet the ultra-high-speed data rate and massive connectivity requirements. Nevertheless, the unreliability of THz transmissions and the extreme heterogeneity of device requirements pose critical challenges for practical applications. To address these challenges, we propose a novel smart reconfigurable THz MIMO-NOMA framework, which can realize customizable and intelligent communications by flexibly and coordinately reconfiguring hybrid beams through the cooperation between access points (APs) and reconfigurable intelligent surfaces (RISs). The optimization problem is formulated as a decentralized partially-observable Markov decision process (Dec-POMDP) to maximize the network energy efficiency, while guaranteeing the diversified users' performance, via a joint RIS element selection, coordinated discrete phase-shift control, and power allocation strategy. To solve the above non-convex, strongly coupled, and highly complex mixed integer nonlinear programming (MINLP) problem, we propose a novel multi-agent deep reinforcement learning (MADRL) algorithm, namely graph-embedded value-decomposition actor-critic (GE-VDAC), that embeds the interaction information of agents, and learns a locally optimal solution through a distributed policy. Numerical results demonstrate that the proposed algorithm achieves highly customized communications and outperforms traditional MADRL algorithms.
Persistent Identifierhttp://hdl.handle.net/10722/349655
ISSN
2023 Impact Factor: 13.8
2023 SCImago Journal Rankings: 8.707

 

DC FieldValueLanguage
dc.contributor.authorXu, Xiaoxia-
dc.contributor.authorChen, Qimei-
dc.contributor.authorMu, Xidong-
dc.contributor.authorLiu, Yuanwei-
dc.contributor.authorJiang, Hao-
dc.date.accessioned2024-10-17T06:59:59Z-
dc.date.available2024-10-17T06:59:59Z-
dc.date.issued2022-
dc.identifier.citationIEEE Journal on Selected Areas in Communications, 2022, v. 40, n. 1, p. 259-275-
dc.identifier.issn0733-8716-
dc.identifier.urihttp://hdl.handle.net/10722/349655-
dc.description.abstractWith the accelerated development of immersive applications and the explosive increment of internet-of-things (IoT) terminals, 6G would introduce terahertz (THz) massive multiple-input multiple-output non-orthogonal multiple access (MIMO-NOMA) technologies to meet the ultra-high-speed data rate and massive connectivity requirements. Nevertheless, the unreliability of THz transmissions and the extreme heterogeneity of device requirements pose critical challenges for practical applications. To address these challenges, we propose a novel smart reconfigurable THz MIMO-NOMA framework, which can realize customizable and intelligent communications by flexibly and coordinately reconfiguring hybrid beams through the cooperation between access points (APs) and reconfigurable intelligent surfaces (RISs). The optimization problem is formulated as a decentralized partially-observable Markov decision process (Dec-POMDP) to maximize the network energy efficiency, while guaranteeing the diversified users' performance, via a joint RIS element selection, coordinated discrete phase-shift control, and power allocation strategy. To solve the above non-convex, strongly coupled, and highly complex mixed integer nonlinear programming (MINLP) problem, we propose a novel multi-agent deep reinforcement learning (MADRL) algorithm, namely graph-embedded value-decomposition actor-critic (GE-VDAC), that embeds the interaction information of agents, and learns a locally optimal solution through a distributed policy. Numerical results demonstrate that the proposed algorithm achieves highly customized communications and outperforms traditional MADRL algorithms.-
dc.languageeng-
dc.relation.ispartofIEEE Journal on Selected Areas in Communications-
dc.subjectDistributed optimization-
dc.subjectMADRL-
dc.subjectMIMO-NOMA-
dc.subjectReconfigurable intelligent surface-
dc.subjectTHz-
dc.titleGraph-Embedded Multi-Agent Learning for Smart Reconfigurable THz MIMO-NOMA Networks-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/JSAC.2021.3126079-
dc.identifier.scopuseid_2-s2.0-85121876822-
dc.identifier.volume40-
dc.identifier.issue1-
dc.identifier.spage259-
dc.identifier.epage275-
dc.identifier.eissn1558-0008-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats