Asynchronous Multi-Agent Reinforcement Learning for Scheduling in Subnetworks

Research output: Chapter in book/conference proceedings; Conference article in proceedings; Scientific; peer-reviewed

35 Downloads (Pure)

Abstract

We address radio resource scheduling in a network of multiple in-X subnetworks providing wireless Ultra-Reliable Low-Latency Communication (URLLC) service. Each subnetwork is controlled by an agent responsible for scheduling resources to its devices. Agents rely solely on interference measurements for information about other agents, with no explicit coordination. Subnetwork mobility and fast-fading effects create a non-stationary environment, adding to the complexity of the scheduling problem. This scenario is modeled as a multi-agent Markov Decision Process (MDP). To address the problem, we propose a Multi-Agent Deep Reinforcement Learning (MADRL) approach under URLLC constraints, which integrates Long Short-Term Memory (LSTM) with the Deep Deterministic Policy Gradient (DDPG) algorithm to manage non-stationarity and high-dimensional action spaces. We apply an asynchronous update strategy, where one agent is updating at a time. This reduces learning variability, resolves policy conflicts, and improves the interpretability of the MADRL approach. Simulation results demonstrate that the asynchronous update mechanism outperforms synchronous updates and baseline methods, achieving superior reliability, resource utilization, and explainability.
Original language: English
Title of host publication: Proceedings of the IEEE 101st Vehicular Technology Conference
Publisher: IEEE
Number of pages: 6
Publication status: Accepted/In press - 2025
MoE publication type: A4 Article in conference proceedings
Event: IEEE Vehicular Technology Conference - Oslo, Norway
Duration: 17 Jun 2025 to 20 Jun 2025
Conference number: 101

Conference

Conference: IEEE Vehicular Technology Conference
Abbreviated title: VTC
Country/Territory: Norway
City: Oslo
Period: 17/06/2025 to 20/06/2025
