Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks with Strict-Sense Stationary and Non-Stationary Wireless Communication Channels

Nikolaos Nomikos, Mohammad Sadegh Talebi, Themistoklis Charalambous, Risto Wichman

Research output: Contribution to journalArticleScientificpeer-review

3 Citations (Scopus)
29 Downloads (Pure)


Full-duplex relaying is an enabling technique of sixth generation (6G) mobile networks, promising tremendous rate and spectral efficiency gains. In order to improve the performance of full-duplex communications, power control is a viable way of avoiding excessive loop interference at the relay. Unfortunately, power control requires channel state information of source-relay, relay-destination and loop interference channels, thus resulting in increased overheads. Aiming to offer a low-complexity alternative for power control in such networks, we adopt reward-based learning in the sense of multi-armed bandits. More specifically, we present bandit-based power control, relying on acknowledgements/negative-acknowledgements observations by the relay. Our distributed algorithms avoid channel state information acquisition and exchange, and can alleviate the impact of outdated channel state information. Two cases are examined regarding the channel statistics of the wireless network, namely, strict-sense stationary and non-stationary channels. For the latter, a sliding window approach is adopted to further improve the performance. Performance evaluation highlights a performance-complexity trade-off, compared to optimal power control with full channel knowledge and significant gains over cases considering channel estimation and feedback overheads, outdated channel knowledge, no power control and random power level selection. Finally, it is shown that the sliding-window bandit-based algorithm provides improved performance in non-stationary settings by efficiently adapting to abrupt changes of the wireless channels.

Original languageEnglish
Pages (from-to)366-378
Number of pages13
JournalIEEE Open Journal of the Communications Society
Publication statusPublished - 28 Feb 2022
MoE publication typeA1 Journal article-refereed


  • Channel estimation
  • Encoding
  • Full-duplex relaying
  • multi-armed bandits
  • non-stationary wireless channels
  • outdated CSI
  • power control
  • Power control
  • reinforcement learning
  • Relay networks (telecommunication)
  • Resource management
  • sliding-window
  • upper confidence bound policies.
  • Wireless communication
  • Wireless sensor networks


Dive into the research topics of 'Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks with Strict-Sense Stationary and Non-Stationary Wireless Communication Channels'. Together they form a unique fingerprint.

Cite this