Projects per year
Abstract
Many sequential decision-making problems need optimization of different objectives which possibly conflict with each other. The conventional way to deal with a multitask problem is to establish a scalar objective function based on a linear combination of different objectives. However, for the case where we have conflicting objectives with different scales, this method needs a trial-and-error approach to properly find proper weights for the combination. As such, in most cases, this approach cannot guarantee an optimal Pareto solution. In this paper, we develop a single-agent scale-independent multi-objective reinforcement learning on the basis of the Advantage Actor-Critic (A2C) algorithm. A convergence analysis is then done for the devised multi-objective algorithm providing a convergence-in-mean guarantee. We then perform some experiments over a multitask problem to evaluate the performance of the proposed algorithm. Simulation results show the superiority of developed multi-objective A2C approach against the single-objective algorithm.
Original language | English |
---|---|
Title of host publication | 2023 62nd IEEE Conference on Decision and Control (CDC) |
Publisher | IEEE |
Pages | 1326-1333 |
Number of pages | 8 |
ISBN (Print) | 979-8-3503-0125-0 |
DOIs | |
Publication status | Published - 15 Dec 2023 |
MoE publication type | A4 Conference publication |
Event | IEEE Conference on Decision and Control - Marina Bay Sands, Singapore, Singapore Duration: 13 Dec 2023 → 15 Dec 2023 Conference number: 62 https://cdc2023.ieeecss.org/ |
Publication series
Name | Proceedings of the IEEE Conference on Decision & Control |
---|---|
ISSN (Electronic) | 2576-2370 |
Conference
Conference | IEEE Conference on Decision and Control |
---|---|
Abbreviated title | CDC |
Country/Territory | Singapore |
City | Singapore |
Period | 13/12/2023 → 15/12/2023 |
Internet address |
Keywords
- Measurement
- Simulation
- Decision making
- Reinforcement learning
- Quality of service
- Linear programming
- Optimization
Fingerprint
Dive into the research topics of 'A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis'. Together they form a unique fingerprint.Projects
- 1 Active
-
CRUISE /Di Francesco: A Cross-system Architecture Design for Autonomous Wireless Networks based on Lifelong Machine Learning
Di Francesco, M. (Principal investigator)
01/01/2023 → 31/12/2025
Project: RCF Other