Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

Exploring Safe Reinforcement Learning Using Safety Shields Derived With System-Theoretic Process Analysis: A Case-Study on a Cruise Ship Hotel System

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

4 Lataukset (Pure)

Abstrakti

The cruise ship industry is under increasing pressure to reduce greenhouse gas emissions, as international regulations define ambitious requirements and goals for modern cruise ships. One of the most significant consumers of energy onboard cruise ships are their Heating Ventilation and Air Conditioning (HVAC) systems. However, the energy optimization of HVAC systems is challenging, as they are impacted by a number of uncontrolled variables, such as changing weather conditions, passenger behavior, and the demands of other significant energy consumers, such as propulsion systems. Reinforcement Learning (RL) is often used to tackle such complex optimization tasks, however concerns over ensuring the safety of RL optimized systems hinders its adoption in industry, especially in the context of safety-critical systems. This paper presents the initial findings of applying a novel approach to ensure safety in RL: a safety shield developed utilizing a novel hazard analysis method, System-Theoretic Process Analysis. In this work the safety shield is used to both train the RL agent as well as block unsafe behavior in operation. Preliminary findings suggest that blocking unsafe behavior during training hinders the ability to learn a safe RL policy, however, when used in testing the approach is capable of significantly reducing the number of safety violations.
AlkuperäiskieliEnglanti
Otsikko2025 IEEE 30th International Conference on Emerging Technologies and Factory Automation (ETFA)
ToimittajatLuis Almeida, Marina Indria, Mario de Sousa, Antonio Visioli, Mohammad Ashjaei, Pedro Santos
KustantajaIEEE
Sivumäärä4
ISBN (elektroninen)979-8-3315-5383-8
DOI - pysyväislinkit
TilaJulkaistu - 21 lokak. 2025
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaIEEE International Conference on Emerging Technologies and Factory Automation - Porto, Portugali
Kesto: 9 syysk. 202512 syysk. 2025
Konferenssinumero: 30

Julkaisusarja

NimiIEEE International Conference on Emerging Technologies and Factory Automation
ISSN (elektroninen)1946-0759

Conference

ConferenceIEEE International Conference on Emerging Technologies and Factory Automation
LyhennettäETFA
Maa/AluePortugali
KaupunkiPorto
Ajanjakso09/09/202512/09/2025

Rahoitus

The work presented in this paper is done within the project SEASHINE. The SEASHINE - Safe intelligent agent to optimize ship energy management has received funding from the European Union, via the oc1-2024-TIS-01 issued and implemented by the ENFIELD project, under the grant agreement No 101120657.

YK:n kestävän kehityksen tavoitteet

Tämä tuotos edistää seuraavia kestävän kehityksen tavoitteita:

  1. SDG 9 – Teollisuus, innovaatiot ja infrastruktuuri
    SDG 9 – Teollisuus, innovaatiot ja infrastruktuuri
  2. SDG 13 – Ilmastotoimet
    SDG 13 – Ilmastotoimet

Sormenjälki

Sukella tutkimusaiheisiin 'Exploring Safe Reinforcement Learning Using Safety Shields Derived With System-Theoretic Process Analysis: A Case-Study on a Cruise Ship Hotel System'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä