A computationally lightweight safe learning algorithm

Dominik Baumann, Krzysztof Kowalczyk, Koen Tiels, Paweł Wachel

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

39 Lataukset (Pure)

Abstrakti

Safety is an essential asset when learning control policies for physical systems, as violating safety constraints during training can lead to expensive hardware damage. In response to this need, the field of safe learning has emerged with algorithms that can provide probabilistic safety guarantees without knowledge of the underlying system dynamics. Those algorithms often rely on Gaussian process inference. Unfortunately, Gaussian process inference scales cubically with the number of data points, limiting applicability to high-dimensional and embedded systems. In this paper, we propose a safe learning algorithm that provides probabilistic safety guarantees but leverages the Nadaraya-Watson estimator instead of Gaussian processes. For the Nadaraya-Watson estimator, we can reach logarithmic scaling with the number of data points. We provide theoretical guarantees for the estimates, embed them into a safe learning algorithm, and show numerical experiments on a simulated seven-degrees-of-freedom robot manipulator.
AlkuperäiskieliEnglanti
Otsikko2023 62nd IEEE Conference on Decision and Control, CDC 2023
KustantajaIEEE
Sivut1022-1027
Sivumäärä6
ISBN (elektroninen)979-8-3503-0124-3
DOI - pysyväislinkit
TilaJulkaistu - 19 tammik. 2024
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaIEEE Conference on Decision and Control - Marina Bay Sands, Singapore, Singapore
Kesto: 13 jouluk. 202315 jouluk. 2023
Konferenssinumero: 62
https://cdc2023.ieeecss.org/

Julkaisusarja

NimiProceedings of the IEEE Conference on Decision & Control
ISSN (elektroninen)2576-2370

Conference

ConferenceIEEE Conference on Decision and Control
LyhennettäCDC
Maa/AlueSingapore
KaupunkiSingapore
Ajanjakso13/12/202315/12/2023
www-osoite

Sormenjälki

Sukella tutkimusaiheisiin 'A computationally lightweight safe learning algorithm'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä