Reinforcement learning of adaptive online rescheduling timing and computing time allocation

Teemu J. Ikonen, Keijo Heljanko, Iiro Harjunkoski*

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

11 Citations (Scopus)
12 Downloads (Pure)

Abstract

Mathematical optimization methods have been developed to a vast variety of complex problems in the field of process systems engineering (e.g., the scheduling of chemical batch processes). However, the use of these methods in online scheduling is hindered by the stochastic nature of the processes and prohibitively long solution times when optimized over long time horizons. The following questions are raised: When to trigger a rescheduling, how much computing resources to allocate, what optimization strategy to use, and how far ahead to schedule? We propose an approach where a reinforcement learning agent is trained to make the first two decisions (i.e., rescheduling timing and computing time allocation). Using neuroevolution of augmenting topologies (NEAT) as the reinforcement learning algorithm, the approach yields, on average, better closed-loop solutions than conventional rescheduling methods on three out of four studied routing problems. We also reflect on expanding the agent's decision-making to all four decisions. (C) 2020 Elsevier Ltd. All rights reserved.

Original languageEnglish
Article number106994
Number of pages17
JournalComputers and Chemical Engineering
Volume141
DOIs
Publication statusPublished - 4 Oct 2020
MoE publication typeA1 Journal article-refereed

Keywords

  • Computing resource allocation
  • Decision-making
  • Online scheduling
  • Reinforcement learning
  • Rescheduling procedures
  • Timing

Fingerprint

Dive into the research topics of 'Reinforcement learning of adaptive online rescheduling timing and computing time allocation'. Together they form a unique fingerprint.

Cite this