Data-Driven Model Predictive Control Using Deep Double Expected Sarsa

Hoomaan MoradiMaryamnegari; Marco Frego; Angelika Peer

doi:10.1109/CoDIT58514.2023.10284335

Conference proceeding

Peer reviewed

Hoomaan MoradiMaryamnegari, Marco Frego and Angelika Peer

2023 9th International Conference on Control, Decision and Information Technologies (CoDIT), pp.345-350

9th International Conference on Control, Decision and Information Technologies (Rome, 03/07/2023 - 06/07/2023)

2023

DOI: https://doi.org/10.1109/CoDIT58514.2023.10284335

Handle: https://hdl.handle.net/10863/37859

Keywords: Training; Computational modeling; Neural networks; Reinforcement learning; Predictive models; Cost function; Approximation algorithms

Abstract

This paper presents a data-driven Model Predictive Controller (MPC) in which an off-policy Reinforcement Learning (RL) method, Deep Double Expected Sarsa, updates the weights of the MPC cost function. The parameterized MPC cost function serves as the current action-value function estimator, while a Neural Network serves as the subsequent action-value function approximator. This target Neural Network is trained on inputs and outputs of the primary MPC obtained at previous sampling times. Training runs either within each sampling time, sharing the time slot with the main algorithm, or entirely in parallel to the main algorithm; the latter reduces the required real-time computation per time slot. Two strategies are used to compute the action of the target policy: a greedy policy that minimizes the Neural Network model with respect to the action, and the second element of the MPC vector from the previous sampling time. Results show no significant difference between the two strategies in final control performance or training speed, whereas the second strategy significantly reduces the real-time computational cost because the optimization over the Neural Network can be omitted.
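The two strategies for choosing the target-policy action can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the quadratic stand-in for the target Neural Network, the candidate-action grid, the discount factor, and all numeric values are assumptions made for illustration only. The sketch relies on one general fact: for a deterministic target policy, the expectation in the Expected Sarsa target reduces to the approximator evaluated at the single chosen action.

```python
import numpy as np

def q_next(state, action):
    """Hypothetical stand-in for the target network (assumption):
    a fixed quadratic cost-to-go instead of a trained model."""
    return float(state**2 + 0.5 * (action + 0.5 * state) ** 2)

def greedy_action(state, candidates):
    """Strategy 1: minimize the approximator over a grid of candidate actions."""
    values = [q_next(state, a) for a in candidates]
    return float(candidates[int(np.argmin(values))])

def shifted_mpc_action(prev_mpc_sequence):
    """Strategy 2: reuse the second element of the previous MPC action sequence,
    so no optimization over the approximator is needed."""
    return float(prev_mpc_sequence[1])

def td_target(stage_cost, next_state, next_action, gamma=0.95):
    """Cost-based target: stage cost plus discounted value at the target action."""
    return stage_cost + gamma * q_next(next_state, next_action)

# Illustrative numbers only (assumptions, not data from the paper).
candidates = np.linspace(-2.0, 2.0, 81)
next_state = 1.0
prev_mpc_sequence = np.array([-0.8, -0.45, -0.2])

a_greedy = greedy_action(next_state, candidates)
a_shift = shifted_mpc_action(prev_mpc_sequence)
target_greedy = td_target(1.0, next_state, a_greedy)
target_shift = td_target(1.0, next_state, a_shift)
```

In this toy setting the two targets are close because the previous MPC sequence is already near the minimizer of the stand-in approximator; the second strategy replaces a search over actions with a single array lookup.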

Details

Title: Data-Driven Model Predictive Control Using Deep Double Expected Sarsa
Creators: Hoomaan MoradiMaryamnegari - Free University of Bozen-Bolzano
Marco Frego - Free University of Bozen-Bolzano
Angelika Peer - Free University of Bozen-Bolzano
Publication Details: 2023 9th International Conference on Control, Decision and Information Technologies (CoDIT), pp.345-350
ISBN: 9798350311402
Conference: 9th International Conference on Control, Decision and Information Technologies (Rome, 03/07/2023 - 06/07/2023)
Publisher: IEEE
Number of pages: 6
Identifiers: 979-8-3503-1140-2
(UNIBZ)69178858
991006685298701241
Scopus ID: 2-s2.0-85177423229
Academic Unit: Faculty of Engineering
Language: English
Resource Type: Conference proceeding
Author Names String: MoradiMaryamnegari H, Frego M, Peer A
