How to use the reinforcement learning toolbox in Matlab to implement delayed reward
Ältere Kommentare anzeigen
I want to implement delayed reward with matlab code. For example, I need to wait until the end of my current episode before giving the reward for each action in this episode. How can I achieve this?
Akzeptierte Antwort
Weitere Antworten (1)
MOHAMMADREZA
am 5 Mär. 2025
0 Stimmen
Hi, I am having the same problem. Hwever, I am using the Matlab heper (class) for environment. I do not know how to handle reward so that at the end of episode the reward is used for updating the parameters. More specifically, when using class template, I have step, reset,... functions. when the parameters is updated? is it after running step function? I wrote the reward in the step function. but I need to update the parameters only at the end of episode.
Kategorien
Mehr zu Reinforcement Learning finden Sie in Hilfe-Center und File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!