is actor-critic agent learning?

1 Ansicht (letzte 30 Tage)
karim bio gassi
karim bio gassi am 21 Aug. 2021
Beantwortet: Ahmed R. Sayed am 4 Okt. 2022
I built a actor critic agent for microgrid energy management. it has to decide the discharging/charging energy among a set of action
in total 9 action can be taken for 7008 time steps. I am training the agent over 2000 episodes. But I notice when the agent start learning at a cetain episodes, at the next episodes it fall completely down. I tattached the training for the first 250 episodes.
I wonder if there something wrong in my code.

Antworten (1)

Ahmed R. Sayed
Ahmed R. Sayed am 4 Okt. 2022
From your figure, the discounted reward value is very large. try to rescale it to a certain value [-10, 10] in the environment. For example, r(t) = 10 * Microgrid operational cost (t) / MaxCost , where MaxCost is the maximum possible cost per time step.
Another point is you can use another agent.
I hope these suggestions can solve your concerns.

Kategorien

Mehr zu Agents finden Sie in Help Center und File Exchange

Produkte


Version

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by