Training a DDPG, and observation values are zero. How do I initialize the first episode to have initial values to the action?
2 Ansichten (letzte 30 Tage)
Ältere Kommentare anzeigen
Hello,
I am training a DDPG agent with four actions. My observations are zero for more than 1000 episodes. I suspect because the action values have been zero, that is affecting the observations. How do I set the action values for the first episode to some values at start.
Actions are torque input with min and max (200) and later multiplied with gain 100. Is there something, I need to do to properly to get the observations to not stay as zeros.
4 Kommentare
Emmanouil Tzorakoleftherakis
am 5 Jul. 2023
Not very clear why you would want the agent to learn when the termination time of the episode? After training you can always choose to 'unplug' the agent as you see fit.
Antworten (0)
Siehe auch
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!