Transient value problem of the variable in reward function of reinforcement learning

1 Ansicht (letzte 30 Tage)

Ältere Kommentare anzeigen

Yihao Wan am 22 Mär. 2021

1
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/779882-transient-value-problem-of-the-variable-in-reward-function-of-reinforcement-learning

Kommentiert: Yihao Wan am 23 Mär. 2021

Akzeptierte Antwort: Emmanouil Tzorakoleftherakis

Hello, I encounted a problem when designing the reward function. In the simulink environment, I want to incorporate some variables in the reward function. During the training of RL agent, the varibles will converge after about 0.06s, while the agent is trained from 0s. The enable block doesn't help by putting the RL block in a subsystem.

From my understanding, it will influence the value reward function, which may result in poor trained agent. Does anyone have any suggestions regarding this questions?

Thank you very much.

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Akzeptierte Antwort

Emmanouil Tzorakoleftherakis am 22 Mär. 2021

0
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/779882-transient-value-problem-of-the-variable-in-reward-function-of-reinforcement-learning#answer_654817

You can put the agent block under a triggered subsystem and set it to begin training after 0.06 seconds

5 Kommentare
3 ältere Kommentare anzeigen3 ältere Kommentare ausblenden

Emmanouil Tzorakoleftherakis am 23 Mär. 2021

I believe it should be 40 yes - there is a counter implemented internally that keeps track of how many times the RL Agent block will run

Yihao Wan am 23 Mär. 2021

Thank you very much for your help.

Melden Sie sich an, um zu kommentieren.

Weitere Antworten (0)

Melden Sie sich an, um diese Frage zu beantworten.

Kategorien

Control Systems Reinforcement Learning Toolbox Environments

Mehr zu Environments finden Sie in Help Center und File Exchange

Tags

Produkte

Simulink

Version

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Translated by