reward error during training

1 Ansicht (letzte 30 Tage)
기범
기범 am 11 Jan. 2023
Beantwortet: Harsh am 28 Feb. 2025
Hello,
Im using reinforcemet designer to train my model,
and here is my problem.
Q. I dont not why my reward cannot go up to 0.1, why is this happen?? How can I fix this??

Antworten (1)

Harsh
Harsh am 28 Feb. 2025
In Reinforcement Learning (RL), the reward is a signal that guides the agent’s learning by providing feedback on its actions. It changes dynamically based on the agent’s behavior and the environment’s response. A well-designed reward function encourages desired actions and discourages unwanted ones, leading to improved performance over time.
You can try the following to improve your performance:
  • Ensure that cos(psi(t)) - cos(psi(t-1)) > 0 is met frequently by checking if psi(t) increases over time.
  • Verify that the initial conditions of the delay block are properly set to prevent incorrect first-step evaluations.
  • Modify the reward function threshold to make the positive reward condition more lenient.
For more information on how to craft a reward function please refer to the following MATLAB tech talks webinar - https://www.mathworks.com/videos/reinforcement-learning-part-2-understanding-the-environment-and-rewards-1551976590603.html

Kategorien

Mehr zu Environments finden Sie in Help Center und File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by