How GAE calculates in Reinforement Learning Toolbox(PPO)?

6 Ansichten (letzte 30 Tage)
A difference between help center and reference[3] about TD error.
Why in Generalized Advantage Estimator?
https://ww2.mathworks.cn/help/reinforcement-learning/ug/ppo-agents.html

Akzeptierte Antwort

Emmanouil Tzorakoleftherakis
Hello,
Thank you for catching this typo - it should be Gt = Dt+V. I have let the documentation team know.

Weitere Antworten (0)

Kategorien

Mehr zu Specialized Power Systems finden Sie in Help Center und File Exchange

Tags

Produkte


Version

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by