How is GAE calculated in Reinforcement Learning Toolbox (PPO)?
TigerSee
on 14 Feb 2021
Answered: Emmanouil Tzorakoleftherakis
on 16 Feb 2021
There is a difference between the Help Center page and reference [3] regarding the TD error.
Why
in Generalized Advantage Estimator?
https://ww2.mathworks.cn/help/reinforcement-learning/ug/ppo-agents.html

Accepted Answer
Emmanouil Tzorakoleftherakis
on 16 Feb 2021
Hello,
Thank you for catching this typo - it should be Gt = Dt+V. I have let the documentation team know.
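For readers who want to see where the corrected relation fits, here is a minimal Python sketch of a generalized advantage estimator. It is not the toolbox's implementation. The function name `gae`, its arguments, the default values of `gamma` and `lam`, and the convention that `rewards[t]` is the reward received after acting in state S_t are all assumptions made for illustration.

```python
def gae(rewards, values, last_value, gamma=0.99, lam=0.95):
    """Return (advantages D_t, returns G_t) for one trajectory segment.

    rewards[t]  : reward received after acting in state S_t (assumed convention)
    values[t]   : critic estimate V(S_t)
    last_value  : critic estimate for the state after the final step
                  (use 0.0 if that state is terminal)
    """
    n = len(rewards)
    advantages = [0.0] * n
    running = 0.0
    # Work backwards so each D_t can reuse D_{t+1}.
    for t in reversed(range(n)):
        next_value = last_value if t == n - 1 else values[t + 1]
        # TD error: delta_t = r_t + gamma * V(S_{t+1}) - V(S_t)
        delta = rewards[t] + gamma * next_value - values[t]
        # Advantage recursion: D_t = delta_t + gamma * lam * D_{t+1}
        running = delta + gamma * lam * running
        advantages[t] = running
    # Critic target, using the corrected sign: G_t = D_t + V(S_t)
    returns = [d + v for d, v in zip(advantages, values)]
    return advantages, returns


if __name__ == "__main__":
    adv, ret = gae([1.0, 1.0, 1.0], [0.5, 0.2, 0.1], last_value=0.0)
    print("advantages:", adv)
    print("returns:   ", ret)
```

With `gamma = 1` and `lam = 1` the value terms telescope, so G_t reduces to the plain sum of future rewards plus `last_value`. That is a quick way to check that the sign in Gt = Dt+V is the right one.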