The definition of the Target update frequency in Reinforcement Learning Designer.
4 Ansichten (letzte 30 Tage)
Ältere Kommentare anzeigen
Xian Zheng Hong
am 7 Mär. 2024
Kommentiert: Xian Zheng Hong
am 16 Mär. 2024
In DDPG Agent, there are four networks. Online policy, Target policy, Online Q and Target Q.
The [Target update frequency] is used to the Target policy and Target Q in Reinforcement Learning Designer.
Are the Update frequency of the Online policy and Online Q same as the [Target update frequency] ?
0 Kommentare
Akzeptierte Antwort
UDAYA PEDDIRAJU
am 12 Mär. 2024
Hi Xian,
No, the update frequency of the Online Policy and Online Q networks is not the same as the Target Update Frequency. The Target Update Frequency specifically applies to how often the Target Policy and Target Q networks are updated, which is typically less frequent or managed differently to ensure stability in learning.
Weitere Antworten (0)
Siehe auch
Kategorien
Mehr zu Deep Learning Toolbox finden Sie in Help Center und File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!