How Does MATLAB Internally Format Actions as dlarray in DDPG with Recurrent Networks (LSTM)?
Ältere Kommentare anzeigen
In MATLAB's RL toolbox, when using DDPG with LSTM-based actors/critics, the conversion of actions to dlarray is handled automatically. Since users cannot directly control this process:
Are actions formatted with 'T' (time) or 'C' (channel) dimensions when passed between the actor and critic networks?
How does MATLAB structure actions for compatibility with recurrent layers (e.g., aligning sequences for LSTM time steps)?
Akzeptierte Antwort
Weitere Antworten (0)
Kategorien
Mehr zu Actors, Critics, and Policies finden Sie in Hilfe-Center und File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!