DDPG Agent OU noise options to favour exploration

3 Ansichten (letzte 30 Tage)

Abd Al-Rahman Al-Remal am 22 Jul. 2021

0
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/883684-ddpg-agent-ou-noise-options-to-favour-exploration

Bearbeitet: Abd Al-Rahman Al-Remal am 22 Jul. 2021

Hi there,

I have seem similar posts here however I haven't found one that explains how to actually tune the OU noise parameters to favour exploration - currently my agent is stuck on the same reward value from the beginning and does not change/train/learn.

Can anyone advise on how to tune the OU noise parameters within the code to favour exploration? Currently mine are:

agentOpts.NoiseOptions.StandardDeviation = 0.3;

agentOpts.NoiseOptions.StandardDeviationDecayRate = 1e-5;

agentOpts.NoiseOptions.MeanAttractionConstant = 2e-3;

This worked for a previous similar model I made however I understand that the parameter smust be modified per model however I don't know how and literature all looks very dense and doesn't give a clear answer.

Thanks in advance!

Abd

Mehr zu Deep Learning Toolbox finden Sie in Help Center und File Exchange

Produkte

Version

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

DDPG Agent OU noise options to favour exploration

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (0)

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

DDPG Agent OU noise options to favour exploration

0 Kommentare -2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (0)

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden