Noise parameters in Reinforcement learning DDPG

Question

0 votes

What values should the agent's noise parameters have in DDPG reinforcement learning if my action range is between -0.5 and -5? I want to explore the whole action range at each sample time. Also, is there any way to make the agent's noise options independent of the sample time?


Answer 1

Drew Davis on 19 Jun 2019

Edited: Drew Davis on 19 Jun 2019

3 votes

Hi Surya

For Ornstein-Uhlenbeck (OU) action noise, it is fairly common to set Variance*sqrt(SampleTime) to somewhere between 1% and 10% of your action range. Your action range has a width of 4.5 (from -5 to -0.5), so the variance can be set between 4.5*0.01/sqrt(SampleTime) and 4.5*0.10/sqrt(SampleTime). The other important factor is VarianceDecayRate, which dictates how fast the variance decays. You can calculate how many samples it takes for the variance to halve with this simple formula:

halflife = log(0.5)/log(1-VarianceDecayRate)
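To get a feel for the numbers, here is a minimal sketch of the half-life formula above. It is written in plain Python rather than MATLAB so it runs without the toolbox; the function name `variance_half_life` and the sample decay rates are illustrative choices, not values from this thread.

```python
import math

def variance_half_life(decay_rate):
    """Steps needed for the variance to halve when it is multiplied
    by (1 - decay_rate) at every step."""
    if not 0.0 < decay_rate < 1.0:
        raise ValueError("decay_rate must be strictly between 0 and 1")
    return math.log(0.5) / math.log(1.0 - decay_rate)

# Illustrative decay rates (assumed, not taken from the thread).
for rate in (1e-3, 1e-4, 1e-5):
    steps = variance_half_life(rate)
    print(f"decay rate {rate:g}: about {steps:,.0f} steps to halve")
```

A decay rate of 1e-3 halves the variance in roughly 700 steps, while 1e-5 takes roughly 70,000 steps, so small changes in the rate matter a lot over a long training run.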

It is critically important for your agent to explore while learning, so keeping VarianceDecayRate small (or even zero) is a good idea. The other noise parameters can usually be left at their defaults.
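Putting numbers on the 1% to 10% rule of thumb above, the following sketch computes the variance bounds for the action range in the question. It is plain Python arithmetic, not toolbox code; the helper name `ou_variance_bounds` and the sample time of 0.25 s are assumptions made for illustration.

```python
import math

def ou_variance_bounds(action_low, action_high, sample_time,
                       low_frac=0.01, high_frac=0.10):
    """Return (min, max) Variance such that Variance*sqrt(sample_time)
    lies between low_frac and high_frac of the action range width."""
    if sample_time <= 0:
        raise ValueError("sample_time must be positive")
    width = abs(action_high - action_low)
    scale = math.sqrt(sample_time)
    return low_frac * width / scale, high_frac * width / scale

# Action range from the question: -5 to -0.5, so the width is 4.5.
# sample_time = 0.25 is an assumed value; substitute your agent's sample time.
var_min, var_max = ou_variance_bounds(-5.0, -0.5, sample_time=0.25)
print(var_min, var_max)
```

With this assumed sample time the bounds come out at roughly 0.09 and 0.9. A smaller sample time gives larger variance values, because the same per-step noise amplitude is divided by a smaller sqrt(SampleTime).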

You can check out this pendulum example which does a pretty good job of exploring during training.

The noise options inherit their sample time from the agent, so you do not need to configure it. By default, the noise model is queried at the same rate as the agent.
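Since the noise model runs at the agent's sample time, it may help to see where the sample time enters a single noise update. The sketch below assumes a discretized OU update of the form x + MeanAttractionConstant*(Mean - x)*Ts + Variance*randn*sqrt(Ts); check the toolbox documentation for the exact rule in your release. It is plain Python, and the function name `ou_step` and all parameter values are illustrative.

```python
import math
import random

def ou_step(x, mean, attraction, variance, sample_time, z):
    """One discrete OU noise update (assumed form, see the text above).

    z is a standard normal sample, passed in so the step is reproducible.
    """
    drift = attraction * (mean - x) * sample_time       # pull back toward the mean
    diffusion = variance * z * math.sqrt(sample_time)   # random kick, scaled by sqrt(Ts)
    return x + drift + diffusion

# Illustrative values only.
rng = random.Random(0)
x = 0.0
for _ in range(5):
    x = ou_step(x, mean=0.0, attraction=0.15, variance=0.3,
                sample_time=0.25, z=rng.gauss(0.0, 1.0))
    print(round(x, 4))
```

Under this assumed form, the random kick per step has a standard deviation of Variance*sqrt(SampleTime), which is why the rule of thumb is stated in terms of that product rather than Variance alone.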

Hope this helps

Drew

5 comments (3 older comments not shown)

Drew Davis on 9 Dec 2019

You can derive this formula pretty easily:

decayfactor = 0.5 = (1 - decayrate)^(#steps)
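Spelled out step by step, with r standing for the decay rate and n for the number of steps (symbols introduced here only for brevity):

```latex
\begin{align*}
(1 - r)^{n} &= 0.5 \\
n \,\log(1 - r) &= \log(0.5) \\
n &= \frac{\log(0.5)}{\log(1 - r)}
\end{align*}
```

This is the half-life formula from the answer, with n as halflife and r as VarianceDecayRate.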

Rajesh Siraskar on 10 Dec 2019

Thank you Drew

Answer 2

Atikah Surriani on 30 Apr 2023

0 votes

Can I change the noise model of DDPG in MATLAB? For example, the original DDPG uses OU noise, but in my study I would like to use Gaussian noise instead.

3 comments (1 older comment not shown)

Atikah Surriani on 8 May 2023

Thank you for the answer. So can we change the noise option of DDPG in MATLAB?

For example, could we replace

rl.option.OrnsteinUhlenbeckActionNoise

with "rl.option.gaussianActionNoise" or "rl.option.anythingActionNoise", or something else?

Thank you.

Atikah Surriani on 8 May 2023

Or can we modify the noise in some other way?
