DQN learns at first but then worsens.

Question

Khandakar Rashid am 20 Apr. 2021

0
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/807947-dqn-learns-at-first-but-then-worsens

Kommentiert: Emmanouil Tzorakoleftherakis am 23 Apr. 2021

Hi, I am training a DQN agent with a simevent model. I am testing out different hyperparameters, but everytime the agent learns (reward goes higher) at first for a while, but then goes down. I have tested different learning rate, exploration epsilon, and discount factors. But the shape of training progress is pretty much same in all combinations. Is there any potential way I can fix this issue?

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Answer 1

Emmanouil Tzorakoleftherakis am 22 Apr. 2021

0
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/807947-dqn-learns-at-first-but-then-worsens#answer_682275

Bearbeitet: Emmanouil Tzorakoleftherakis am 22 Apr. 2021

To confirm that this is an exploration issue, can you try setting the EpsilonMin param to a high value? e.g. 0.99. If after doing that you still see the same result, there is likely something else going on.

2 Kommentare
Keine anzeigenKeine ausblenden

Khandakar Rashid am 23 Apr. 2021

Thank you Emmanouil for the suggestion. I have tried Epsilon = 1, EpsilonMin=0.99. Unfortunately, no luck :(

Do you have any other tips?

Emmanouil Tzorakoleftherakis am 23 Apr. 2021

Hard to tell, but it's strange to me that the episode curve is similar every time. That makes me think that there is something specific about the way you have modeled your environment model that guides the training through a similar path each time.

Melden Sie sich an, um zu kommentieren.

DQN learns at first but then worsens.

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (1)

2 Kommentare
Keine anzeigenKeine ausblenden

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

DQN learns at first but then worsens.

0 Kommentare -2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (1)

2 Kommentare Keine anzeigenKeine ausblenden

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

2 Kommentare
Keine anzeigenKeine ausblenden