Reinforcement learning agent stops training unexpectedly?

lab

15 Aug. 2022

1 Antwort

Aktualisiert 26 Jan. 2023

4 Ansichten (30 Tage)

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Ältere Kommentare anzeigen

0 Stimmen

when I use DDPG agent to train a model, the agent stops training at 199th episode. However, the maximum episode number I set is 500. In addition, the average reward 143.9382 is far lower than the termination condition value 500. The final result shows "traning finished after all agents reached stop training criteria". I am confused what does it mean?

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Antworten (1)

Emmanouil Tzorakoleftherakis am 26 Jan. 2023

0 Stimmen

The picture also mentioned why training stopped in the last two rows. In this case, it seems you have a criterion that stops training when the average steps in an episode reach 500.

The max episode number kicks in if no other criterion is satisfied by the time training reaches the specified episode number.