Deep reinforcement learning for multi-agents

Question

beni hadi am 20 Nov. 2020

1
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/654548-deep-reinforcement-learning-for-multi-agents

Kommentiert: beni hadi am 25 Nov. 2020

Akzeptierte Antwort: Emmanouil Tzorakoleftherakis

By the multi-agent deep reinforcement learning toolbox, three agents are trained. The reward changes are as shown in the picture. Why do agents' rewards decrease and converge to an unfavorable situation after the reward increases and they move towards desired performance? I expected the process of increasing the rewards and achieving the desired goal to continue as the episode progresses. According to the picture, from episode 700, agents converge to undesired situations, and they didn't change their states.

Thank you.

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Answer 1

Emmanouil Tzorakoleftherakis am 22 Nov. 2020

1
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/654548-deep-reinforcement-learning-for-multi-agents#answer_552608

Bearbeitet: Emmanouil Tzorakoleftherakis am 22 Nov. 2020

Hello,

The policies you will get from RL training change depending on the amount of time the agents spend exploring. Usually, if you see a situation like this where agents converge to a non-ideal solution, you may want to change the agent options to increase exploration.

Hope that helps

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

beni hadi am 25 Nov. 2020

Thank you for your help.

Melden Sie sich an, um zu kommentieren.

Deep reinforcement learning for multi-agents

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Akzeptierte Antwort

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Community Treasure Hunt

Deep reinforcement learning for multi-agents

0 Kommentare -2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Akzeptierte Antwort

1 Kommentar -1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Community Treasure Hunt

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

1 Kommentar
-1 ältere Kommentare anzeigen-1 ältere Kommentare ausblenden