Applying reinforcement learning with two continuous actions. During training one varies but the other is virtually static.

3 Ansichten (letzte 30 Tage)

Bay Jay am 4 Jan. 2023

0
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/1888637-applying-reinforcement-learning-with-two-continuous-actions-during-training-one-varies-but-the-othe

Kommentiert: Emmanouil Tzorakoleftherakis am 15 Feb. 2023

Hello,

I am trying to train the DDPG agent to control the vehicle's (model:Kinetmatic) steering angle and velocity. The purpose is to train the agent so the vehicle can move from an initial x,y, theta position to final x,y,theta position. One agent is to perform both actions.

The ranges are [-0.78,+0.78] and [-2.5 and 2.5]. In the actor network, a tanh is used and scaling [0.78; 2.5]. During the training, I realised the steering angle is not changing=>stuck at 0.78, but the velocity varies and this affects the training. What could be the reason for this? Is a single agent okay to perform the task? I am still learning RL. Any suggestion would be helpful.

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Antworten (1)

Emmanouil Tzorakoleftherakis am 24 Jan. 2023

0
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/1888637-applying-reinforcement-learning-with-two-continuous-actions-during-training-one-varies-but-the-othe#answer_1155585

You should be able to use a single agent for this task. Since you are using DDPG, the first thing I would check is whether the noise options are set properly for both inputs.

5 Kommentare
3 ältere Kommentare anzeigen3 ältere Kommentare ausblenden

Bay Jay am 15 Feb. 2023

Yes, it is the IsDone signal. I have tried to open the link you shared but its not working. Could you share again

Emmanouil Tzorakoleftherakis am 15 Feb. 2023

Edited the link

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Kategorien

AI and Statistics Deep Learning Toolbox Applications Autonomous and Control Systems Reinforcement Learning

Mehr zu Reinforcement Learning finden Sie in Help Center und File Exchange

Produkte

Version

R2022b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

Applying reinforcement learning with two continuous actions. During training one varies but the other is virtually static.

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (1)

5 Kommentare
3 ältere Kommentare anzeigen3 ältere Kommentare ausblenden

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

Applying reinforcement learning with two continuous actions. During training one varies but the other is virtually static.

0 Kommentare -2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Antworten (1)

5 Kommentare 3 ältere Kommentare anzeigen3 ältere Kommentare ausblenden

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

5 Kommentare
3 ältere Kommentare anzeigen3 ältere Kommentare ausblenden