My controlled object has three inputs and three outputs. I now use three parallel reinforcement learning agents to control these three channels. Is such a scheme feasible? I would be very grateful for your answer.

Answers (1)

Venu on 12 Dec 2023
You can use parallel computing to train multiple reinforcement learning agents, so using three parallel agents to control these three channels is workable in principle.
You can refer to the documentation below and the examples it provides:
https://www.mathworks.com/help/reinforcement-learning/ug/train-agents-using-parallel-computing-and-gpu.html#d126e14864
Here's how you can achieve parallel training of multiple agents based on the above documentation:
1. Create a parallel pool: Start by creating a parallel pool of workers with the parpool function, specifying the number of workers (N) for your pool.
pool = parpool(N);
2. Configure parallel training: When training your agents using multiple processes, pass an "rlTrainingOptions" object to the train function. In this object, set the "UseParallel" property to true to enable parallel computing for training.
trainOpts = rlTrainingOptions('UseParallel', true);
3. Experience-based parallelization: In this mode, each worker simulates the agent within its own copy of the environment and sends experience data back to the client. The client computes gradients from the experiences, updates the agent parameters, and sends the updated parameters back to the workers.
4. Asynchronous and synchronous training: In asynchronous training, the client calculates gradients and updates agent parameters as soon as experiences arrive, without waiting for all workers. In synchronous training, the client waits to receive experiences from all workers and only then calculates gradients from all of them.
trainOpts.ParallelizationOptions.Mode = "async";
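Putting the steps above together, a minimal sketch might look like the following (the environment and agent are assumed to be created beforehand; the worker count, episode limits, and stopping criteria here are illustrative, not prescribed):

```matlab
% Start a parallel pool of workers (4 is an illustrative choice)
pool = parpool(4);

% Configure parallel, asynchronous training
trainOpts = rlTrainingOptions( ...
    'UseParallel', true, ...
    'MaxEpisodes', 1000, ...
    'StopTrainingCriteria', 'AverageReward', ...
    'StopTrainingValue', 500);
trainOpts.ParallelizationOptions.Mode = "async";

% env and agent are assumed to exist already, e.g. an environment from
% rlSimulinkEnv or rlPredefinedEnv and an agent such as rlDDPGAgent
trainingStats = train(agent, env, trainOpts);
```

Workers simulate episodes in their own copies of env and stream experiences back to the client, which performs the gradient updates.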
Hope this helps!
3 Comments
Venu on 21 Dec 2023
Hi @嘻嘻, when using three parallel agents for training, conflicts may arise due to the interdependencies between the system's inputs and outputs. So it depends on the combination of your system characteristics, training approach, reward design, and control mechanism.
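If the three channels turn out to be strongly coupled, one common alternative is a single agent whose action is a 3-element vector, so that one policy coordinates all channels at once. A minimal sketch of the action and observation specifications, assuming continuous signals (the bounds and the 6-signal observation design are illustrative assumptions):

```matlab
% One agent commanding all three channels: a 3-dimensional continuous
% action space (limits of +/-1 are an illustrative assumption)
actInfo = rlNumericSpec([3 1], 'LowerLimit', -1, 'UpperLimit', 1);

% Observation: e.g. the three outputs and their tracking errors
% (choosing 6 signals is an assumption about the observation design)
obsInfo = rlNumericSpec([6 1]);
```

These specs would then be used when constructing the environment and a single agent, instead of three separate single-channel agents.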
Sam Chak on 21 Dec 2023
@嘻嘻, is your coupled system a control-affine system, something like ẋ = f(x) + g(x)·u, where u1, u2, u3 are the inputs and y1, y2, y3 are the outputs?

Version: R2022a
