Tune PI Controller Using Reinforcement Learning

Question

0 Stimmen

How is the initial value of the weight of this neural network determined? If I want to change my PI controller to a PID controller, do I just add another weight to this row that is initialGain = single([1e-3 2])?

This code is from the demo "Tune PI Controller Using Reinforcement Learning."

initialGain = single([1e-3 2]);

actorNet = [

featureInputLayer(numObs)

fullyConnectedPILayer(initialGain,'ActOutLyr')

];

actorNet = dlnetwork(actorNet);

actor = rlContinuousDeterministicActor(actorNet,obsInfo,actInfo);

Can my network be changed to look like the following：

actorNet= [

featureInputLayer(numObs)

fullyConnectedPILayer(randi([-60,60],1,3), 'Action')]

3 Kommentare
1 älteren Kommentar anzeigen 1 älteren Kommentar ausblenden

嘻嘻 am 18 Okt. 2023

I want the weights of the network to represent the controller parameters, the input of the network to represent the error and the error integral and its first derivative, and the final output of the network to be the control instructions

嘻嘻 am 18 Okt. 2023

I'm not really sure. What do you think of this scheme?

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Answer 1

Emmanouil Tzorakoleftherakis am 23 Okt. 2023

0 Stimmen

I also replied to the other thread. The fullyConnectedPILayer is a custom layer provided in the example - you can open it and see how it's implemented. So you can certainly add a third weight for the D term, but you will most likely run into other issues (e.g. how to approximate the error derivative)

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Tune PI Controller Using Reinforcement Learning

3 Kommentare
1 älteren Kommentar anzeigen 1 älteren Kommentar ausblenden

Akzeptierte Antwort

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Weitere Antworten (0)

Kategorien

Tags

Community Treasure Hunt

Tune PI Controller Using Reinforcement Learning

3 Kommentare 1 älteren Kommentar anzeigen 1 älteren Kommentar ausblenden

Akzeptierte Antwort

0 Kommentare -2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Weitere Antworten (0)

Kategorien

Tags

Siehe auch

Community Treasure Hunt

3 Kommentare
1 älteren Kommentar anzeigen 1 älteren Kommentar ausblenden

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden