I have three questions regarding the difference between a closed-loop and an open-loop narxnet, and it's behavior.
First, a little about the problem I'm trying to solve. I have an 2xN dimensional matrix of observation (X), from which I'm trying to predict an output Y, 1xN. Now, a narxnet takes as input both X and Y. The Matlab documentation says that an open-loop narxnet finds a function 'f' where y(t) = f( y(t-1), y(t-2), x(t-1), x(t-2) ), for a delay of 2. However, the results that I get are much more accurate than I expect them to be. This suggests that the narxnet uses the actual y(t) as input as well. When I convert the open-loop to a closed-loop, and retrain it, I get much more reasonable results, not good, but reasonable.
1.a) What is the actual input to an open-loop narxnet, and the closed-loop. When trying to predict y(t), are the inputs [ y(t), y(t-1), y(t-2), x(t-1), x(t-2) ] or [ y(t-1), y(t-2), x(t-1), x(t-2) ], for both?
1.b) For my problem I need the inputs to be [ y(t-1), y(t-2), x(t), x(t-1), x(t-2) ], 0 to 2 delays on the X, and 1 to 2 delays on the Y. How can I do that?
2. For comparison, I've trained my open-loop, and tested it, then convert the open-loop to a closed-loop using
netc = closeloop(net);
then trained the closed-loop from scratch and tested it. I've read on MATLAB Answers that I should start training the closed-loop with the weights from the open-loop net. How can I used the weights of one net as the initial weights for training another?
3. I understand that the narxnet uses a combination of NNs and difference equations. I problem I run into when using a closed-loop narxnet is that my predictions begin to oscillate and explode exponential. I know that this problem occurs in a difference equation, such as y(t)=a*y(t-1)+b, when 'a' is greater than 1. What, in a closed-loop narxnet could cause this similar issue?
I didn't include any code because my questions need high-level understanding and explanations, which I am lacking.
A huge thank you in advance,
- Yevgeniy

1 Kommentar

Mike Vargas
Mike Vargas am 3 Nov. 2016
OMG. Same question. This has been bothering me forever.

Melden Sie sich an, um zu kommentieren.

 Akzeptierte Antwort

Greg Heath
Greg Heath am 4 Nov. 2016

4 Stimmen

% NARXNET closed-loop vs open-loop
% Asked by Yevgeniy Arabadzhi
% 8:38 PM Thursday, November 3, 2016
%
% I have three questions regarding the difference between
% a closed-loop and an open-loop narxnet, and it's behavior.
%
% First, a little about the problem I'm trying to solve. I
% have an 2xN dimensional matrix of observation (X), from
% which I'm trying to predict an output Y, 1xN. Now, a
% narxnet takes as input both X and Y. The Matlab
% documentation says that an open-loop narxnet finds a
% function 'f' where y(t) = f( y(t-1), y(t-2), x(t-1),
% x(t-2) ), for a delay of 2.
What you are overlooking is the role of the target; In fact, you never mention the target!
The target is used to design and operate the OL net. To emphasize this point I prefer to use the following notation:
Uppercase for cells
Lowercase for doubles and integers
X,x for inputs
T,t for targets
Y,y for outputs
i,j for timestep indices
% However, the results that I % get are much more accurate than I expect them to be. % This suggests that the narxnet uses the actual y(t) % as input as well.
ABSOLUTELY NOT! You just happened to have an easy problem (which you did not identify!). There are a variety of sample datasets in the help and doc documentations. Use the commands
help nndatasets
doc nndatsets
% When I convert the open-loop to a closed-loop, and retrain it, % I get much more reasonable results, not good, but reasonable.
Typically, when you convert from OL to CL and compare the results over the length of the target, you will get a spectrum of results: from excellent to lousy. Again, this will be demonstated if you try some of the sample datasets.
I have posted tutorials on narxnet design. I gather from this post that you are not familiar with them.
% 1.a) What is the actual input to an open-loop narxnet, % and the closed-loop. When trying to predict y(t), are the % inputs [ y(t), y(t-1), y(t-2), x(t-1), x(t-2) ] or % [ y(t-1), y(t-2), x(t-1), x(t-2) ], for both?
ID = 1:2, FD = 1:2, H = 10 % DEFAULTs
OL: y(i) = f(x(i-1),x(i-2),t(i-1),t(i-2)); i = 3:N
CL: y(i) = f(x(i-1),x(i-2),y(i-1),y(i-2)); i = 3:N
ID = 0:2, FD = 0:2, H = 10
OL: y(i) = f(x(i),x(i-1),x(i-2),t(i),t(i-1),t(i-2)); i=3:N
CL: y(i) = f(x(i),x(i-1),x(i-2),y(i-1),y(i-2)); i = 3:N
NOTE THAT y(i) FEEDBACK IS NOT ALLOWED FOR CL !
% 1.b) For my problem I need the inputs to be [ y(t-1), % y(t-2), x(t), x(t-1), x(t-2) ], 0 to 2 delays on the X, % and 1 to 2 delays on the Y. How can I do that?
See above
% 2. For comparison, I've trained my open-loop, and tested % it, then convert the open-loop to a closed-loop using % netc = closeloop(net);
Next, compare the OL and CL on the OL design data.
% then trained the closed-loop from scratch and tested it. % I've read on MATLAB Answers that I should start training % the closed-loop with the weights from the open-loop net. % How can I used the weights of one net as the initial % weights for training another?
netc = closeloop(neto); netc = train(netc,X,Xoi,Aoi);
% 3. I understand that the narxnet uses a combination of % NNs and difference equations. I problem I run into when % using a closed-loop narxnet is that my predictions begin % to oscillate and explode exponential. I know that this % problem occurs in a difference equation, such as % y(t)=a*y(t-1)+b, when 'a' is greater than 1. What, in a % closed-loop narxnet could cause this similar issue?
Since the output containing errors is fed back to the input, the cause is, basically, similar.
% I didn't include any code because my questions need % high-level understanding and explanations, which I am % lacking. % % A huge thank you in advance, % % Yevgeniy
No, thank you. I'm sure more people than you realize will be thankful for this post.
Hope this helps
Greg

2 Kommentare

Yevgeniy Arabadzhi
Yevgeniy Arabadzhi am 4 Nov. 2016
Greg,
Thanks for the detailed response. This answered my questions.
In my problem, I have 250 hours of accelerometer data. The accelerometer was attached to a hospital patient. I extract different features from this accelerometer, such as the number of movement made in an hour, or an average duration of movements in each hour. This is my X , a 250xN matrix, where N is the number of features I use. T, the targets, are the activity scores that a physician assigned to the patient for each hour, on scale 1-10. I want to build a model that can predict the activity score, without the physician. My goal is to use a set number of hours of data: the observations X, and targets T, to train the net, then I want to make predictions based only on the inputs X and the predictions(output) Y.
This is why I think the a very good result, like the one I get from the OL narxnet isn't realistic. Since human behavior is not consistent.
You've already cleared up alot for me, but do you have any last suggestions that might pertain to my problem.
Thank you very much,
Yevgeniy
Akash Menon
Akash Menon am 1 Feb. 2019
Thanks for the answer Greg! That clears up some questions I had too!

Melden Sie sich an, um zu kommentieren.

Weitere Antworten (1)

Greg Heath
Greg Heath am 4 Nov. 2016
Bearbeitet: Greg Heath am 4 Nov. 2016

0 Stimmen

Quick answer:
Openloop: Target and input signals are combined to form
a vector input for designing the net by estimating
weights and biases.
Closeloop: Target values are replaced by feedback
signals from the output so that the net is able to
predict outputs beyond the extent of the target signal.
Hope this helps.
Thank you for formally accepting my answer
Greg

3 Kommentare

Yevgeniy Arabadzhi
Yevgeniy Arabadzhi am 4 Nov. 2016
Bearbeitet: Yevgeniy Arabadzhi am 4 Nov. 2016
Hi Greg,
Thanks for your quick reply. Could I get the long version of that. Specifically in terms of y(t) and x(t), because it get a little confusing when talking about "target and Inputs", since I don't know if at any given step you are referring to y(t) or y(t-1), or x(t) or x(t-1).
Thanks again,
P.S. Also, thanks for inderectly helping me with many problems over the years. I wouldn't survive junior year without you.
Yevgeniy
Ran Wei
Ran Wei am 12 Feb. 2019
Greg,
I have found your answers to many questions helpful for understanding NARX. Given a dataset X with dimension MxN, and response Y with dimension MX1, is it true that an open loop NARX predicts values based on past Y's? I am having a situation where the NARX net performance is almost equal to NAR. However, the performance of timedelay network is very bad with low precision and very low recall. The goal for me is to rely on X as predictors instead of Y.
Thank you,
Ran
Greg Heath
Greg Heath am 13 Feb. 2019
CLARIFICATION
TARGET = "DESIRED" OUTPUT
OLNARX : ONLY USED FOR DESIGN
OUTPUT DEPENDS ON INPUT + TARGET
CLNARX: USED FOR PREDICTION
OUTPUT DEPENDS ON INPUT + FEDBACKOUTPUT
Hope this helps
Greg

Melden Sie sich an, um zu kommentieren.

Kategorien

Mehr zu Deep Learning Toolbox finden Sie in Hilfe-Center und File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by