Why is my neural network performing worse as the number of hidden layers increases?
bitslice
on 2 Aug 2015
Commented: Greg Heath on 5 Aug 2015
Hello, I am currently using the MATLAB Neural Network Toolbox to experiment with the Iris dataset. I am training with the "trainlm" algorithm, and I decided to see what would happen if I trained with 1 to 20 hidden layers. I was not expecting any change in the classification error, but when I do this, I get the following output:
![](https://www.mathworks.com/matlabcentral/answers/uploaded_files/147648/image.jpeg)
I have been looking for an explanation, but I cannot see why the classification error begins to jump around, or why it increases at all, as the number of hidden layers increases.
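Roughly, the experiment looks like this (a minimal sketch; the width of 10 neurons per hidden layer is an illustrative choice, and performFcn is set to 'mse' because, as far as I know, the Jacobian-based trainlm cannot use patternnet's default crossentropy loss):
[ input, target ] = iris_dataset; % 4x150 inputs, 3x150 one-hot targets
err = zeros(1, 20);
for L = 1:20
    net = patternnet(10 * ones(1, L)); % L hidden layers of 10 neurons each
    net.trainFcn = 'trainlm'; % Levenberg-Marquardt
    net.performFcn = 'mse'; % trainlm needs a Jacobian-capable loss; depending
                            % on release it may also need a non-softmax output
    net.trainParam.showWindow = false;
    net = train(net, input, target);
    y = net(input);
    err(L) = 100 * mean(vec2ind(y) ~= vec2ind(target)); % percent misclassified
end
plot(1:20, err)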
Thank you.
0 Comments
Accepted Answer
Greg Heath
on 2 Aug 2015
The ultimate goal is to obtain a net that performs well on non-training data that comes from the same or a similar source as the training data. This is called GENERALIZATION.
Frequent causes of failure are
1. Not enough weights to adequately characterize the training data
2. Training data does not adequately characterize the salient features of non-training data because of measurement error, transcription error, noise, interference, or insufficient sample size and variability
3. Fewer training equations than unknown weights.
4. Random weight initialization
Various techniques used to mitigate these causes are
1. Remove bad data and outliers (plots help)
2. Use enough training data to sufficiently characterize non-training data.
3. Use enough weights to adequately characterize the training data
4. Use more training equations than unknown weights. The stability of solutions w.r.t. noise and errors increases as the equations-to-weights ratio increases.
5. Use the best of multiple random initialization & data-division designs
6. K-fold Cross-validation
7. Validation Stopping
8. Regularization (items 7 and 8 are sketched below)
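A minimal sketch of items 7 and 8, using patternnet defaults where possible (the regularization value 0.1 is only an illustrative assumption, not a recommendation):
net = patternnet(10); % 10 hidden nodes, illustrative
net.divideFcn = 'dividerand'; % random trn/val/tst division (the default)
net.divideParam.trainRatio = 0.70;
net.divideParam.valRatio = 0.15; % the validation subset drives early stopping
net.divideParam.testRatio = 0.15;
net.trainParam.max_fail = 6; % stop after 6 straight validation failures (the default)
net.performParam.regularization = 0.1; % blend mean squared weights into the loss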
For the iris_dataset
[ input, target ] = iris_dataset; % load Fisher's iris data
[ I N ] = size(input) % [ 4 150 ]
[ O N ] = size(target) % [ 3 150 ]
Assuming the default 0.7/0.15/0.15 trn/val/tst data division, the number of training equations is approximately
Ntrneq = 0.7*N*O % 315
Assuming the default I-H-O node topology, the number of unknown weights is
Nw = (I+1)*H+(H+1)*O = (I+O+1)*H + O
Obviously, Nw <= Ntrneq when H <= Hub (the upper bound), where
Hub = floor( (Ntrneq-O)/(I+O+1)) % 39
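For example, at the largest size tried in the question,
H = 20;
Nw = (I+1)*H + (H+1)*O % 163, well below Ntrneq = 315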
Expecting decent solutions for H <= 20 seems reasonable. However, to mitigate the effects of random weight initialization and random data division, design 10 nets for each value of H, as sketched below.
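A minimal sketch of that design, assuming patternnet with its default trainscg training and dividerand division (names such as Hvec, Ntrials, and bestErr are illustrative only):
[ input, target ] = iris_dataset;
Hvec = 1:20; % candidate numbers of hidden nodes
Ntrials = 10; % nets designed per candidate H
bestErr = inf(size(Hvec));
for k = 1:numel(Hvec)
    for trial = 1:Ntrials
        net = patternnet(Hvec(k)); % fresh random weights and data division
        net.trainParam.showWindow = false;
        [ net, tr ] = train(net, input, target);
        ytst = net(input(:, tr.testInd)); % judge on the test subset only
        ttst = target(:, tr.testInd);
        e = mean(vec2ind(ytst) ~= vec2ind(ttst)); % test misclassification rate
        bestErr(k) = min(bestErr(k), e); % keep the best of the Ntrials designs
    end
end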
I have posted zillions of examples in both the NEWSGROUP and ANSWERS. I use patternnet for classification.
Hope this helps.
Thank you for formally accepting my answer
Greg
5 Comments
More Answers (1)
Walter Roberson
on 2 Aug 2015
Each layer is initialized randomly. If you do not provide enough data to train the effects of the randomness out, then the cumulative randomness of the layers carries through to the result.
3 Comments
Walter Roberson
on 2 Aug 2015
Greg Heath has written several times about the amount of data that one should use, but I cannot think of good keywords to search for at the moment.