How to select the number of samples to train a Machine Learning algorithm?

Question

Jose Marques on 31 Jan 2019

Direct link to this question

https://de.mathworks.com/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm

Commented: Greg Heath on 4 Feb 2019

I am working with a dataset of 12000 samples covering about 5 years of an industrial process.

It is likely that the plant has changed during this time (equipment, performance degradation, chemical products).

Is there a tool for identifying the best subset of this data? In my view, a temporal cut in the data could increase the quality of the models created.
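One way to test the temporal-cut idea is to hold out the most recent block of samples, train on windows of different lengths that end just before that block, and keep the window with the lowest hold-out error. The sketch below is only an illustration under stated assumptions: it is written in Python, the series is synthetic (a hypothetical level shift stands in for a plant change), and the "model" is a placeholder that predicts the training mean. The names `make_drifting_series`, `holdout_rmse` and `best_window` are made up for this example.

```python
import random
import statistics

def make_drifting_series(n=1200, shift_at=800, seed=0):
    """Synthetic stand-in for plant data: the target level jumps at shift_at."""
    rng = random.Random(seed)
    return [(5.0 if t < shift_at else 8.0) + rng.gauss(0.0, 0.5) for t in range(n)]

def holdout_rmse(series, window, n_holdout):
    """Fit on the last `window` samples before the hold-out block, score on the block."""
    train_end = len(series) - n_holdout
    train = series[train_end - window:train_end]
    prediction = statistics.fmean(train)  # placeholder model: predict the training mean
    squared_errors = [(y - prediction) ** 2 for y in series[train_end:]]
    return statistics.fmean(squared_errors) ** 0.5

def best_window(series, candidate_windows, n_holdout):
    """Return the window length with the lowest hold-out RMSE, plus all scores."""
    scores = {w: holdout_rmse(series, w, n_holdout) for w in candidate_windows}
    return min(scores, key=scores.get), scores

series = make_drifting_series()
window, scores = best_window(series, [100, 200, 400, 800, 1000], n_holdout=100)
for w in sorted(scores):
    print(f"window={w:5d}  hold-out RMSE={scores[w]:.3f}")
print("selected window:", window)
```

In this toy series the windows that reach back past the shift score worse, because they mix two operating regimes. With real data the placeholder would be replaced by the actual model, and the hold-out block should stay strictly later in time than every training window.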

3 Comments

Jose Marques on 31 Jan 2019

Thanks for the comment!

The dataset has 426 inputs (I am using techniques for feature selection too).

I am using four algorithms to create the models: Regression Tree, Bagged Trees, SVM and Neural Networks.

Greg Heath on 4 Feb 2019

As a common sense rule of thumb I try to use at least 10 to 30 times as many training points as unknown parameters that have to be estimated.
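To make the rule of thumb concrete for a neural network, the arithmetic can be sketched as follows. This is a rough illustration, not a statement of how any toolbox sizes a network: it assumes a single hidden layer, one output, and that all 12000 samples are used for training. The function names are hypothetical, and the input counts 50 and 20 are made-up examples of what feature selection might leave.

```python
def mlp_weight_count(n_inputs, n_hidden, n_outputs):
    """Weights plus biases of a network with one hidden layer."""
    return (n_inputs + 1) * n_hidden + (n_hidden + 1) * n_outputs

def max_hidden_units(n_train, n_inputs, n_outputs, ratio):
    """Largest hidden-layer size with n_train >= ratio * number of weights."""
    h = 0
    while ratio * mlp_weight_count(n_inputs, h + 1, n_outputs) <= n_train:
        h += 1
    return h

n_train = 12000  # assumption: every sample is used for training
for n_inputs in (426, 50, 20):  # 426 is from the thread; 50 and 20 are hypothetical
    for ratio in (10, 30):
        h = max_hidden_units(n_train, n_inputs, n_outputs=1, ratio=ratio)
        print(f"inputs={n_inputs:3d}  ratio={ratio:2d}  max hidden units={h}")
```

Under these assumptions, all 426 inputs leave room for only 2 hidden units at a ratio of 10 and none at a ratio of 30, which is one way to see why feature selection matters for this dataset.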

In addition I use 10 to 20 sets of random initial weights.
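The reason for several random initializations is that network training is a non-convex problem, so a single run can stop in a poor local minimum. A minimal sketch of the idea, assuming a toy one-parameter loss in place of a real network's training error (the loss, the function names and the step size are all made up for illustration):

```python
import random

def loss(w):
    """Toy non-convex loss with a poor local minimum and a better global one."""
    return w ** 4 - 3 * w ** 2 + w

def grad(w):
    return 4 * w ** 3 - 6 * w + 1

def gradient_descent(w, lr=0.01, n_steps=500):
    """Plain gradient descent from the starting value w."""
    for _ in range(n_steps):
        w -= lr * grad(w)
    return w

def multi_start(n_starts=10, seed=0):
    """Run training from several random initial values and keep the best result."""
    rng = random.Random(seed)
    runs = [gradient_descent(rng.uniform(-2.0, 2.0)) for _ in range(n_starts)]
    return min(runs, key=loss), runs

best_w, runs = multi_start()
print("final losses:", sorted(round(loss(w), 3) for w in runs))
print("best w:", round(best_w, 3), "loss:", round(loss(best_w), 3))
```

For a real model the selection should use a validation error rather than the training loss, so that the chosen run is not simply the one that overfits most.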

I assume, of course, that you have examined plots of the data to initialize your common sense.

Hope this Helps

Greg

Answer 1

BERGHOUT Tarek on 3 Feb 2019

Direct link to this answer

https://de.mathworks.com/matlabcentral/answers/442422-how-to-select-the-number-of-samples-to-train-a-machine-learning-algorithm#answer_359276

You can use deep belief networks; they are very good for feature selection and mapping. Train your network on chunks of data by randomly choosing pairs of (inputs, targets). At the same time, pay attention to your approximation function: you must keep your error function at its local minimum. Deep belief nets depend on a set of stacked autoencoders, which allows all the parameters of the network to be tuned with a small amount of training data.

https://www.youtube.com/watch?v=E2Mt_7qked0
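Training on randomly chosen chunks of (inputs, targets) pairs is usually called mini-batch training. The sketch below shows only that sampling step and is not a deep belief network: it assumes a linear model fitted by stochastic gradient descent on synthetic, noise-free data, and the name `minibatch_sgd` and all parameter values are made up for illustration.

```python
import random

def minibatch_sgd(pairs, batch_size=16, n_steps=2000, lr=0.1, seed=0):
    """Fit y = w * x + b by gradient steps on randomly drawn chunks of pairs."""
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    for _ in range(n_steps):
        batch = rng.sample(pairs, batch_size)  # random chunk of (input, target) pairs
        grad_w = sum((w * x + b - y) * x for x, y in batch) / batch_size
        grad_b = sum((w * x + b - y) for x, y in batch) / batch_size
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Synthetic, noise-free data generated from y = 2x + 1 (an assumption for the demo).
xs = [-1.0 + 2.0 * i / 199 for i in range(200)]
pairs = [(x, 2.0 * x + 1.0) for x in xs]

w, b = minibatch_sgd(pairs)
print(f"w={w:.4f}  b={b:.4f}")
```

Each step sees only 16 of the 200 pairs, yet the parameters still approach the generating values because the random chunks are unbiased samples of the full dataset.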
