k-fold Cross Validation in Classification Learner

Hi there, Just wandering how data is partitioned in k-fold cross validation in Classification Learner? Is the data partitioned into k-folds according to class/label or complete random? That is, Is data split up into n subsets according to n classes, partitioned in the class subsets, and then grouped together into the folds used to train/test?

Antworten (1)

ahmed nebli
ahmed nebli am 22 Aug. 2018

0 Stimmen

in k-fold cv, the the data is splited to k-1 for taining and 1 for testing.(e.g.: if you have 100 subject and you use 10-fold cv it would be splited into 90 subject for training and 10 subjects for testing and you make k iteration each time the date is re-splited to 90 and 10.

1 Kommentar

James
James am 22 Aug. 2018
Bearbeitet: James am 22 Aug. 2018
Yes but is the data as a whole split in one step, or are the class subsets split and then merged? So in your e.g. you mentioned subjects, which I presume equate to data instances, but each instance is labelled. My question is whether data is split according to labels so for e.g.2 if there are 4 classes with 100 instances each with 25 instances, is the data split as a whole, or is the data split within the class subsets and then merged to maintain the proportionality?

Melden Sie sich an, um zu kommentieren.

Gefragt:

am 22 Aug. 2018

Bearbeitet:

am 22 Aug. 2018

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by