Training and group matrices for classifying data

Question

0 Stimmen

I am getting this error when trying to classify matrix data:

Error using classify (line 220) TRAINING must have more observations than the number of groups.

My classification data matrix is 10x5, my training matrix is 2x5, and my group vector is of length 2:

classificationFeatureValues =

0e+004 *
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114

training =

0e+004 *
0005    0.0683    0.0063    3.3502    0.0113
0006    0.0761    0.0065    3.7003    0.0114

group =

1 2

I can't seem to find the error here...

Steve

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Follow Question

Answer 1

Greg Heath am 27 Jul. 2012

Bearbeitet: Greg Heath am 27 Jul. 2012

In MATLAB Online öffnen

0 Stimmen

In order to use CLASSIFY, each class should have enough training points, Ntrni (i=1:c), to obtain accurate estimates of the mean and covariance matrix. The typical rule of thumb is that the number of vector measurements is much greater than the number of estimated parameters. For each class of p-dimensional vectors the Bayesian quadratic classifier requires

Ntrni >> numel(mean) + numel(cov) = p + p*(p+1)/2 = p*(p+3)/2

In addition, each class should have enough testing points Ntsti, to obtain accurate performance estimates on nontraining data (generalization). For classification, the errors are assumed to be Binomially distributed with approximate standard deviation

stdvei = sqrt(ei*(1-ei)/Ntsti ), ~0.05 <= ei <= ~0.95

It is desirable that stdvei << ei. Since max(ei*(1-ei)) = 0.25 , the typical rule of thumb for ei > ~0.05

Ntsti >> 19 >= (1-ei)/ei

For smaller errors Ntsti should be larger. Check a stats handbook for a more accurate estimate of stdvei for ei < 0.05.

If N is not large enough to obtain an adequate Ntrni/Ntsti division, crossvalidation or bootstrapping should be used.

For 10-fold crossfold validation of the the 3-class Fisher iris data with 50 4-dimensional inputs per class, Ntrni = 45 and the ratio per class is

r = 2*Ntrni/(p*(p+3)) < 90/(4*7) = 3.2

For the Bayesian linear classifier, the pooled covariance matrix is estimated yielding

 3*Ni >> 3*p + p*(p+1)/2 = p*(p+7)/2
 r = 6*Ni/(p*(p+7)) = 270/(4*11) = 6.1

For a LMSE (e.g.,backslash) linear classifier

 Ni >> p + 1
 r = 45/5 = 9

Therefore I suggest that you use

 1. Raw data (i.e., NOT medians or means)...increases Ni
 2. A backslash LMSE classifier... decreases no. of estimated parameters
W*[ones(1,N);traininput] = target    % Columns of eye(c) for c classes
W = target/[ones(1,Ntrn);traininput]  % Ntrn = sum(Ntrni)
output = W*[ones(1,size(input,2);input] 
3. Bootstrapping or crossvalidation

Hope this helps.

Greg

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Answer 2

Ilya am 24 Jul. 2012

0 Stimmen

classify needs to estimate either the pooled-in covariance matrix (for linear discriminant) or covariance matrix for each class (for quadratic discriminant). You can't estimate covariance if you have one observation per class. What is observed variance for a single observation?

With so little data, you should use all of it for training and estimate classification error by cross-validation. If you have 2011b or later, I would recommend ClassificationDiscriminant for an easier workflow.

2 Kommentare
Keine anzeigen Keine ausblenden

steve am 25 Jul. 2012

The rows of classificationFeatureValues each corresponds to the median values of five variables from a data acquisition, thus ten acquisitions. Should I instead include all my data from the acquisitions before finding the median values?

Or do I simply need to expand my training matrix...

Thanks,

Steve

Ilya am 25 Jul. 2012

Sorry, I couldn't understand what you are saying about your acquisition.

Given the signature

classify(SAMPLE,TRAINING,GROUP)

You cannot perform discriminant analysis when your TRAINING matrix has only one observation (row) per class (distinct value in GROUP). The more observations you have for training, the more accurate your model is going to be. Take a look at examples in classify help or doc to see how GROUP and TRAINING are formed.

Melden Sie sich an, um zu kommentieren.

Answer 3

Greg Heath am 26 Jul. 2012

In MATLAB Online öffnen

0 Stimmen

For the quadratic classifier, CLASSIFY requires full rank covariance matrices for each group. However, for the linear classifier, it only requires the pooled covariance matrix to have full rank.

Neither of these conditions hold. If you combine the training and test data and use format short g you will get

close all, clear all, clc

ClassificationFeatureValues = 1.0e+004 *[...

0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114
0006    0.0761    0.0065    3.7003    0.0113
0005    0.0683    0.0063    3.3502    0.0114 ]

Training =1.0e+004 *[...

    0.0005    0.0683    0.0063    3.3502    0.0113
    0.0006    0.0761    0.0065    3.7003    0.0114]

group = [ 1 2 ]

format short g

x = [ClassificationFeatureValues; Training]

x =

            6          761           65        37003          113
            5          683           63        33502          114
            6          761           65        37003          113
            5          683           63        33502          114
            6          761           65        37003          113
            5          683           63        33502          114
            6          761           65        37003          113
            5          683           63        33502          114
            6          761           65        37003          113
            5          683           63        33502          114
            5          683           63        33502          113
            6          761           65        37003          114

If you look closely at the 12 5-dimensional data points You will see that they collapse into two points. Therefore the data is only 1-dimensional and no formal classification is needed.

I usually recommend that, before classifier design, you should get a "feel" for the data via

plots and outlier checks 
SVD  condition and rank checks 
Clustering

For example

>> svdx = svd(x)

svdx =

>> condx = cond(x)

condx = 9.5233e+020

>> tol = max(size(x)) * eps(norm(x))

tol = 1.7462e-010

>> rankx = rank(x,tol))

rankx = 3 % Too conservative

>> svdx/max(svdx)

   ans =
                 1
        0.00020092    % Essentially one-dimensional
       5.6997e-006
       1.2204e-018
       1.0501e-021

Perhaps using your raw data will make the analysis more interesting.

Hope this helps.

Greg

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Answer 4

steve am 26 Jul. 2012

In MATLAB Online öffnen

0 Stimmen

I now have distinct median data from an experiment with 20 sample acquisitions of four classifiable groups (every forth trial is a similar trial to the previous forth), so hopefully the data can be classified into four groups with five acquisitions each.

classificationFeatureValues =

0e+007 *
0000    0.0001    0.0000    0.0065    0.0000
0000    0.0000    0.0000    0.0010    0.0000
0000    0.0007    0.0000    0.0486    0.0000
0002    0.0100    0.0000    1.0413    0.0000
0000    0.0000    0.0000    0.0011    0.0000
0000    0.0000    0.0000    0.0014    0.0000
0000    0.0000    0.0000    0.0028    0.0000
0001    0.0100    0.0000    0.9119    0.0000
0000    0.0001    0.0000    0.0087    0.0000
0000    0.0000    0.0000    0.0010    0.0000
0000    0.0000    0.0000    0.0020    0.0000
0000    0.0018    0.0000    0.1253    0.0000
0000    0.0000    0.0000    0.0056    0.0000
0000    0.0000    0.0000    0.0040    0.0000
0000    0.0000    0.0000    0.0014    0.0000
0002    0.0100    0.0000    1.1621    0.0000
0000    0.0000    0.0000    0.0042    0.0000
0000    0.0000    0.0000    0.0011    0.0000
0000    0.0000    0.0000    0.0018    0.0000
0002    0.0100    0.0000    1.1730    0.0000

You can see in column four how every forth trial is similar to the previous forth. My main question is, to classify data you have to design your own training matrix, MATLAB can't do this for you? Would it be smart to make the training matrix the first four samples, or should i be doing something slightly more complex like with mean values?

2 Kommentare
Keine anzeigen Keine ausblenden

steve am 26 Jul. 2012

I tried this:

training = classificationFeatureValues(1:4,:); group = [1,2,3,4]; class = classify(classificationFeatureValues,training,group);

And got this error:

Error using classify (line 220) TRAINING must have more observations than the number of groups.

Which doesn't make sense because I thought that the number of groups had to equal the number of rows in training.

Ilya am 26 Jul. 2012

In MATLAB Online öffnen

Again:

You cannot perform discriminant analysis when your TRAINING matrix has only one observation (row) per class (distinct value in GROUP).

If you type 'help classify', the very first example gives you:

load fisheriris
x = meas(51:end,1:2);  % for illustrations use 2 species, 2 columns
y = species(51:end);

Could you please look at the content of y. There are two distinct values there, 'versicolor' and 'virginica'. These are classes. Rows 1:50 in x are for class 'versicolor', and so you have 50 observations for this class. Rows 51:100 are for class 'virginica', and you have 50 observations for that class too.

Melden Sie sich an, um zu kommentieren.

Answer 5

steve am 31 Jul. 2012

0 Stimmen

Is it more statistically sound to build a random training matrix with half the data or to make the training matrix out of the entire data set?

1 Kommentar
-1 ältere Kommentare anzeigen -1 ältere Kommentare ausblenden

Oleg Komarov am 31 Jul. 2012

Please use comments. Who are you addressing with this question? If it is a standalone question open a new thread, however this doesn't sound like a MATLAB question and you might have more chances asking in math/stat forums.

Melden Sie sich an, um zu kommentieren.

Training and group matrices for classifying data

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Akzeptierte Antwort

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Weitere Antworten (4)

2 Kommentare
Keine anzeigen Keine ausblenden

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

2 Kommentare
Keine anzeigen Keine ausblenden

1 Kommentar
-1 ältere Kommentare anzeigen -1 ältere Kommentare ausblenden

Kategorien

Tags

Community Treasure Hunt

Training and group matrices for classifying data

0 Kommentare -2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Akzeptierte Antwort

0 Kommentare -2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

Weitere Antworten (4)

2 Kommentare Keine anzeigen Keine ausblenden

0 Kommentare -2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

2 Kommentare Keine anzeigen Keine ausblenden

1 Kommentar -1 ältere Kommentare anzeigen -1 ältere Kommentare ausblenden

Kategorien

Tags

Siehe auch

Community Treasure Hunt

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

2 Kommentare
Keine anzeigen Keine ausblenden

0 Kommentare
-2 ältere Kommentare anzeigen -2 ältere Kommentare ausblenden

2 Kommentare
Keine anzeigen Keine ausblenden

1 Kommentar
-1 ältere Kommentare anzeigen -1 ältere Kommentare ausblenden