Can we use k-means clustering on a 100×34596 matrix?

bamini thavarajah on 9 Nov 2017
Commented: Image Analyst on 5 Dec 2017
  • I have a feature matrix of size 100×34596, with class labels.
  • The matrix is very large, and I want to cluster it.
  • Is that possible?
  • I have no experience with k-means.
  • Could you help me?

Answers (1)

Walter Roberson on 9 Nov 2017
Just make sure to pass data to kmeans such that the rows correspond to observations (points).
The class table will not be of any use for kmeans clustering: kmeans is a method of trying to figure out which values belong together without knowledge of the class labels.
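A minimal sketch of that call, assuming the Statistics and Machine Learning Toolbox is installed. The random matrix stands in for the real feature data, and the cluster count k = 3 is an arbitrary assumption, since the question does not say how many clusters are expected.

```matlab
% Hypothetical stand-in for the real data: 100 observations x 34596 features.
rng(0);                          % fix the seed so the run is repeatable
X = rand(100, 34596);

% If the data were stored as features x observations, transpose first:
% X = X.';

k = 3;                           % assumed number of clusters (not given in the question)
[idx, C] = kmeans(X, k, 'Replicates', 5);   % class labels are not passed in

size(idx)   % one cluster index per row (observation)
size(C)     % one centroid row per cluster, one column per feature
```

With only 100 observations the run itself is cheap; the open question is whether distances in 34596 dimensions are meaningful for the data.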
  1 Comment
Image Analyst on 5 Dec 2017
Hopefully you have 34,596 observations and 100 features (which still seems ridiculously high) and not 100 observations of 34,596 features (which I don't see as possible in most normal/typical scenarios).
Anyway, if you have some of them labeled already, then you can use knnsearch() to label/classify any other, non-labeled data.
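One way the knnsearch() idea could look, as a sketch on synthetic data. The variable names, the two-class setup, and the choice of K = 3 neighbors with a majority vote are all assumptions made for illustration, not something stated in the thread.

```matlab
rng(1);
Xlabeled   = [randn(20, 5); randn(20, 5) + 4];   % 40 labeled observations, 5 features
ylabeled   = [ones(20, 1); 2 * ones(20, 1)];     % their known class labels
Xunlabeled = [randn(5, 5); randn(5, 5) + 4];     % 10 observations to classify

K = 3;                                            % assumed neighbor count
nbrIdx = knnsearch(Xlabeled, Xunlabeled, 'K', K); % 10-by-K row indices into Xlabeled
ypred  = mode(ylabeled(nbrIdx), 2);               % majority vote across each row
```

Here knnsearch only finds the nearest labeled rows; the vote that turns neighbors into a predicted label is a separate step.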
kmeans will come up with its own labels. If your data are all labeled already, then all you can do is pass in the feature values without the labels, as Walter said, and see how the clusters kmeans chose differ from the labels your existing method produced (human judging, for example). You can then create a confusion matrix or ROC curve.
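A rough sketch of that comparison using confusionmat, again on synthetic data with assumed names and an assumed two-class layout. Note that kmeans numbers its clusters arbitrarily, so a good match can show up on the off-diagonal rather than the diagonal.

```matlab
rng(2);
X     = [randn(30, 4); randn(30, 4) + 5];   % 60 observations, 4 features
ytrue = [ones(30, 1); 2 * ones(30, 1)];     % labels from the pre-existing method

idx = kmeans(X, 2, 'Replicates', 5);        % labels are not passed in

% Rows are the known classes, columns are the kmeans cluster numbers.
cm = confusionmat(ytrue, idx);
disp(cm)
```

Each row of cm shows how one known class was split across the clusters, which is enough to see whether the two labelings agree up to a renumbering.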
