MDCGen v2
MDCGen offers highly flexible parameterization, generating clusters of varied shapes from diverse underlying distributions. Clusters can be built from multivariate distributions, or from distributions that directly determine intra-cluster distances (i.e., the distances of objects to their cluster centroid). MDCGen also implements classic functionalities such as customizable cluster separation, overlap control, outliers and noisy features, correlated variables, rotations, and dataset quality evaluation.
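To illustrate what it means for a distribution to determine intra-cluster distances, here is a minimal Python sketch. It is not MDCGen's code or API: the function name `sample_cluster_by_distance`, its parameters, and the uniform radius distribution are all assumptions made for this example. Each point gets a random direction and a distance to the centroid drawn from the chosen distribution.

```python
import math
import random


def sample_cluster_by_distance(centroid, n_points, radius_sampler, rng):
    """Place points around `centroid` so that their distances to it
    follow `radius_sampler` (a callable returning a non-negative float).

    Hypothetical helper for illustration only; not part of MDCGen.
    """
    dim = len(centroid)
    points = []
    for _ in range(n_points):
        # A normalized Gaussian vector gives a uniformly random direction.
        direction = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(sum(d * d for d in direction))
        if norm == 0.0:
            # Extremely unlikely; fall back to a fixed axis.
            direction = [1.0] + [0.0] * (dim - 1)
            norm = 1.0
        radius = radius_sampler(rng)
        points.append(
            [c + radius * d / norm for c, d in zip(centroid, direction)]
        )
    return points


rng = random.Random(0)
centroid = [0.5, 0.5, 0.5]
# Assumed choice: distances uniform in [0.1, 0.2], giving a shell-shaped cluster.
shell = sample_cluster_by_distance(
    centroid, 200, lambda r: r.uniform(0.1, 0.2), rng
)
distances = [math.dist(p, centroid) for p in shell]
print(min(distances) >= 0.1 - 1e-9, max(distances) <= 0.2 + 1e-9)
```

Swapping the radius sampler (for example, for a Gaussian or gamma draw) changes the cluster's density profile without touching the direction logic, which is the idea behind distance-driven clusters as opposed to clusters sampled from a multivariate distribution.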
Because MDCGen allows such broad variety and flexibility, some configurations can produce meaningless or useless datasets. Some experience with the parameters is therefore advisable (the documentation explains them in detail). To validate a dataset, Silhouette evaluations provide performance indices that indicate whether the generated data follows a clear cluster-like structure.
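As a rough illustration of how a Silhouette index reflects cluster structure, the following Python sketch computes the standard mean silhouette coefficient with Euclidean distance. It is an assumption that this matches the spirit of MDCGen's evaluation; the tool's own implementation and output format may differ, and the function name `mean_silhouette` and the toy data are hypothetical.

```python
import math


def mean_silhouette(points, labels):
    """Mean silhouette coefficient (Euclidean distance).

    For each point, a = mean distance to its own cluster and
    b = smallest mean distance to any other cluster; s = (b - a) / max(a, b).
    """
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # singleton clusters contribute s = 0 by convention
        a = sum(math.dist(points[i], points[j]) for j in own if j != i)
        a /= len(own) - 1
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        if denom > 0.0:
            total += (b - a) / denom
    return total / len(points)


# Toy data: two compact, well-separated groups.
points = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]
score_clear = mean_silhouette(points, [0, 0, 0, 1, 1, 1])
# Same points with labels that ignore the geometry.
score_mixed = mean_silhouette(points, [0, 1, 0, 1, 0, 1])
print(score_clear > score_mixed)
```

Values close to 1 suggest compact, well-separated clusters, while values near or below 0 suggest that the labels do not match the geometry, which is the kind of signal that flags a poorly chosen configuration.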
Denis Ojdanic revised and improved MDCGen v1, developing the current MDCGen v2.
Cite as
Felix Iglesias (2024). MDCGen v2 (https://github.com/CN-TU/mdcgen-matlab), GitHub. Retrieved .
F. Iglesias, T. Zseby, D. Ferreira and A. Zimek. MDCGen: Multidimensional Dataset Generator for Clustering. Journal of Classification (2019). https://doi.org/10.1007/s00357-019-9312-3
Platform compatibility
Windows, macOS, Linux
config_build/src
config_build/test
extra_tools
manual_tests
mdcgen/src
mdcgen/test
Version | Published | Release notes
---|---|---
2.0.2 | | Typos corrected
2.0.1 | | MathWorks image added
2.0.0 | |