This example shows how to construct support vector machine (SVM) classifiers in the
Classification Learner app, using the
ionosphere data set that
contains two classes. You can use a support vector machine (SVM) with two or more
classes in Classification Learner. An SVM classifies data by finding the best hyperplane
that separates all data points of one class from those of another class. In the
ionosphere data, the response variable is categorical with two
g represents good radar returns, and
represents bad radar returns.
In MATLAB®, load the
ionosphere data set and define some
variables from the data set to use for a classification.
load ionosphere ionosphere = array2table(X); ionosphere.Group = Y;
Alternatively, you can load the
ionosphere data set and
Y data as separate
On the Apps tab, in the Machine Learning and Deep Learning group, click Classification Learner.
On the Classification Learner tab, in the File section, click New Session > From Workspace.
In the New Session from Workspace dialog box, select the table
ionosphere from the Data Set
Variable list. Observe that the app has selected response and
predictor variables based on their data type. The response variable
Group has two levels. All the other variables are
Alternatively, if you kept your predictor data
Y as two separate variables, you can first
select the matrix
X from the Data Set
Variable list. Then, under Response, click
the From workspace option button and select
Y from the list. The
Y variable is the
same as the
Click Start Session.
Classification Learner creates a scatter plot of the data.
Use the scatter plot to visualize which variables are useful for predicting the response. Select different variables in the X- and Y-axis controls. Observe which variables separate the class colors most clearly.
To create a selection of SVM models, on the Classification Learner tab, in the Model Type section, click the down arrow to expand the list of classifiers, and under Support Vector Machines, click All SVMs.
Then click Train.
If you have Parallel Computing Toolbox™, you can train all the models (All SVMs) simultaneously by selecting the Use Parallel button in the Training section before clicking Train. After you click Train, the Opening Parallel Pool dialog box opens and remains open while the app opens a parallel pool of workers. During this time, you cannot interact with the software. After the pool opens, the app trains the models simultaneously.
Classification Learner trains one of each nonoptimizable SVM classification option in the gallery, and highlights the best score. The app outlines in a box the Accuracy (Validation) score of the best model. Classification Learner also displays a validation confusion matrix for the first SVM model (Linear SVM).
Validation introduces some randomness into the results. Your model validation results can vary from the results shown in this example.
To view the results for a model, select the model in the Models pane, and inspect the Current Model Summary pane. The Current Model Summary pane displays the Training Results metrics, calculated on the validation set.
For the selected model, inspect the accuracy of the predictions in each class. On the Classification Learner tab, in the Plots section, click the arrow to open the gallery, and then click Confusion Matrix (Validation) in the Validation Results group. View the matrix of true class and predicted class results.
Select the other models in the Models pane, open the validation confusion matrix for each of the models, and then compare the results.
Choose the best model (the best score is highlighted in a box). To improve the model, try including different features in the model. See if you can improve the model by removing features with low predictive power.
On the Classification Learner tab, in the Features section, click Feature Selection. In the Feature Selection dialog box, specify predictors to remove from the model, and click OK. In the Training section, click Train to train a new model using the new options. Compare results among the classifiers in the Models pane.
To investigate features to include or exclude, use the parallel coordinates plot. On the Classification Learner tab, in the Plots section, click the arrow to open the gallery, and click Parallel Coordinates in the Validation Results group.
Choose the best model in the Models pane. To try to improve the model further, try changing SVM settings. On the Classification Learner tab, in the Model Type section, click Advanced. In the Advanced SVM Options dialog box, try changing a setting and click OK. Train the new model by clicking Train in the Training section. For information on settings, see Support Vector Machines.
You can export a full or compact version of the trained model to the workspace. On the Classification Learner tab, in the Export section, click Export Model and select either Export Model or Export Compact Model. See Export Classification Model to Predict New Data.
To examine the code for training this classifier, click Generate Function. For SVM models, see also Generate C Code for Prediction.
Use the same workflow to evaluate and compare the other classifier types you can train in Classification Learner.
To try all the nonoptimizable classifier model presets available for your data set:
Click the arrow on the far right of the Model Type section to expand the list of classifiers.
Click All, then click Train.
To learn about other classifier types, see Train Classification Models in Classification Learner App.