Package: classreg.learning.partition
Cross-validated classification model
ClassificationPartitionedModel
is a set of
classification models trained on cross-validated folds. Estimate the quality of
classification by cross validation using one or more “kfold” methods:
kfoldPredict
, kfoldLoss
, kfoldMargin
, kfoldEdge
, and kfoldfun
.
Every “kfold” method uses models trained on in-fold observations to predict the response for out-of-fold observations. For example, suppose you cross validate using five folds. In this case, the software randomly assigns each observation into five roughly equally sized groups. The training fold contains four of the groups (i.e., roughly 4/5 of the data) and the test fold contains the other group (i.e., roughly 1/5 of the data). In this case, cross validation proceeds as follows:
The software trains the first model (stored in
CVMdl.Trained{1}
) using the observations in the last
four groups and reserves the observations in the first group for
validation.
The software trains the second model (stored in
CVMdl.Trained{2}
) using the observations in the first
group and last three groups, and reserves the observations in the second
group for validation.
The software proceeds in a similar fashion for the third to fifth models.
If you validate by calling kfoldPredict
, it computes predictions for
the observations in group 1 using the first model, group 2 for the second model, and so
on. In short, the software estimates a response for every observation using the model
trained without that observation.
creates a cross-validated classification model from a classification model
(CVMdl
=
crossval(Mdl
)Mdl
).
Alternatively:
CVDiscrMdl = fitcdiscr(X,Y,Name,Value)
CVKNNMdl = fitcknn(X,Y,Name,Value)
CVNBMdl = fitcnb(X,Y,Name,Value)
CVSVMMdl = fitcsvm(X,Y,Name,Value)
CVTreeMdl = fitctree(X,Y,Name,Value)
create a cross-validated model when name
is either
'CrossVal'
, 'KFold'
,
'Holdout'
, 'Leaveout'
, or
'CVPartition'
. For syntax details, see fitcdiscr
, fitcknn
, fitcnb
, fitcsvm
, and fitctree
.
|
A classification model.
|
|
Bin edges for numeric predictors, specified as a cell array of p numeric vectors, where p is the number of predictors. Each vector includes the bin edges for a numeric predictor. The element in the cell array for a categorical predictor is empty because the software does not bin categorical predictors. The software bins numeric predictors only if you specify the You can reproduce the binned predictor data X = mdl.X; % Predictor data
Xbinned = zeros(size(X));
edges = mdl.BinEdges;
% Find indices of binned predictors.
idxNumeric = find(~cellfun(@isempty,edges));
if iscolumn(idxNumeric)
idxNumeric = idxNumeric';
end
for j = idxNumeric
x = X(:,j);
% Convert x to array if x is a table.
if istable(x)
x = table2array(x);
end
% Group x into bins by using the Xbinned
contains the bin indices, ranging from 1 to the number of bins, for numeric predictors.
Xbinned values are 0 for categorical predictors. If
X contains NaN s, then the corresponding
Xbinned values are NaN s.
| ||||||||||||||||||||
|
Categorical predictor
indices, specified as a vector of positive integers. If | ||||||||||||||||||||
| Unique class labels used in training the model, specified as a categorical or character array, logical or numeric vector, or cell array of character vectors. | ||||||||||||||||||||
|
Square matrix, where If CVModel.Cost = CostMatrix; | ||||||||||||||||||||
|
Name of the cross-validated model, which is a character vector. | ||||||||||||||||||||
|
Number of folds used in cross-validated model, which is a positive integer. | ||||||||||||||||||||
|
Object holding parameters of | ||||||||||||||||||||
|
The partition of class | ||||||||||||||||||||
| Predictor variable names, specified as a cell array of character vectors. The order of the elements of | ||||||||||||||||||||
|
Numeric vector of prior probabilities for each class. The order of the
elements of If CVModel.Prior = priorVector; | ||||||||||||||||||||
| Response variable name, specified as a character vector. | ||||||||||||||||||||
| Score transformation, specified as a character vector or function handle.
To change the score transformation function to
| ||||||||||||||||||||
|
The trained learners, which is a cell array of compact classification models. | ||||||||||||||||||||
|
The scaled | ||||||||||||||||||||
|
A matrix or table of predictor values. Each column of | ||||||||||||||||||||
|
Categorical or character array, logical or numeric vector, or cell array
of character vectors specifying the class labels for each observation.
|
kfoldEdge | Classification edge for observations not used for training |
kfoldLoss | Classification loss for observations not used for training |
kfoldMargin | Classification margins for observations not used for training |
kfoldPredict | Predict response for observations not used for training |
kfoldfun | Cross validate function |
Value. To learn how value classes affect copy operations, see Copying Objects.
To estimate posterior probabilities of trained, cross-validated SVM classifiers, use
fitSVMPosterior
.