Description object for a task. It encapsulates the basic properties of the task without having to store the complete data set.
Id string of task.
Type of task: “classif” for classification, “regr” for regression, “surv” for survival analysis, “cluster” for cluster analysis, “costsens” for cost-sensitive classification, and “multilabel” for multilabel classification.
Name(s) of the target variable(s). For “surv” these are the names of the survival time and event columns, so the vector has length 2. For “costsens” it has length 0, as there is no target column but a cost matrix instead. For “multilabel” these are the names of the logical columns that indicate whether a class label is present, so the number of target variables equals the number of classes.
Number of cases in data set.
Number of features, given as a named vector with the entries “numerics”, “factors”, “ordered”, and “functionals”.
Are missing values present?
Are weights specified for each observation?
Is a blocking factor for cases available in the task?
All possible classes. Only present for “classif”, “costsens”, and “multilabel”.
Positive class label for binary classification. Only present for “classif”; NA for multiclass.
Negative class label for binary classification. Only present for “classif”; NA for multiclass.
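
The properties listed above can be illustrated with a small, self-contained sketch. This is not the library's own implementation: the class `TaskDesc`, the function `describe_classif_task`, and all field names are hypothetical, chosen only to mirror the descriptions in the list. The sketch covers the “classif” case only, treats every non-numeric feature as a factor, and assumes that the first class level (in sorted order) is the positive class when none is given. Python's `None` stands in for NA.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Optional, Sequence


@dataclass(frozen=True)
class TaskDesc:
    """Hypothetical description object; field names are assumptions."""
    id: str
    type: str
    target: List[str]
    size: int
    n_feat: Dict[str, int]
    has_missings: bool
    has_weights: bool
    has_blocking: bool
    class_levels: Optional[List[str]] = None
    positive: Optional[str] = None
    negative: Optional[str] = None


def describe_classif_task(
    task_id: str,
    rows: Sequence[Dict[str, Any]],
    target: str,
    weights: Optional[Sequence[float]] = None,
    blocking: Optional[Sequence[Any]] = None,
    positive: Optional[str] = None,
) -> TaskDesc:
    """Summarise a classification data set without keeping the rows."""
    features = [name for name in rows[0] if name != target]

    # Simplification: numbers count as "numerics", everything else as
    # "factors"; "ordered" and "functionals" are never detected here.
    n_feat = {"numerics": 0, "factors": 0, "ordered": 0, "functionals": 0}
    for name in features:
        values = [r[name] for r in rows if r[name] is not None]
        is_numeric = all(
            isinstance(v, (int, float)) and not isinstance(v, bool)
            for v in values
        )
        n_feat["numerics" if is_numeric else "factors"] += 1

    levels = sorted({r[target] for r in rows})

    # Positive/negative labels only make sense for exactly two classes;
    # for multiclass problems both stay None (the NA case).
    pos = neg = None
    if len(levels) == 2:
        pos = positive if positive is not None else levels[0]  # assumed default
        if pos not in levels:
            raise ValueError(f"positive class {pos!r} is not a class level")
        neg = next(level for level in levels if level != pos)

    return TaskDesc(
        id=task_id,
        type="classif",
        target=[target],
        size=len(rows),
        n_feat=n_feat,
        has_missings=any(r[name] is None for r in rows for name in features),
        has_weights=weights is not None,
        has_blocking=blocking is not None,
        class_levels=levels,
        positive=pos,
        negative=neg,
    )


rows = [
    {"age": 31, "city": "Oslo", "churn": "yes"},
    {"age": None, "city": "Bergen", "churn": "no"},
    {"age": 47, "city": "Oslo", "churn": "no"},
]
desc = describe_classif_task("churn-demo", rows, target="churn", positive="yes")
print(desc)
```

The description keeps only summary values (sizes, counts, flags, labels), so it stays small however large the data set is.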