R/smote.R
smote.Rd
In each iteration, samples one minority class element x1, then one of x1's nearest neighbors: x2. Both points are now interpolated / convex-combined, resulting in a new virtual data point x3 for the minority class.
The method handles factor features, too. The gower distance is used for nearest neighbor calculation, see cluster::daisy. For interpolation, the new factor level for x3 is sampled from the two given levels of x1 and x2 per feature.
smote(task, rate, nn = 5L, standardize = TRUE, alt.logic = FALSE)
task | (Task) |
---|---|
rate | ( |
nn | ( |
standardize | ( |
alt.logic | ( |
Task.
Chawla, N., Bowyer, K., Hall, L., & Kegelmeyer, P. (2000) SMOTE: Synthetic Minority Over-sampling TEchnique. In International Conference of Knowledge Based Computer Systems, pp. 46-57. National Center for Software Technology, Mumbai, India, Allied Press.
Other imbalancy:
makeOverBaggingWrapper()
,
makeUndersampleWrapper()
,
oversample()