Merges small levels of factors into new level.
Source:R/mergeSmallFactorLevels.R
mergeSmallFactorLevels.Rd
Merges factor levels that occur only infrequently into combined levels with a higher frequency.
Arguments
- task
(Task)
The task.- cols
(character) Which columns to convert. Default is all factor and character columns.
- min.perc
(
numeric(1)
)
The smallest levels of a factor are merged until their combined proportion w.r.t. the length of the factor exceedsmin.perc
. Must be between 0 and 1. Default is 0.01.- new.level
(
character(1)
)
New name of merged level. Default is “.merged”
See also
Other eda_and_preprocess:
capLargeValues()
,
createDummyFeatures()
,
dropFeatures()
,
normalizeFeatures()
,
removeConstantFeatures()
,
summarizeColumns()
,
summarizeLevels()