First, calls generateFilterValuesData.
Features are then selected via select
and val
.
filterFeatures( task, method = "randomForestSRC_importance", fval = NULL, perc = NULL, abs = NULL, threshold = NULL, fun = NULL, fun.args = NULL, mandatory.feat = NULL, select.method = NULL, base.methods = NULL, cache = FALSE, ... )
task | (Task) |
---|---|
method | ( |
fval | (FilterValues) |
perc | ( |
abs | ( |
threshold | ( |
fun | ( |
fun.args | (any) |
mandatory.feat | (character) |
select.method | If multiple methods are supplied in argument |
base.methods | If |
cache | ( |
... | (any) |
Task.
If cache = TRUE
, the default mlr cache directory is used to cache
filter values. The directory is operating system dependent and can be
checked with getCacheDir()
.
The default cache can be cleared with deleteCacheDir()
.
Alternatively, a custom directory can be passed to store the cache.
Note that caching is not thread safe. It will work for parallel computation on many systems, but there is no guarantee.
Besides passing (multiple) simple filter methods you can also pass an
ensemble filter method (in a list). The ensemble method will use the simple
methods to calculate its ranking. See listFilterEnsembleMethods()
for
available ensemble methods.
Other filter:
generateFilterValuesData()
,
getFilteredFeatures()
,
listFilterEnsembleMethods()
,
listFilterMethods()
,
makeFilterEnsemble()
,
makeFilterWrapper()
,
makeFilter()
,
plotFilterValues()
# simple filter filterFeatures(iris.task, method = "FSelectorRcpp_gain.ratio", abs = 2)#> Supervised task: iris-example #> Type: classif #> Target: Species #> Observations: 150 #> Features: #> numerics factors ordered functionals #> 2 0 0 0 #> Missings: FALSE #> Has weights: FALSE #> Has blocking: FALSE #> Has coordinates: FALSE #> Classes: 3 #> setosa versicolor virginica #> 50 50 50 #> Positive class: NA# ensemble filter filterFeatures(iris.task, method = "E-min", base.methods = c("FSelectorRcpp_gain.ratio", "FSelectorRcpp_information.gain"), abs = 2)#> Supervised task: iris-example #> Type: classif #> Target: Species #> Observations: 150 #> Features: #> numerics factors ordered functionals #> 2 0 0 0 #> Missings: FALSE #> Has weights: FALSE #> Has blocking: FALSE #> Has coordinates: FALSE #> Classes: 3 #> setosa versicolor virginica #> 50 50 50 #> Positive class: NA