microsoftml.count_select: Feature selection based on counts

Usage

microsoftml.count_select(cols: [list, str], count: int = 1, **kargs)

Description

Selects the features for which the count of non-default values is greater than or equal to a threshold.

Details

When using the count mode in feature selection transform, a feature is selected if the number of examples have at least the specified count examples of non-default values in the feature. The count mode feature selection transform is very useful when applied together with a categorical hash transform (see also, categorical_hash. The count feature selection can remove those features generated by hash transform that have no data in the examples.

Arguments

cols

Specifies character string or list of the names of the variables to select.

count

The threshold for count based feature selection. A feature is selected if and only if at least count examples have non-default value in the feature. The default value is 1.

kargs

Additional arguments sent to compute engine.

Returns

An object defining the transform.

See also

mutualinformation_select