CountVectorizerModel.SetMinTF(Double) Method

Definition

Sets the filter that ignores rare words in a document. For each document, terms whose frequency or count is below the given threshold are ignored. If the value is greater than or equal to 1, it specifies a count (the number of times the term must appear in the document); if the value is in [0,1), it specifies a fraction of the document's token count.

Note that this parameter is used only when CountVectorizerModel transforms data; it does not affect fitting.
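The two threshold modes described above can be compared with a minimal sketch. `MinTFSketch`, `EffectiveThreshold`, and `Filter` are hypothetical names introduced only for illustration; they are not part of Microsoft.Spark, and the sketch assumes every token counts toward the document's token count and ignores the model's vocabulary.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical helper, not part of Microsoft.Spark. It only mirrors the
// thresholding rule described in the text so the two modes can be compared.
public static class MinTFSketch
{
    // A value >= 1 is treated as an absolute count; a value in [0, 1) is
    // treated as a fraction of the document's total token count.
    public static double EffectiveThreshold(double minTF, int tokenCount) =>
        minTF >= 1.0 ? minTF : minTF * tokenCount;

    // Keeps only the terms whose count within the document reaches the threshold.
    public static Dictionary<string, int> Filter(string[] tokens, double minTF)
    {
        double threshold = EffectiveThreshold(minTF, tokens.Length);
        return tokens
            .GroupBy(t => t)
            .Where(g => g.Count() >= threshold)
            .ToDictionary(g => g.Key, g => g.Count());
    }

    public static void Main()
    {
        // Six tokens: "a" appears 3 times, "b" twice, "c" once.
        string[] doc = { "a", "a", "a", "b", "b", "c" };

        // Count mode: a term must appear at least 2 times.
        Console.WriteLine(string.Join(", ", Filter(doc, 2.0).Keys.OrderBy(k => k)));

        // Fraction mode: a term must make up at least half of the 6 tokens.
        Console.WriteLine(string.Join(", ", Filter(doc, 0.5).Keys.OrderBy(k => k)));
    }
}
```

With the sample document, count mode (2.0) keeps the terms "a" and "b", while fraction mode (0.5) works out to a threshold of 3 occurrences and keeps only "a".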

public Microsoft.Spark.ML.Feature.CountVectorizerModel SetMinTF (double value);
member this.SetMinTF : double -> Microsoft.Spark.ML.Feature.CountVectorizerModel
Public Function SetMinTF (value As Double) As CountVectorizerModel

Parameters

value
Double

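Minimum term frequency: a count if the value is greater than or equal to 1, or a fraction of the document's token count if the value is in [0,1).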

Returns

A new CountVectorizerModel with the minimum term frequency set.

Applies to