CountVectorizerModel.SetMinTF(Double) Method
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Sets the filter that ignores rare words in a document. For each document, terms whose frequency or count is below the given threshold are ignored. If the value is greater than or equal to 1, it specifies a count (the number of times the term must appear in the document); if the value is in [0,1), it specifies a fraction of the document's token count.
Note that the parameter is only used in transform of CountVectorizerModel and does not affect fitting.
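The thresholding rule described above can be sketched in plain C#. This is a minimal illustration under stated assumptions, not the library's implementation: `MinTFSketch` and `FilterByMinTF` are hypothetical names that are not part of Microsoft.Spark, the sketch handles a single tokenized document, and it ignores the model's vocabulary.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical helper, not part of Microsoft.Spark. It only mirrors the
// thresholding rule described above for one tokenized document.
public static class MinTFSketch
{
    public static Dictionary<string, int> FilterByMinTF(
        IReadOnlyList<string> tokens, double minTF)
    {
        // Count how often each term occurs in this document.
        Dictionary<string, int> counts = tokens
            .GroupBy(t => t)
            .ToDictionary(g => g.Key, g => g.Count());

        // A value >= 1 is treated as an absolute count; a value in [0, 1)
        // is treated as a fraction of the document's token count.
        double threshold = minTF >= 1.0 ? minTF : minTF * tokens.Count;

        // Keep only the terms that meet the threshold.
        return counts
            .Where(kv => kv.Value >= threshold)
            .ToDictionary(kv => kv.Key, kv => kv.Value);
    }

    public static void Main()
    {
        // Six tokens: "a" appears 3 times, "b" twice, "c" once.
        var doc = new[] { "a", "a", "a", "b", "b", "c" };

        // Count threshold of 2: "a" and "b" are kept, "c" is dropped.
        Console.WriteLine(FilterByMinTF(doc, 2.0).Count);

        // Fraction 0.5 of 6 tokens gives a threshold of 3: only "a" is kept.
        Console.WriteLine(FilterByMinTF(doc, 0.5).Count);
    }
}
```

In the sketch, a value of 0.0 keeps every term, because every count is at least zero.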
public Microsoft.Spark.ML.Feature.CountVectorizerModel SetMinTF (double value);
member this.SetMinTF : double -> Microsoft.Spark.ML.Feature.CountVectorizerModel
Public Function SetMinTF (value As Double) As CountVectorizerModel
Parameters
- value
- Double
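The minimum term frequency: a count if the value is greater than or equal to 1, or a fraction of the document's token count if the value is in [0,1).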
Returns
A new CountVectorizerModel with the minimum term frequency set.