CountVectorizer.SetVocabSize(Int32) Method
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Sets the max size of the vocabulary. CountVectorizer will build a vocabulary that only considers the top vocabSize terms ordered by term frequency across the corpus.
public Microsoft.Spark.ML.Feature.CountVectorizer SetVocabSize (int value);
member this.SetVocabSize : int -> Microsoft.Spark.ML.Feature.CountVectorizer
Public Function SetVocabSize (value As Integer) As CountVectorizer
Parameters
- value
- Int32
The max vocabulary size
Returns
CountVectorizer with the max vocab value set
Applies to
Feedback
https://aka.ms/ContentUserFeedback.
Coming soon: Throughout 2024 we will be phasing out GitHub Issues as the feedback mechanism for content and replacing it with a new feedback system. For more information see:Submit and view feedback for