Share via


CountVectorizer.SetVocabSize(Int32) Method

Definition

Sets the max size of the vocabulary. CountVectorizer will build a vocabulary that only considers the top vocabSize terms ordered by term frequency across the corpus.

public Microsoft.Spark.ML.Feature.CountVectorizer SetVocabSize (int value);
member this.SetVocabSize : int -> Microsoft.Spark.ML.Feature.CountVectorizer
Public Function SetVocabSize (value As Integer) As CountVectorizer

Parameters

value
Int32

The max vocabulary size

Returns

CountVectorizer with the max vocab value set

Applies to