CjkBigramTokenFilter Class

Forms bigrams of CJK terms that are generated from the standard tokenizer. This token filter is implemented using Apache Lucene.

All required parameters must be populated in order to send to Azure.

Inheritance
azure.search.documents.indexes._generated.models._models_py3.TokenFilter
CjkBigramTokenFilter

Constructor

CjkBigramTokenFilter(*, name: str, ignore_scripts: Optional[List[Union[str, azure.search.documents.indexes._generated.models._search_client_enums.CjkBigramTokenFilterScripts]]] = None, output_unigrams: Optional[bool] = False, **kwargs)

Parameters

odata_type
str
Required

Required. Identifies the concrete type of the token filter.Constant filled by server.

name
str
Required

Required. The name of the token filter. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.

ignore_scripts
list[str or <xref:azure.search.documents.indexes.models.CjkBigramTokenFilterScripts>]
Required

The scripts to ignore.

output_unigrams
bool
Required

A value indicating whether to output both unigrams and bigrams (if true), or just bigrams (if false). Default is false.