CjkBigramTokenFilter Class
Forms bigrams of CJK terms that are generated from the standard tokenizer. This token filter is implemented using Apache Lucene.
All required parameters must be populated in order to send to Azure.
- Inheritance
- azure.search.documents.indexes._generated.models._models_py3.TokenFilter
CjkBigramTokenFilter
Constructor
CjkBigramTokenFilter(*, name: str, ignore_scripts: Optional[List[Union[str, azure.search.documents.indexes._generated.models._search_client_enums.CjkBigramTokenFilterScripts]]] = None, output_unigrams: Optional[bool] = False, **kwargs)
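As a minimal sketch of the constructor in use, the snippet below builds a filter with the keyword arguments shown in the signature above. The filter name "my_cjk_bigram" and the helper build_cjk_filter are hypothetical, chosen only for illustration, and the import is guarded so the snippet still runs where the azure-search-documents package is not installed.

```python
# Sketch only: assumes the azure-search-documents package is available.
try:
    from azure.search.documents.indexes.models import CjkBigramTokenFilter
except ImportError:  # package not installed; keep the sketch importable
    CjkBigramTokenFilter = None


def build_cjk_filter():
    """Hypothetical helper: returns a configured filter, or None without the SDK."""
    if CjkBigramTokenFilter is None:
        return None
    return CjkBigramTokenFilter(
        name="my_cjk_bigram",        # illustrative name; must satisfy the naming rules
        ignore_scripts=["hangul"],   # do not form bigrams for Hangul text
        output_unigrams=True,        # emit unigrams alongside bigrams
    )


token_filter = build_cjk_filter()
if token_filter is not None:
    print(token_filter.name, token_filter.ignore_scripts, token_filter.output_unigrams)
```

The resulting object would typically be listed among the token filters of an index definition and referenced by name from a custom analyzer.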
Parameters
- odata_type
- str
Required. Identifies the concrete type of the token filter. Constant filled by server.
- name
- str
Required. The name of the token filter. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.
- ignore_scripts
- list[str or <xref:azure.search.documents.indexes.models.CjkBigramTokenFilterScripts>]
The scripts to ignore when forming bigrams. Possible values are "han", "hiragana", "katakana" and "hangul".
- output_unigrams
- bool
A value indicating whether to output both unigrams and bigrams (if true), or just bigrams (if false). Default is false.
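To make the effect of output_unigrams concrete, here is a simplified pure-Python illustration of overlapping bigram formation. It is not the Lucene implementation: the function cjk_bigrams is hypothetical, it treats every character of its input as CJK, and it ignores token positions and script detection.

```python
def cjk_bigrams(text, output_unigrams=False):
    """Illustrative only: overlapping two-character pairs over a run of characters."""
    chars = list(text)
    if len(chars) == 1:
        # Assumption: a lone character has no neighbor to pair with,
        # so it is kept as a single-character token.
        return chars
    tokens = []
    for i, ch in enumerate(chars):
        if output_unigrams:
            tokens.append(ch)                 # the single character itself
        if i + 1 < len(chars):
            tokens.append(ch + chars[i + 1])  # the character plus its right neighbor
    return tokens


print(cjk_bigrams("東京都"))
print(cjk_bigrams("東京都", output_unigrams=True))
```

With the default setting only the pairs are produced; with output_unigrams enabled, each individual character is emitted as well, which increases index size but lets single-character queries match.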