TokenFilterName Class

Reference

Package:: com.azure.search.documents.indexes.models

Maven Artifact:: com.azure:azure-search-documents:11.6.4

java.lang.Object
- com.azure.core.util.ExpandableStringEnum<T>
- - com.azure.search.documents.indexes.models.TokenFilterName

public final class TokenFilterName
extends ExpandableStringEnum<TokenFilterName>

Defines the names of all token filters supported by the search engine.

Field Summary

Modifier and Type	Field and Description
static final TokenFilterName	APOSTROPHE Strips all characters after an apostrophe (including the apostrophe itself).
static final TokenFilterName	ARABIC_NORMALIZATION A token filter that applies the Arabic normalizer to normalize the orthography.
static final TokenFilterName	ASCII_FOLDING Converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if such equivalents exist.
static final TokenFilterName	CJK_BIGRAM Forms bigrams of CJK terms that are generated from the standard tokenizer.
static final TokenFilterName	CJK_WIDTH Normalizes CJK width differences.
static final TokenFilterName	CLASSIC Removes English possessives, and dots from acronyms.
static final TokenFilterName	COMMON_GRAM Construct bigrams for frequently occurring terms while indexing.
static final TokenFilterName	EDGE_NGRAM Generates n-grams of the given size(s) starting from the front or the back of an input token.
static final TokenFilterName	ELISION Removes elisions.
static final TokenFilterName	GERMAN_NORMALIZATION Normalizes German characters according to the heuristics of the German2 snowball algorithm.
static final TokenFilterName	HINDI_NORMALIZATION Normalizes text in Hindi to remove some differences in spelling variations.
static final TokenFilterName	INDIC_NORMALIZATION Normalizes the Unicode representation of text in Indian languages.
static final TokenFilterName	KEYWORD_REPEAT Emits each incoming token twice, once as keyword and once as non-keyword.
static final TokenFilterName	KSTEM A high-performance kstem filter for English.
static final TokenFilterName	LENGTH Removes words that are too long or too short.
static final TokenFilterName	LIMIT Limits the number of tokens while indexing.
static final TokenFilterName	LOWERCASE Normalizes token text to lower case.
static final TokenFilterName	NGRAM Generates n-grams of the given size(s).
static final TokenFilterName	PERSIAN_NORMALIZATION Applies normalization for Persian.
static final TokenFilterName	PHONETIC Create tokens for phonetic matches.
static final TokenFilterName	PORTER_STEM Uses the Porter stemming algorithm to transform the token stream.
static final TokenFilterName	REVERSE Reverses the token string.
static final TokenFilterName	SCANDINAVIAN_FOLDING_NORMALIZATION Folds Scandinavian characters ��->a and ��->o.
static final TokenFilterName	SCANDINAVIAN_NORMALIZATION Normalizes use of the interchangeable Scandinavian characters.
static final TokenFilterName	SHINGLE Creates combinations of tokens as a single token.
static final TokenFilterName	SNOWBALL A filter that stems words using a Snowball-generated stemmer.
static final TokenFilterName	SORANI_NORMALIZATION Normalizes the Unicode representation of Sorani text.
static final TokenFilterName	STEMMER Language specific stemming filter.
static final TokenFilterName	STOPWORDS Removes stop words from a token stream.
static final TokenFilterName	TRIM Trims leading and trailing whitespace from tokens.
static final TokenFilterName	TRUNCATE Truncates the terms to a specific length.
static final TokenFilterName	UNIQUE Filters out tokens with same text as the previous token.
static final TokenFilterName	UPPERCASE Normalizes token text to upper case.
static final TokenFilterName	WORD_DELIMITER Splits words into subwords and performs optional transformations on subword groups.

Constructor Summary

Constructor	Description
TokenFilterName()	Deprecated Use the fromString(String name) factory method. Creates a new instance of TokenFilterName value.

Constructor

Description

TokenFilterName()

Deprecated

Use the fromString(String name) factory method.

Creates a new instance of TokenFilterName value.

Method Summary

Modifier and Type	Method and Description
static TokenFilterName	fromString(String name) Creates or finds a TokenFilterName from its string representation.
static Collection<TokenFilterName>	values() Gets known TokenFilterName values.

Methods inherited from ExpandableStringEnum

<T>fromString <T>values equals hashCode toString

Methods inherited from java.lang.Object

clone finalize getClass notify notifyAll wait wait wait

Field Details

APOSTROPHE

public static final TokenFilterName APOSTROPHE

Strips all characters after an apostrophe (including the apostrophe itself). See http://lucene.apache.org/core/4\_10\_3/analyzers-common/org/apache/lucene/analysis/tr/ApostropheFilter.html.

ARABIC_NORMALIZATION

public static final TokenFilterName ARABIC_NORMALIZATION

A token filter that applies the Arabic normalizer to normalize the orthography. See http://lucene.apache.org/core/4\_10\_3/analyzers-common/org/apache/lucene/analysis/ar/ArabicNormalizationFilter.html.

ASCII_FOLDING

public static final TokenFilterName ASCII_FOLDING

Converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if such equivalents exist. See http://lucene.apache.org/core/4\_10\_3/analyzers-common/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.html.

CJK_BIGRAM

public static final TokenFilterName CJK_BIGRAM

Forms bigrams of CJK terms that are generated from the standard tokenizer. See http://lucene.apache.org/core/4\_10\_3/analyzers-common/org/apache/lucene/analysis/cjk/CJKBigramFilter.html.

CJK_WIDTH

public static final TokenFilterName CJK_WIDTH

Normalizes CJK width differences. Folds fullwidth ASCII variants into the equivalent basic Latin, and half-width Katakana variants into the equivalent Kana. See http://lucene.apache.org/core/4\_10\_3/analyzers-common/org/apache/lucene/analysis/cjk/CJKWidthFilter.html.