public final class CustomAnalyzer extends LexicalAnalyzer
| Constructor and Description |
|---|
CustomAnalyzer(String name,
LexicalTokenizerName tokenizerName)
Constructor of
LexicalAnalyzer. |
| Modifier and Type | Method and Description |
|---|---|
List<CharFilterName> |
getCharFilters()
Get the charFilters property: A list of character filters used to
prepare input text before it is processed by the tokenizer.
|
List<TokenFilterName> |
getTokenFilters()
Get the tokenFilters property: A list of token filters used to filter
out or modify the tokens generated by a tokenizer.
|
LexicalTokenizerName |
getTokenizer()
Get the tokenizer property: The name of the tokenizer to use to divide
continuous text into a sequence of tokens, such as breaking a sentence
into words.
|
CustomAnalyzer |
setCharFilters(CharFilterName... charFilters)
Set the charFilters property: A list of character filters used to
prepare input text before it is processed by the tokenizer.
|
CustomAnalyzer |
setCharFilters(List<CharFilterName> charFilters)
Set the charFilters property: A list of character filters used to
prepare input text before it is processed by the tokenizer.
|
CustomAnalyzer |
setTokenFilters(List<TokenFilterName> tokenFilters)
Set the tokenFilters property: A list of token filters used to filter
out or modify the tokens generated by a tokenizer.
|
CustomAnalyzer |
setTokenFilters(TokenFilterName... tokenFilters)
Set the tokenFilters property: A list of token filters used to filter
out or modify the tokens generated by a tokenizer.
|
getNamepublic CustomAnalyzer(String name, LexicalTokenizerName tokenizerName)
LexicalAnalyzer.name - The name of the analyzer. It must only contain letters, digits, spaces,
dashes or underscores, can only start and end with alphanumeric
characters, and is limited to 128 characters.tokenizerName - The name of the tokenizer to use to divide continuous text into a
sequence of tokens, such as breaking a sentence into words. Possible
values include: 'Classic', 'EdgeNGram', 'Keyword', 'Letter',
'Lowercase', 'MicrosoftLanguageTokenizer',
'MicrosoftLanguageStemmingTokenizer', 'NGram', 'PathHierarchy',
'Pattern', 'Standard', 'UaxUrlEmail', 'Whitespace'public LexicalTokenizerName getTokenizer()
public List<TokenFilterName> getTokenFilters()
public CustomAnalyzer setTokenFilters(TokenFilterName... tokenFilters)
tokenFilters - the tokenFilters value to set.public CustomAnalyzer setTokenFilters(List<TokenFilterName> tokenFilters)
tokenFilters - the tokenFilters value to set.public List<CharFilterName> getCharFilters()
public CustomAnalyzer setCharFilters(CharFilterName... charFilters)
charFilters - the charFilters value to set.public CustomAnalyzer setCharFilters(List<CharFilterName> charFilters)
charFilters - the charFilters value to set.Copyright © 2020 Microsoft Corporation. All rights reserved.