| Modifier and Type | Field and Description |
|---|---|
static NGramTokenizer |
DEFAULT_TOKENIZER |
| Constructor and Description |
|---|
NGramTokenizer(int minNGramSize,
int maxNGramSize,
boolean keepOldTokens,
Tokenizer innerTokenizer) |
| Modifier and Type | Method and Description |
|---|---|
Token |
intern(String s)
Convert a given string into a token.
|
static void |
main(String[] argv)
Test routine
|
int |
maxTokenIndex()
Return the higest index of any interned token
|
Iterator |
tokenIterator()
Return an iterator over interned tokens
|
Token[] |
tokenize(String input)
Return tokenized version of a string.
|
public static NGramTokenizer DEFAULT_TOKENIZER
public NGramTokenizer(int minNGramSize,
int maxNGramSize,
boolean keepOldTokens,
Tokenizer innerTokenizer)
public Token[] tokenize(String input)
public Token intern(String s)
Tokenizerpublic Iterator tokenIterator()
TokenizertokenIterator in interface Tokenizerpublic int maxTokenIndex()
TokenizermaxTokenIndex in interface Tokenizerpublic static void main(String[] argv)
Copyright © 2016. All rights reserved.