|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectnet.java.sen.StringTagger
public class StringTagger
Tokenizes strings
See examples.StringTaggerDemo in the Sen source for an example of how to use this class
Thread Safety: Objects of this class are NOT thread safe and
should not be accessed simultaneously by multiple threads. Note that creating
additional instances using SenFactory is relatively cheap in both
memory and time
| Constructor Summary | |
|---|---|
StringTagger(Tokenizer tokenizer)
|
|
| Method Summary | |
|---|---|
void |
addFilter(StreamFilter filter)
Add a StreamFilter to be applied during analysis |
List<Token> |
analyze(char[] surface)
Deprecated. use analyze(char[], List) instead. |
List<Token> |
analyze(char[] surface,
List<Token> reuse)
Decompose a string into its most likely constituent morphemes |
List<Token> |
analyze(String surface)
Deprecated. use analyze(String, List) instead. |
List<Token> |
analyze(String surface,
List<Token> reuse)
Decompose a string into its most likely constituent morphemes |
void |
removeFilters()
Remove all current StreamFilters |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public StringTagger(Tokenizer tokenizer)
tokenizer - The Tokenizer to use for analysis| Method Detail |
|---|
public void addFilter(StreamFilter filter)
StreamFilter to be applied during analysis
filter - The StreamFilter to addpublic void removeFilters()
StreamFilters
public List<Token> analyze(String surface,
List<Token> reuse)
throws IOException
surface - The string to analyse
Tokens representing the most likely morphemes
IOException
@Deprecated
public List<Token> analyze(String surface)
throws IOException
analyze(String, List) instead.
IOException
public List<Token> analyze(char[] surface,
List<Token> reuse)
throws IOException
surface - The string to analyse
Tokens representing the most likely morphemes
IOException
@Deprecated
public List<Token> analyze(char[] surface)
throws IOException
analyze(char[], List) instead.
IOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||