public class RegexTokenizerAnnotator
extends org.apache.uima.fit.component.JCasAnnotator_ImplBase
Pattern. Mostly useful for tests.

| Modifier and Type | Field | Description |
|---|---|---|
| static String | PARAM_TOKENIZATION_PATTERN | |
| static String | patterPunctDigitsCamelcase | Splits on any punctuation character, digits, and case change. |
| static String | patterPunctuation | Splits on any punctuation character (see the tests). |
| static String | patterPunctuationNoDash | Splits on any punctuation character except dashes (see the tests). |
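
The field descriptions above can be illustrated with plain java.util.regex. This is only a sketch: the actual values of patterPunctuation, patterPunctuationNoDash and patterPunctDigitsCamelcase are not shown on this page, so the three expressions below are hypothetical stand-ins, and the class name SplitPatternSketch and its split helper are invented for illustration. It also assumes that digits act as separators and are dropped, which the description does not confirm.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

class SplitPatternSketch {

    // Hypothetical stand-in: any ASCII punctuation character is a separator.
    static final Pattern PUNCTUATION = Pattern.compile("\\p{Punct}");

    // Hypothetical stand-in: punctuation except the dash, via class intersection.
    static final Pattern PUNCTUATION_NO_DASH = Pattern.compile("[\\p{Punct}&&[^-]]");

    // Hypothetical stand-in: punctuation, runs of digits, or the zero-width
    // boundary between a lowercase and an uppercase letter.
    static final Pattern PUNCT_DIGITS_CAMELCASE =
            Pattern.compile("\\p{Punct}|\\d+|(?<=\\p{Lower})(?=\\p{Upper})");

    // Splits the text on the given separator pattern and drops empty pieces.
    static List<String> split(Pattern separator, String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : separator.split(text)) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(split(PUNCTUATION, "wild-type,mutant"));         // [wild, type, mutant]
        System.out.println(split(PUNCTUATION_NO_DASH, "wild-type,mutant")); // [wild-type, mutant]
        System.out.println(split(PUNCT_DIGITS_CAMELCASE, "camelCase2go"));  // [camel, Case, go]
    }
}
```

The no-dash variant keeps hyphenated terms such as "wild-type" as one token, which is presumably why a separate constant exists for it.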

| Constructor and Description |
|---|
| RegexTokenizerAnnotator() |

| Modifier and Type | Method and Description |
|---|---|
| void | initialize(org.apache.uima.UimaContext context) |
| void | process(org.apache.uima.jcas.JCas jcas) |

Inherited methods: getRequiredCasInterface, process; getCasInstancesRequired, hasNext, next

public static final String patterPunctDigitsCamelcase
public static final String patterPunctuation
public static final String patterPunctuationNoDash
public static final String PARAM_TOKENIZATION_PATTERN
public void initialize(org.apache.uima.UimaContext context)
throws org.apache.uima.resource.ResourceInitializationException
Specified by: initialize in interface org.apache.uima.analysis_component.AnalysisComponent
Overrides: initialize in class org.apache.uima.fit.component.JCasAnnotator_ImplBase
Throws: org.apache.uima.resource.ResourceInitializationException

public void process(org.apache.uima.jcas.JCas jcas)
throws org.apache.uima.analysis_engine.AnalysisEngineProcessException
Specified by: process in class org.apache.uima.analysis_component.JCasAnnotator_ImplBase
Throws: org.apache.uima.analysis_engine.AnalysisEngineProcessException

Copyright © 2015 Bluebrain Project. All rights reserved.