public class TokenizationStep extends BasePipelineStep
| Constructor and Description |
|---|
TokenizationStep() |
| Modifier and Type | Method and Description |
|---|---|
List<Token> |
apostrophe(Token token,
LocaleId locale)
Break French and Italian words with apostrophe into three tokens WORD, PUNCTUATION, WORD
|
String |
getDescription()
Gets a short localizable description of what this step does.
|
String |
getName()
Gets the localizable name of this step.
|
LocaleId |
getSourceLocale()
Delegate to concrete class
|
LocaleId |
getTargetLocale()
Delegate to concrete class
|
protected Event |
handleStartDocument(Event event)
Handles the
EventType.START_DOCUMENT event. |
protected Event |
handleTextUnit(Event event)
Handles the
EventType.TEXT_UNIT event. |
Collection<? extends Token> |
postProcess(Token t,
LocaleId language)
Various rules to make corrections to
RbbiTokenizer |
void |
setSourceLocale(LocaleId sourceLocale)
Delegate to concrete class
|
void |
setTargetLocale(LocaleId targetLocale) |
cancel, destroy, getHelpLocation, getParameters, handleCustom, handleDocumentPart, handleEndBatch, handleEndBatchItem, handleEndDocument, handleEndGroup, handleEndSubDocument, handleEndSubfilter, handleEvent, handleMultiEvent, handlePipelineParameters, handleRawDocument, handleStartBatch, handleStartBatchItem, handleStartGroup, handleStartSubDocument, handleStartSubfilter, isDone, isLastOutputStep, setLastOutputStep, setParametersprotected Event handleStartDocument(Event event)
BasePipelineStepEventType.START_DOCUMENT event.handleStartDocument in class BasePipelineStepevent - event to handle.protected Event handleTextUnit(Event event)
BasePipelineStepEventType.TEXT_UNIT event.handleTextUnit in class BasePipelineStepevent - event to handle.public LocaleId getSourceLocale()
IPipelineStepgetSourceLocale in interface IPipelineStepgetSourceLocale in class BasePipelineSteppublic void setSourceLocale(LocaleId sourceLocale)
IPipelineStepsetSourceLocale in interface IPipelineStepsetSourceLocale in class BasePipelineSteppublic LocaleId getTargetLocale()
IPipelineStepgetTargetLocale in interface IPipelineStepgetTargetLocale in class BasePipelineSteppublic void setTargetLocale(LocaleId targetLocale)
setTargetLocale in interface IPipelineStepsetTargetLocale in class BasePipelineSteppublic Collection<? extends Token> postProcess(Token t, LocaleId language)
RbbiTokenizert - the Tokenpublic List<Token> apostrophe(Token token, LocaleId locale)
token - public String getName()
IPipelineSteppublic String getDescription()
IPipelineStepCopyright © 2021. All rights reserved.