| Class | Description |
|---|---|
| CopyAnnotationsAnnotator |
Copies specified annotations into other specified annotations, then removes
the former.
|
| CopyAnnotationsAnnotator2 |
Copies specified annotations into other specified annotations, then removes
the former.
|
| DeduplicatorAnnotator |
Removes duplicates annotations that have same Class, same begin and same end.
|
| DotSentenceSplitterAnnotator |
Splits an input text into
Sentences at each dot. |
| EnsureDocHasHeader |
Ensures that all
CASes have a Header annotation, sets it to a
random value otherwise otherwise. |
| EnsureDocHasOneSentence |
Ensures that the jCas has at lease one
Sentence annotation, sets it
to the whole text otherwise. |
| EnsureDocTextNotNullAnnotator |
Ensures that the text of a CAS is not null, sets it to "" otherwise
|
| EnsureTokensHaveLemmaAndPOS |
Checks that each
Token has a POS and a lemma, sets otherwise the POS
to a "null" string, resp. |
| EntityNormalizerAnnotator |
Sets
Keep#setNormalizedText() for each Keep which encloses an
entity (recognized by a ConceptMapper) to the canonical form of the entity. |
| EvaluationAnnotator |
Evaluates precision, recall and f-score, comparing annotations in Gold sofa
with the annotions in normal view.
|
| EvaluationPreprocessorAnnotator |
Copies specified annotation to view_gold (later used by
EvaluationAnnotator) and removes it from the _InitialView. |
| GarbageCollectorAnnotator |
"Forces" to run the garbage collection every N document.
|
| GeneralEnglishAnnotator |
Annotates general English words.
|
| GridSearchConfiguration |
Utility/hack to perform in-place (statically configured) parameter
configuration for grid-search optimization.
|
| KeepLargestAnnotationAnnotator |
Prunes/dedupes overlapping Annotations, keeps the largest one.
|
| LucasHelperAnnotator |
Adds e.g.
|
| MultipleProteinsAnnotator |
Adds a
MultipleProteins annotations if there is more that one
Protein in this document |
| NaiveSentenceSplitterAnnotator |
Sentence splitter that outputs the content of the whole CAS text as one
single
Sentence. |
| NewlineSentenceSplitterAnnotator |
Splits an input text into
Sentences at each new line. |
| PunctuationAnnotator |
Annotates Tokens consisting (exclusively) of punctuation chars.
|
| RegexTokenizerAnnotator |
Tokenizer that can be configured with a regex
Pattern. |
| RemoveAnnotationsAnnotator |
Removes specific annotation from the
CASes |
| SkipSomePosAnnotator |
Annotate (flags) tokens that have an "uninteresting" POS, like V.., W.., DT
or TO, IN, ...
|
| SnowballAnnotator |
Snowball stemmer annotator for
Tokens |
| StaticConfiguration |
Utility to perform in-place (statically configured) parameter configuration
for optimization.
Parameters are set like this: paramName{space}paramType{space}value1{space}value2...Examples: o1: myparam1 bool true
o2: myparam2 int 3
o3: myparam2 dble 3.14
o4: myparam3 string asdf
Then, in your java code you can retreive the vales using the StaticOption.getXXX methods |
| StatsAnnotatorPlus |
Logs statistics about the progress of the pipeline to the console.
|
| TestAnnotator |
Tests the number of annotations on each CAS.
|
| WhitespaceTokenizerAnnotator |
Simple whitespace tokenizer.
|
Copyright © 2015 Bluebrain Project. All rights reserved.