Class ParagraphAlignerStep

  • All Implemented Interfaces:
    IPipelineStep

    public class ParagraphAlignerStep
    extends BasePipelineStep
    Aligns paragraphs (TextUnits) between a source and a target document, using inter-paragraph formatting and other heuristics. TextUnits from this step can be sent to the SentenceAlignerStep for more fine-grained alignment. TextUnits should not be segmented.
    Author:
    HARGRAVEJE
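The description above does not say which heuristics the step applies. As a rough illustration of what paragraph-level alignment involves, the following is a minimal, self-contained sketch that pairs paragraphs by relative character length using dynamic programming. It is not the Okapi implementation: the class name `ParagraphAlignSketch`, the `SKIP_COST` value, and the length-based cost are all assumptions made for this example, and the inter-paragraph formatting cues mentioned above are not modeled.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: not part of Okapi and not the logic used by ParagraphAlignerStep.
class ParagraphAlignSketch {

    // Assumed penalty for leaving a paragraph without a counterpart.
    static final double SKIP_COST = 0.6;

    // Cost of pairing two paragraphs: relative difference in character length (0.0 to 1.0).
    static double matchCost(String s, String t) {
        int max = Math.max(s.length(), t.length());
        return max == 0 ? 0.0 : Math.abs(s.length() - t.length()) / (double) max;
    }

    // Returns {sourceIndex, targetIndex} pairs in document order; -1 marks an unaligned paragraph.
    static List<int[]> align(List<String> src, List<String> trg) {
        int n = src.size();
        int m = trg.size();

        // cost[i][j] = cheapest alignment of the first i source and first j target paragraphs.
        double[][] cost = new double[n + 1][m + 1];
        for (int i = 1; i <= n; i++) cost[i][0] = cost[i - 1][0] + SKIP_COST;
        for (int j = 1; j <= m; j++) cost[0][j] = cost[0][j - 1] + SKIP_COST;
        for (int i = 1; i <= n; i++) {
            for (int j = 1; j <= m; j++) {
                double match = cost[i - 1][j - 1] + matchCost(src.get(i - 1), trg.get(j - 1));
                double skipSrc = cost[i - 1][j] + SKIP_COST;
                double skipTrg = cost[i][j - 1] + SKIP_COST;
                cost[i][j] = Math.min(match, Math.min(skipSrc, skipTrg));
            }
        }

        // Walk back from the end, re-deriving which move produced each cell.
        List<int[]> pairs = new ArrayList<>();
        int i = n;
        int j = m;
        while (i > 0 || j > 0) {
            if (i > 0 && j > 0
                    && cost[i][j] == cost[i - 1][j - 1] + matchCost(src.get(i - 1), trg.get(j - 1))) {
                pairs.add(0, new int[] {i - 1, j - 1}); // one-to-one match
                i--;
                j--;
            } else if (i > 0 && (j == 0 || cost[i][j] == cost[i - 1][j] + SKIP_COST)) {
                pairs.add(0, new int[] {i - 1, -1});    // source paragraph left unaligned
                i--;
            } else {
                pairs.add(0, new int[] {-1, j - 1});    // target paragraph left unaligned
                j--;
            }
        }
        return pairs;
    }
}
```

For example, aligning the source paragraphs `"aaaaa"` and `"bbbbbbbbbbbbbbbbbbbb"` against the target paragraphs `"aaaaa"`, `"x"` and `"bbbbbbbbbbbbbbbbbbbb"` yields the pairs (0, 0), (-1, 1), (1, 2): the extra one-character target paragraph is left unaligned rather than forced onto a much longer source paragraph. Whole, unsegmented paragraphs are the unit here, which matches the note that sentence-level alignment is left to a later step.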