All Classes and Interfaces
Class
Description
This class extends PDFTextStripper to provide custom text extraction and formatting
capabilities for PDF pages.
Groups the parsed PDF pages into
Documents.
The ParagraphManager class is responsible for managing the paragraphs and hierarchy of
a PDF document.
Represents a document paragraph's metadata and hierarchy.
Uses the PDF catalog (e.g.
Common configuration builder for the
PagePdfDocumentReader and the
ParagraphPdfDocumentReader.
Re-implements the PDFLayoutTextStripperByArea on top of the PDFLayoutTextStripper
instead of the original PDFTextStripper.
The PdfReaderRuntimeHints class is responsible for registering runtime hints for PDFBox
resources.