All Classes and Interfaces

Class
Description
This class extends PDFTextStripper to provide custom text extraction and formatting capabilities for PDF pages.
Groups the parsed PDF pages into Documents.
The ParagraphManager class is responsible for managing the paragraphs and hierarchy of a PDF document.
Represents a document paragraph metadata and hierarchy.
Uses the PDF catalog (e.g.
Common configuration builder for the PagePdfDocumentReader and the ParagraphPdfDocumentReader.
 
Re-implement the PDFLayoutTextStripperByArea on top of the PDFLayoutTextStripper instead the original PDFTextStripper.
The PdfReaderRuntimeHints class is responsible for registering runtime hints for PDFBox resources.