| Interface | Description |
|---|---|
| DownloadContext | Provides all information available during a “fetch resources” operation to a DownloadHandler callback, and allows the user to skip the download of certain file types. |
| DownloadHandler | A callback interface for downloads handled by a “fetch resources” operation, which downloads all resources referenced by a given HTML input. |
| HtmlElement | An HtmlElement contains information about HTML elements collected by the parser. |
| HtmlElementMatcher | A custom matcher to be triggered each time the parser visits an HtmlElement. |
| HtmlElementTransformation | A transformation operation over an HtmlElement that results in a String. |
| HtmlPaginationContext | Contains information about the pagination process managed by an HtmlPaginator and made available to the user through the NextInputHandler callback. |
| HtmlParserInterface | An extension of the EntityParserInterface to include all operations specific to the HtmlParser. |
| HtmlParserResult | Stores the results of parsing some input with the HtmlParser. |
| HtmlParsingContext | A class that returns information about HtmlParser’s parsing process. |
| HtmlRecord | An HtmlRecord is a record that can be linked with one or more HtmlRecords. |
| HtmlTreeParser | A parser that generates a tree of HtmlElements. |

| Class | Description |
|---|---|
| FetchOptions | Configuration class for use in the HtmlElement.fetchResources(com.univocity.api.io.FileProvider, com.univocity.api.entity.html.FetchOptions) methods. Setters return this instance to enable method chaining during initialization. |
| FetchOutput | The output produced by the HtmlElement.fetchResources(com.univocity.api.io.FileProvider, com.univocity.api.entity.html.FetchOptions) methods. |
| HtmlEntityList | Manages a list of HTML entities and their HtmlEntitySettings. |
| HtmlEntitySettings | An HtmlEntitySettings object manages the configuration of an HTML entity. |
| HtmlLinkFollower | Allows the addition of fields that the HtmlParser uses to parse and return information from a linked page. |
| HtmlPaginator | Used by the HtmlParser to collect multiple pages of results in a website and to handle the files that have been downloaded for each page. |
| HtmlParser | A very fast HTML parser. |
| HtmlParserListener | An abstract class used by the HtmlParser to provide information about events that occur during the parsing process. |
| HtmlParserSettings | Configuration class for the HtmlParser. |
| Loggers | Loggers used by the HTML parser. |
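
The FetchOptions entry notes that setters return the instance itself so that calls can be chained during initialization. The sketch below illustrates that general pattern only. `FluentOptionsSketch` and its methods (`flattenDirectories`, `skipExtension`) are hypothetical names invented for this example; they are not part of the actual FetchOptions API, whose real setters should be looked up in its own documentation.

```java
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in used only to illustrate chained setters.
// This is NOT the real com.univocity.api.entity.html.FetchOptions class.
public class FluentOptionsSketch {
    private boolean flattenDirectories;
    private final Set<String> skippedExtensions = new LinkedHashSet<>();

    // Each setter updates this instance and returns it, which allows chaining.
    public FluentOptionsSketch flattenDirectories(boolean flatten) {
        this.flattenDirectories = flatten;
        return this;
    }

    // Records a file extension (stored in lower case) and returns this instance.
    public FluentOptionsSketch skipExtension(String extension) {
        skippedExtensions.add(extension.toLowerCase(Locale.ROOT));
        return this;
    }

    public boolean isFlattenDirectories() {
        return flattenDirectories;
    }

    public Set<String> getSkippedExtensions() {
        return skippedExtensions;
    }

    public static void main(String[] args) {
        // All configuration happens in one chained expression.
        FluentOptionsSketch options = new FluentOptionsSketch()
                .flattenDirectories(true)
                .skipExtension("PNG")
                .skipExtension("css");
        System.out.println(options.isFlattenDirectories() + " " + options.getSkippedExtensions());
    }
}
```

Returning `this` from each setter keeps configuration in a single expression, which is convenient when an options object is built inline and passed straight to a method call.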

Copyright © 2018 uniVocity Software Pty Ltd. All rights reserved.