| Package | Description |
|---|---|
| com.univocity.api.entity.html | |
| com.univocity.api.entity.html.builders |
| Modifier and Type | Method and Description |
|---|---|
HtmlElement |
HtmlParsingContext.currentElement()
Returns the element that the parser is currently visiting.
|
HtmlElement |
HtmlPaginationContext.getCurrentPageRoot()
Returns the root element of the HTML tree being processed by the parser.
|
HtmlElement |
FetchOutput.getTreeRoot()
Returns the root
HtmlElement of the new HTML structure. |
HtmlElement |
HtmlElement.nextSibling()
Returns the
HtmlElement that is located just after this element. |
HtmlElement |
HtmlParsingContext.pageRoot()
Returns the root element of the HTML tree being processed by the parser.
|
HtmlElement |
HtmlElement.parent()
Returns the parent of this Element.
|
HtmlElement |
HtmlTreeParser.parseTree(File file)
Generates a HTML tree from the input made available by a
File. |
static HtmlElement |
HtmlParser.parseTree(File file)
Generates a DOM tree from the input made available by a
File. |
HtmlElement |
HtmlTreeParser.parseTree(File file,
Charset encoding)
Generates a HTML tree from the input made available by a
File. |
static HtmlElement |
HtmlParser.parseTree(File file,
Charset encoding)
Generates a DOM tree from the input made available by a
File. |
HtmlElement |
HtmlTreeParser.parseTree(com.univocity.api.io.FileProvider fileProvider)
Generates a HTML tree from the input made available by a
FileProvider. |
static HtmlElement |
HtmlParser.parseTree(com.univocity.api.io.FileProvider fileProvider)
Generates a DOM tree from the input made available by a
FileProvider. |
HtmlElement |
HtmlTreeParser.parseTree(File file,
String encoding)
Generates a HTML tree from the input made available by a
File. |
static HtmlElement |
HtmlParser.parseTree(File file,
String encoding)
Generates a DOM tree from the input made available by a
File. |
HtmlElement |
HtmlTreeParser.parseTree(InputStream inputStream)
Generates a HTML tree from the input made available by a
InputStream. |
static HtmlElement |
HtmlParser.parseTree(InputStream inputStream)
Generates a DOM tree from the input made available by a
InputStream. |
HtmlElement |
HtmlTreeParser.parseTree(InputStream inputStream,
Charset encoding)
Generates a HTML tree from the input made available by a
InputStream. |
static HtmlElement |
HtmlParser.parseTree(InputStream inputStream,
Charset encoding)
Generates a DOM tree from the input made available by a
InputStream. |
HtmlElement |
HtmlTreeParser.parseTree(InputStream inputStream,
String encoding)
Generates a HTML tree from the input made available by a
InputStream. |
static HtmlElement |
HtmlParser.parseTree(InputStream inputStream,
String encoding)
Generates a DOM tree from the input made available by a
InputStream. |
HtmlElement |
HtmlTreeParser.parseTree(Reader reader)
Generates a HTML tree from the input made available by a
Reader. |
static HtmlElement |
HtmlParser.parseTree(Reader reader)
Generates a DOM tree from the input made available by a
Reader. |
HtmlElement |
HtmlTreeParser.parseTree(com.univocity.api.io.ReaderProvider readerProvider)
Generates a HTML tree from the input made available by a
ReaderProvider. |
static HtmlElement |
HtmlParser.parseTree(com.univocity.api.io.ReaderProvider readerProvider)
Generates a DOM tree from the input made available by a
ReaderProvider. |
HtmlElement |
HtmlElement.previousSibling()
Returns the
HtmlElement that is located just before this element. |
HtmlElement |
DownloadContext.sourceElement()
Returns the specific
HtmlElement of the HTML that has a reference to the resource being downloaded. |
| Modifier and Type | Method and Description |
|---|---|
List<HtmlElement> |
HtmlElement.children()
Returns a copy of all children of this element in an array.
|
List<HtmlElement> |
HtmlElement.query(String cssQuery)
Searches for elements that match a CSS query, with the current
HtmlElement as the starting context. |
| Modifier and Type | Method and Description |
|---|---|
boolean |
HtmlElement.containsElementInHierarchy(HtmlElement element)
Returns
true if the specified element is a descendant of the current element. |
void |
HtmlParserListener.elementMatched(HtmlElement element,
HtmlParsingContext context)
A method that runs when a HTML element is matched based on the path set in the creation a field in the corresponding
HtmlEntitySettings |
void |
HtmlParserListener.elementVisited(HtmlElement element,
HtmlParsingContext context)
A method that runs every time the
HtmlParser visits a HTML element on a HTML document. |
boolean |
HtmlElementMatcher.match(HtmlElement lastMatchedElement,
HtmlElement currentElement)
Used to determine if the currentElement should be matched by the parser.
|
com.univocity.parsers.common.Results<HtmlParserResult> |
HtmlParser.parse(HtmlElement htmlTree)
Given a
HtmlElement, parses all records of all entities defined in the EntityList of this parser, submitting them to the Processor implementation associated with each entity (through EntitySettings.setProcessor(Processor). |
com.univocity.parsers.common.Results<HtmlParserResult> |
HtmlParserInterface.parse(HtmlElement htmlTree)
Given a
HtmlElement, parses all records of all entities defined in the EntityList of this parser, and returns them in a map. |
| Constructor and Description |
|---|
FetchOutput(HtmlElement treeRoot,
File treeHtmlFile,
Map<File,URL> resourceMap)
Creates a new
FetchOutput with the results obtained from a call to fetchResources(com.univocity.api.io.FileProvider, com.univocity.api.entity.html.FetchOptions) |
| Modifier and Type | Method and Description |
|---|---|
HtmlElement |
ElementContentReader.getElement()
Gets the first matching
HtmlElement when applying the previously defined matching rules. |
| Modifier and Type | Method and Description |
|---|---|
List<HtmlElement> |
ElementContentReader.getElements()
Gets all matching
HtmlElements when applying the previously defined matching rules. |
Copyright © 2018 uniVocity Software Pty Ltd. All rights reserved.