Class HTMLUnicodeEntitiesParser


  • public class HTMLUnicodeEntitiesParser
    extends Object
    Parser for the predefined named HTML entities. See 12.2.5.72 Character reference state.

    From the spec:
    Consume the maximum number of characters possible, with the consumed characters matching one of the identifiers in the first column of the named character references table (in a case-sensitive manner). Append each character to the temporary buffer when it's consumed.
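
    The longest-match rule quoted above can be illustrated with a minimal sketch. This is not the implementation of this class: LongestEntityMatchSketch, longestMatch and the six-entry NAMES set are hypothetical names chosen for illustration, and the real table of named character references is far larger.

```java
import java.util.Set;

// Hypothetical sketch of the "consume the maximum number of characters" rule.
// It is not the code of HTMLUnicodeEntitiesParser.
final class LongestEntityMatchSketch {

    // Tiny illustrative subset of the named character references table.
    private static final Set<String> NAMES =
            Set.of("amp;", "amp", "lt;", "lt", "not", "notin;");

    /**
     * Returns the longest identifier from NAMES that the input starts with
     * (case-sensitive), or null if there is none.
     */
    static String longestMatch(String input) {
        String best = null;
        for (int end = 1; end <= input.length(); end++) {
            String candidate = input.substring(0, end);

            // Remember every full match; a later, longer one replaces it.
            if (NAMES.contains(candidate)) {
                best = candidate;
            }

            // Stop as soon as no identifier can start with what was consumed.
            boolean stillPossible = false;
            for (String name : NAMES) {
                if (name.startsWith(candidate)) {
                    stillPossible = true;
                    break;
                }
            }
            if (!stillPossible) {
                break;
            }
        }
        return best;
    }
}
```

    In this sketch the input "notit;" yields "not": the scan reads on as far as "notit" while looking for "notin;", then falls back to the last full match. A caller would have to push back the characters read past that match.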

    Author:
    Ronald Brill
    • Constructor Detail

      • HTMLUnicodeEntitiesParser

        public HTMLUnicodeEntitiesParser()
    • Method Detail

      • getMatch

        public String getMatch()
      • getRewindCount

        public int getRewindCount()
      • setMatchFromCode

        public void setMatchFromCode()
      • parseNumeric

        public boolean parseNumeric(int current)
        Parses a numeric entity such as #x64; or #42;. The leading ampersand must not be passed in.
        Parameters:
        current - the next character to check
        Returns:
        whether parsing has reached its end
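
        To make the character-by-character contract concrete, here is a minimal, self-contained sketch of numeric entity parsing. It is not the source of this class: NumericEntitySketch, feed and decode are hypothetical names. The convention that feed returns true while more input is wanted is an assumption of the sketch and may differ from the return value of parseNumeric.

```java
// Hypothetical sketch of a numeric character reference parser that is fed
// one character at a time, without the leading ampersand.
// It is not the code of HTMLUnicodeEntitiesParser.
final class NumericEntitySketch {

    private enum State { START, AFTER_HASH, HEX, DECIMAL, DONE, FAILED }

    private State state = State.START;
    private int code = 0;
    private int digits = 0;

    /** Feeds one character; returns true while more input is wanted (sketch convention). */
    boolean feed(int current) {
        switch (state) {
            case START:
                state = (current == '#') ? State.AFTER_HASH : State.FAILED;
                break;
            case AFTER_HASH:
                if (current == 'x' || current == 'X') {
                    state = State.HEX;
                } else {
                    // No hex marker: treat this character as the first decimal digit.
                    state = State.DECIMAL;
                    return feed(current);
                }
                break;
            case HEX:
            case DECIMAL: {
                int radix = (state == State.HEX) ? 16 : 10;
                // Note: Character.digit also accepts non-ASCII digits;
                // a stricter parser would restrict this to ASCII.
                int digit = Character.digit(current, radix);
                if (digit >= 0) {
                    // Clamp just above the Unicode range so the int cannot overflow.
                    code = Math.min(code * radix + digit, 0x110000);
                    digits++;
                } else {
                    state = (current == ';' && digits > 0) ? State.DONE : State.FAILED;
                }
                break;
            }
            default:
                break;
        }
        return state != State.DONE && state != State.FAILED;
    }

    /** Returns the decoded character(s), or null if no valid entity was parsed. */
    String getMatch() {
        if (state != State.DONE || code > Character.MAX_CODE_POINT) {
            return null;
        }
        return new String(Character.toChars(code));
    }

    /** Convenience helper: feeds a whole entity body such as "#x64;". */
    static String decode(String body) {
        NumericEntitySketch parser = new NumericEntitySketch();
        for (int i = 0; i < body.length(); i++) {
            if (!parser.feed(body.charAt(i))) {
                break;
            }
        }
        return parser.getMatch();
    }
}
```

        With this sketch, decode("#x64;") yields "d" (hexadecimal 64 is code point 100) and decode("#42;") yields "*". An input without the terminating semicolon yields null here; the real parser may be more lenient.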