Class StandardEncodingTranslator

    • Field Detail

      • REPLACEMENT

        public static final String REPLACEMENT
        https://encoding.spec.whatwg.org/#replacement

        For the moment we are using this constant in the HTMLScanner to support this encoding. Another option might be to install a proper charset impl in the jvm (like https://github.com/lovasoa/replacement-charset-java) but this might introduce some behavior changes for the whole vm and therefore we handle it here.

        See Also:
        Constant Field Values
      • IANA_TO_JAVA_ENCODINGS

        public static final Map<String,​String> IANA_TO_JAVA_ENCODINGS
    • Method Detail

      • encodingNameFromLabel

        public String encodingNameFromLabel​(String charsetLabel)
        Specified by:
        encodingNameFromLabel in interface EncodingTranslator
        Returns:
        the Java encoding name for the specified HTTP encoding name, or null if the name is unknown. The returned encoding name may or may not be supported by the running JVM. Use Charset.isSupported(String) to check availability. Moreover, either of the special definitions REPLACEMENT or "x-user-defined" may be returned depending on input.