Package org.apache.tika.parser.mail
Class RFC822Parser
java.lang.Object
org.apache.tika.parser.mail.RFC822Parser
- All Implemented Interfaces:
Serializable,org.apache.tika.parser.Parser
Uses apache-mime4j to parse emails. Each part is treated with the
corresponding parser and displayed within elements.
A
MimeConfig object can be passed in the parsing context
to better control the parsing process.- Author:
- jnioche@digitalpebble.com
- See Also:
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionSet<org.apache.tika.mime.MediaType>getSupportedTypes(org.apache.tika.parser.ParseContext context) voidparse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context) voidsetExtractAllAlternatives(boolean extractAllAlternatives) Until version 1.17, Tika handled all body parts as embedded objects (see TIKA-2478).
-
Constructor Details
-
RFC822Parser
public RFC822Parser()
-
-
Method Details
-
getSupportedTypes
public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context) - Specified by:
getSupportedTypesin interfaceorg.apache.tika.parser.Parser
-
parse
public void parse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context) throws IOException, SAXException, org.apache.tika.exception.TikaException - Specified by:
parsein interfaceorg.apache.tika.parser.Parser- Throws:
IOExceptionSAXExceptionorg.apache.tika.exception.TikaException
-
setExtractAllAlternatives
public void setExtractAllAlternatives(boolean extractAllAlternatives) Until version 1.17, Tika handled all body parts as embedded objects (see TIKA-2478). In 1.17, we modified the parser to select only the best alternative body parts for multipart/alternative sections, and we inline the content as we do for .msg files.The legacy behavior can be set by setting
extractAllAlternativestotrue. As of 1.17, the default value isfalse- Parameters:
extractAllAlternatives- whether or not to extract all alternative parts- Since:
- 1.17
-