Package org.apache.tika.parser.mail
Class RFC822Parser
- java.lang.Object
-
- org.apache.tika.parser.AbstractParser
-
- org.apache.tika.parser.mail.RFC822Parser
-
- All Implemented Interfaces:
Serializable,org.apache.tika.parser.Parser
public class RFC822Parser extends org.apache.tika.parser.AbstractParserUses apache-mime4j to parse emails. Each part is treated with the corresponding parser and displayed within elements. AMimeConfigobject can be passed in the parsing context to better control the parsing process.- Author:
- jnioche@digitalpebble.com
- See Also:
- Serialized Form
-
-
Constructor Summary
Constructors Constructor Description RFC822Parser()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description Set<org.apache.tika.mime.MediaType>getSupportedTypes(org.apache.tika.parser.ParseContext context)voidparse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context)voidsetExtractAllAlternatives(boolean extractAllAlternatives)Until version 1.17, Tika handled all body parts as embedded objects (see TIKA-2478).
-
-
-
Method Detail
-
getSupportedTypes
public Set<org.apache.tika.mime.MediaType> getSupportedTypes(org.apache.tika.parser.ParseContext context)
-
parse
public void parse(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, org.apache.tika.parser.ParseContext context) throws IOException, SAXException, org.apache.tika.exception.TikaException
- Throws:
IOExceptionSAXExceptionorg.apache.tika.exception.TikaException
-
setExtractAllAlternatives
public void setExtractAllAlternatives(boolean extractAllAlternatives)
Until version 1.17, Tika handled all body parts as embedded objects (see TIKA-2478). In 1.17, we modified the parser to select only the best alternative body parts for multipart/alternative sections, and we inline the content as we do for .msg files.The legacy behavior can be set by setting
extractAllAlternativestotrue. As of 1.17, the default value isfalse- Parameters:
extractAllAlternatives- whether or not to extract all alternative parts- Since:
- 1.17
-
-