Package com.github.euler.tika
Class DefaultEmbeddedStrategy
java.lang.Object
  com.github.euler.tika.EmbeddedStrategy
    com.github.euler.tika.DefaultEmbeddedStrategy
- All Implemented Interfaces:
org.apache.tika.extractor.EmbeddedDocumentExtractor
public class DefaultEmbeddedStrategy extends EmbeddedStrategy
-
Constructor Summary
Constructors
DefaultEmbeddedStrategy(int maxDepth, List<Pattern> includeParseEmbeddedPatterns, List<Pattern> excludeParseEmbeddedPatterns, List<Pattern> includeExtractEmbeddedPatterns, List<Pattern> excludeExtractEmbeddedPatterns, String mimeTypeField, boolean outputName)
-
Method Summary
Modifier and Type    Method
List<Pattern>        getExcludeExtractEmbeddedPatterns()
List<Pattern>        getExcludeParseEmbeddedPatterns()
List<Pattern>        getIncludeExtractEmbeddedPatterns()
List<Pattern>        getIncludeParseEmbeddedPatterns()
int                  getMaxDepth()
String               getMimeTypeField()
protected boolean    isIncluded(List<Pattern> includedPatterns, List<Pattern> excludedPatterns, String value)
boolean              isOutputName()
void                 parseEmbedded(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, boolean outputHtml)
protected boolean    shouldExtractEmbedded(org.apache.tika.metadata.Metadata metadata)
protected boolean    shouldParseEmbedded(ProcessingContext ctx)
Methods inherited from class com.github.euler.tika.EmbeddedStrategy
getListener, getParseContext, notifyNewExtractedEmbedded, setListener, setParseContext, shouldParseEmbedded
-
Method Detail
-
parseEmbedded
public void parseEmbedded(InputStream stream, ContentHandler handler, org.apache.tika.metadata.Metadata metadata, boolean outputHtml) throws SAXException, IOException
- Throws:
SAXException
IOException
-
shouldExtractEmbedded
protected boolean shouldExtractEmbedded(org.apache.tika.metadata.Metadata metadata)
-
shouldParseEmbedded
protected boolean shouldParseEmbedded(ProcessingContext ctx)
- Specified by:
shouldParseEmbedded in class EmbeddedStrategy
-
isIncluded
protected boolean isIncluded(List<Pattern> includedPatterns, List<Pattern> excludedPatterns, String value)
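This page gives only the signature of isIncluded, so its exact behavior is not documented here. The sketch below shows one common way such an include/exclude check could work: a value passes when it matches at least one include pattern (or the include list is empty) and matches no exclude pattern. The class name IncludeExcludeSketch, the empty-list rule, and the use of full-string matching are all assumptions for illustration; they are not taken from the euler source.

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical stand-alone sketch; not the DefaultEmbeddedStrategy implementation.
final class IncludeExcludeSketch {

    // Assumed semantics: an empty include list accepts everything,
    // and any exclude match always rejects the value.
    static boolean isIncluded(List<Pattern> includedPatterns,
                              List<Pattern> excludedPatterns,
                              String value) {
        boolean included = includedPatterns.isEmpty();
        for (Pattern p : includedPatterns) {
            // matches() requires the whole value to match the pattern (an assumption here)
            if (p.matcher(value).matches()) {
                included = true;
                break;
            }
        }
        if (!included) {
            return false;
        }
        for (Pattern p : excludedPatterns) {
            if (p.matcher(value).matches()) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        List<Pattern> include = Arrays.asList(Pattern.compile("application/.*"));
        List<Pattern> exclude = Arrays.asList(Pattern.compile("application/zip"));

        System.out.println(isIncluded(include, exclude, "application/pdf")); // true
        System.out.println(isIncluded(include, exclude, "application/zip")); // false
        System.out.println(isIncluded(include, exclude, "image/png"));       // false
    }
}
```

Under these assumptions, the four pattern lists passed to the constructor would plausibly be checked in the same way, once for the parse decision and once for the extract decision; the actual class may differ.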
-
getMaxDepth
public int getMaxDepth()
-
getMimeTypeField
public String getMimeTypeField()
-
isOutputName
public boolean isOutputName()