public class XPSExtractorDecorator extends AbstractOOXMLExtractor
config, EMBEDDED_RELATIONSHIPS, extractor| Constructor and Description |
|---|
XPSExtractorDecorator(org.apache.tika.parser.ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor) |
| Modifier and Type | Method and Description |
|---|---|
protected void |
buildXHTML(org.apache.tika.sax.XHTMLContentHandler xhtml)
Populates the
XHTMLContentHandler object received as parameter. |
org.apache.poi.ooxml.POIXMLDocument |
getDocument()
Returns the opened document.
|
protected List<org.apache.poi.openxml4j.opc.PackagePart> |
getMainDocumentParts()
Return a list of the main parts of the document, used
when searching for embedded resources.
|
getJustFileName, getMetadataExtractor, getXHTML, handleEmbeddedFile, loadLinkedRelationshipspublic XPSExtractorDecorator(org.apache.tika.parser.ParseContext context,
org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor)
throws org.apache.tika.exception.TikaException
org.apache.tika.exception.TikaExceptionpublic org.apache.poi.ooxml.POIXMLDocument getDocument()
OOXMLExtractorgetDocument in interface OOXMLExtractorgetDocument in class AbstractOOXMLExtractorOOXMLExtractor.getDocument()protected void buildXHTML(org.apache.tika.sax.XHTMLContentHandler xhtml)
throws SAXException,
IOException
AbstractOOXMLExtractorXHTMLContentHandler object received as parameter.buildXHTML in class AbstractOOXMLExtractorSAXExceptionIOExceptionprotected List<org.apache.poi.openxml4j.opc.PackagePart> getMainDocumentParts() throws org.apache.tika.exception.TikaException
AbstractOOXMLExtractorgetMainDocumentParts in class AbstractOOXMLExtractororg.apache.tika.exception.TikaExceptionCopyright © 2007–2022 The Apache Software Foundation. All rights reserved.