Package de.julielab.xml
Class StaxXmiSplitter
- java.lang.Object
-
- de.julielab.xml.AbstractXmiSplitter
-
- de.julielab.xml.StaxXmiSplitter
-
- All Implemented Interfaces:
XmiSplitter
public class StaxXmiSplitter extends AbstractXmiSplitter
-
-
Field Summary
-
Fields inherited from class de.julielab.xml.AbstractXmiSplitter
annotationModules, currentSecondSofaMapKey, moduleAnnotationNames, nodesByXmiId, recursively, storeBaseDocument
-
Fields inherited from interface de.julielab.xml.XmiSplitter
DOCUMENT_MODULE_LABEL
-
-
Constructor Summary
Constructors
Constructor, Description
StaxXmiSplitter(Set<String> moduleAnnotationNames, boolean recursively, boolean storeBaseDocument, Set<String> baseDocumentAnnotations)
StaxXmiSplitter(Set<String> annotationModulesToExtract, boolean recursively, boolean storeBaseDocument, Set<String> baseDocumentAnnotations, int attribute_size)
    For large documents, the XMI sofa string can be very large (tens of megabytes).
-
Method Summary
Modifier and Type, Method, Description
protected String getNodeXml(JeDISVTDGraphNode node)
XmiSplitterResult process(byte[] xmiData, org.apache.uima.cas.TypeSystem ts, int nextPossibleId, Map<String,Integer> existingSofaIdMap)
-
Methods inherited from class de.julielab.xml.AbstractXmiSplitter
adaptSofaIdMap, assignNewXmiIds, createAnnotationModuleData, createAnnotationModules, determineLabelsForNode, labelNodes
-
Constructor Detail
-
StaxXmiSplitter
public StaxXmiSplitter(Set<String> moduleAnnotationNames, boolean recursively, boolean storeBaseDocument, Set<String> baseDocumentAnnotations)
-
StaxXmiSplitter
public StaxXmiSplitter(Set<String> annotationModulesToExtract, boolean recursively, boolean storeBaseDocument, Set<String> baseDocumentAnnotations, int attribute_size)
For large documents, the XMI sofa string can be very large (tens of megabytes). The StAX XML parser has a configurable limit on the maximum size of attribute values. This constructor allows passing a value for this limit.
- Parameters:
annotationModulesToExtract -
recursively -
storeBaseDocument -
baseDocumentAnnotations -
attribute_size -
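The following is a minimal sketch of why the attribute size matters, not part of this class. It assumes the usual UIMA XMI layout in which the document text is stored in the sofaString attribute of a cas:Sofa element, and it uses only the JDK's javax.xml.stream API. The class SofaAttributeDemo and its methods are hypothetical names introduced for illustration. Which parser property StaxXmiSplitter sets from attribute_size is not stated here, so the sketch does not configure any limit. It only shows that a StAX reader hands back the whole sofa string as one attribute value.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical demo class; it does not use StaxXmiSplitter itself.
class SofaAttributeDemo {

    // Builds a tiny XMI-like document whose sofaString attribute has the given length.
    // The element and attribute names follow the assumed UIMA XMI layout.
    static byte[] buildXmi(int sofaLength) {
        StringBuilder sb = new StringBuilder();
        sb.append("<?xml version=\"1.0\" encoding=\"UTF-8\"?>");
        sb.append("<xmi:XMI xmlns:xmi=\"http://www.omg.org/XMI\"");
        sb.append(" xmlns:cas=\"http:///uima/cas.ecore\">");
        sb.append("<cas:Sofa xmi:id=\"1\" sofaNum=\"1\" sofaID=\"_InitialView\" sofaString=\"");
        for (int i = 0; i < sofaLength; i++) {
            sb.append('a');
        }
        sb.append("\"/></xmi:XMI>");
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    // Returns the length of the sofaString attribute of the first Sofa element,
    // or -1 if no such element or attribute is found.
    static int sofaLength(byte[] xmiData) {
        try {
            XMLStreamReader reader = XMLInputFactory.newInstance()
                    .createXMLStreamReader(new ByteArrayInputStream(xmiData));
            try {
                while (reader.hasNext()) {
                    if (reader.next() == XMLStreamConstants.START_ELEMENT
                            && "Sofa".equals(reader.getLocalName())) {
                        // A null namespace URI means the attribute namespace is not checked.
                        String sofa = reader.getAttributeValue(null, "sofaString");
                        return sofa == null ? -1 : sofa.length();
                    }
                }
                return -1;
            } finally {
                reader.close();
            }
        } catch (XMLStreamException e) {
            throw new IllegalStateException("XMI could not be parsed", e);
        }
    }

    public static void main(String[] args) {
        // The whole document text arrives as a single attribute value.
        System.out.println(sofaLength(buildXmi(50_000)));
    }
}
```

Because the entire document text is one attribute value, a parser that caps attribute sizes may reject long documents unless that cap is raised, which is the situation the attribute_size constructor parameter addresses.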
-
-
Method Detail
-
getNodeXml
protected String getNodeXml(JeDISVTDGraphNode node)
- Specified by:
getNodeXml in class AbstractXmiSplitter
-
process
public XmiSplitterResult process(byte[] xmiData, org.apache.uima.cas.TypeSystem ts, int nextPossibleId, Map<String,Integer> existingSofaIdMap) throws XMISplitterException
- Throws:
XMISplitterException
-
-