- generateRuleRegex(Rule) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
- getAfter() - Method in class net.sf.okapi.lib.segmentation.Rule
-
Gets the pattern after the break point for this rule.
- getAllLanguageRules() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets a map of all the language rules in this document.
- getAllLanguagesMaps() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the list of all the language maps in this document.
- getBefore() - Method in class net.sf.okapi.lib.segmentation.Rule
-
Gets the pattern before the break point for this rule.
- getComment() - Method in class net.sf.okapi.lib.segmentation.Rule
-
Gets the optional comment for this rule.
- getComments() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the comments associated with this document.
- getHeaderComments() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the comments associated with the header of this document.
- getLanguage() - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- getLanguageRules(String) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the list of rules for a given <languagerule> element.
- getMaskRule() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the current pattern of the mask rule.
- getNextSegmentRange(TextContainer) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- getPattern() - Method in class net.sf.okapi.lib.segmentation.LanguageMap
-
Gets the pattern associated with this language map.
- getRanges() - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- getRuleName() - Method in class net.sf.okapi.lib.segmentation.LanguageMap
-
Gets the name of this language map.
- getSampleLanguage() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the current sample language code.
- getSampleText() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the current sample text.
- getSplitPositions() - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- getVersion() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the version of this SRX document.
- getWarning() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Gets the last warning that was issued while loading a document.
- saveRules(String, boolean, boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Saves the current rules to an SRX rules document.
- saveRulesToString(boolean, boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Saves the current rules to an SRX string.
- SegmentationRuleException - Exception in net.sf.okapi.lib.segmentation
-
Signals that a severe error related to segmentation has occurred.
- SegmentationRuleException(String) - Constructor for exception net.sf.okapi.lib.segmentation.SegmentationRuleException
-
Creates a new SegmentationRuleException object with a given message.
- SegmentationRuleException(Throwable) - Constructor for exception net.sf.okapi.lib.segmentation.SegmentationRuleException
-
Creates a new SegmentationRuleException object with a given parent exception.
- segmentSubFlows() - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Indicates if sub-flows must be segmented.
- segmentSubFlows() - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setActive(boolean) - Method in class net.sf.okapi.lib.segmentation.Rule
-
Sets the flag indicating if this rule is active.
- setAfter(String) - Method in class net.sf.okapi.lib.segmentation.Rule
-
Sets the pattern after the break point for this rule.
- setBefore(String) - Method in class net.sf.okapi.lib.segmentation.Rule
-
Sets the pattern before the break point for this rule.
- setBreak(boolean) - Method in class net.sf.okapi.lib.segmentation.Rule
-
Sets the flag indicating if this rule is a breaking rule.
- setCascade(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the flag indicating if cascading must be applied when selecting the
rules for a given language pattern.
- setCascade(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
Sets the flag indicating if cascading must be applied when selecting the
rules for a given language pattern.
- setComment(String) - Method in class net.sf.okapi.lib.segmentation.Rule
-
Sets the comment for this rule.
- setComments(String) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the comments for this document.
- setHeaderComments(String) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the comments for the header of this document.
- setIncludeEndCodes(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator that tells if end codes should be included or not.
- setIncludeEndCodes(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setIncludeIsolatedCodes(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator that tells if isolated codes should be included or
not.
- setIncludeIsolatedCodes(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setIncludeStartCodes(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator that tells if start codes should be included or not.
- setIncludeStartCodes(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setLanguage(LocaleId) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setMaskRule(String) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the pattern for the mask rule.
- setMaskRule(String) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
Sets the pattern for the mask rule.
- setModified(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the flag indicating if the document has been modified since the last
load or save.
- setOneSegmentIncludesAll(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator that tells if, when a text has a single segment, that
segment should include the whole text (no trimming of spaces or codes on
the left or right).
- setOneSegmentIncludesAll(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setOptions(boolean, boolean, boolean, boolean, boolean, boolean, boolean, boolean, boolean, boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
Sets the options for this segmenter.
- setOptions(boolean, boolean, boolean, boolean, boolean, boolean, boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setSampleLanguage(String) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the sample language code.
- setSampleText(String) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the sample text.
- setSegmentSubFlows(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the flag indicating if sub-flows must be segmented.
- setSegmentSubFlows(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setTestOnSelectedGroup(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator on how to apply rules for samples.
- setTreatIsolatedCodesAsWhitespace(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator if this document should treat isolated codes as
whitespace when matching SRX rules.
- setTreatIsolatedCodesAsWhitespace(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setTrimCodes(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setTrimLeadingWhitespaces(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator that tells if leading white-spaces should be left
outside the segments.
- setTrimLeadingWS(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setTrimTrailingWhitespaces(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator that tells if trailing white-spaces should be left
outside the segments.
- setTrimTrailingWS(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
- setUseICU4JBreakRules(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Sets the indicator that tells if this document uses ICU4J BreakIterator rules.
- setUseJavaRegex(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXDocument
-
Deprecated.
- setUseJavaRegex(boolean) - Method in class net.sf.okapi.lib.segmentation.SRXSegmenter
-
Sets the indicator that tells if this document has rules that are defined for the Java regular expression engine (vs ICU).
- SRXDocument - Class in net.sf.okapi.lib.segmentation
-
Provides facilities to load, save, and manage segmentation rules in SRX
format.
- SRXDocument() - Constructor for class net.sf.okapi.lib.segmentation.SRXDocument
-
Creates an empty SRX document.
- SRXSegmenter - Class in net.sf.okapi.lib.segmentation
-
Implements the ISegmenter interface for SRX rules.
- SRXSegmenter() - Constructor for class net.sf.okapi.lib.segmentation.SRXSegmenter
-
Creates a new SRXSegmenter object.
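
Several entries above describe a rule as a pattern before the break point, a pattern after the break point, and a flag saying whether the rule breaks. The sketch below illustrates that idea with plain java.util.regex only. It does not use the Okapi classes: SimpleRule, splitPositions and segment are hypothetical names introduced for this example, and the "first matching rule decides" ordering is an assumption based on how SRX rules are generally applied, not a statement about the SRXSegmenter implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class BreakRuleSketch {

    /** Hypothetical stand-in for a segmentation rule; not the Okapi Rule class. */
    static final class SimpleRule {
        final String before;   // regex that must match the text ending at the position
        final String after;    // regex that must match the text starting at the position
        final boolean isBreak; // true = break here, false = exception (do not break)

        SimpleRule(String before, String after, boolean isBreak) {
            this.before = before;
            this.after = after;
            this.isBreak = isBreak;
        }
    }

    /** Tests whether both patterns of a rule match around the given position. */
    static boolean matchesAt(String text, int pos, SimpleRule rule) {
        // Patterns are compiled on every call to keep the sketch short;
        // real code would compile them once.
        Matcher before = Pattern.compile("(?:" + rule.before + ")\\z")
                .matcher(text.substring(0, pos));
        Matcher after = Pattern.compile(rule.after).matcher(text);
        after.region(pos, text.length());
        return before.find() && after.lookingAt();
    }

    /** Collects break positions; the first rule that matches at a position decides. */
    static List<Integer> splitPositions(String text, List<SimpleRule> rules) {
        List<Integer> positions = new ArrayList<>();
        for (int pos = 1; pos < text.length(); pos++) {
            for (SimpleRule rule : rules) {
                if (matchesAt(text, pos, rule)) {
                    if (rule.isBreak) {
                        positions.add(pos);
                    }
                    break; // later rules are ignored at this position
                }
            }
        }
        return positions;
    }

    /** Cuts the text at the computed break positions, without trimming whitespace. */
    static List<String> segment(String text, List<SimpleRule> rules) {
        List<String> segments = new ArrayList<>();
        int start = 0;
        for (int pos : splitPositions(text, rules)) {
            segments.add(text.substring(start, pos));
            start = pos;
        }
        segments.add(text.substring(start));
        return segments;
    }

    public static void main(String[] args) {
        List<SimpleRule> rules = Arrays.asList(
                new SimpleRule("Mr\\.", "\\s", false), // exception: no break after "Mr."
                new SimpleRule("\\.", "\\s", true));   // break after a period followed by whitespace
        for (String s : segment("Mr. Smith left. He was late.", rules)) {
            System.out.println("[" + s + "]");
        }
    }
}
```

With the two rules in main, the non-breaking "Mr." rule is listed first so that it takes precedence over the general period rule. The second segment keeps its leading space because this sketch does no trimming; the trim options listed above (leading and trailing white-spaces) address that in the real classes.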