public class HocrPassageFormatter extends OcrPassageFormatter
absoluteHighlights| Constructor and Description |
|---|
HocrPassageFormatter(String startHlTag,
String endHlTag,
boolean absoluteHighlights) |
| Modifier and Type | Method and Description |
|---|---|
String |
determineStartPage(String ocrFragment,
int startOffset,
IterableCharSequence content)
Determine the id of the page an OCR fragment resides on.
|
protected List<OcrBox> |
parseWords(String ocrFragment,
String startPage)
Parse word boxes from an OCR fragment.
|
addHighlightsToSnippet, format, format, getTextFromXml, mergeBoxes, parseFragmentpublic String determineStartPage(String ocrFragment, int startOffset, IterableCharSequence content)
OcrPassageFormatterdetermineStartPage in class OcrPassageFormatterprotected List<OcrBox> parseWords(String ocrFragment, String startPage)
OcrPassageFormatterparseWords in class OcrPassageFormatterCopyright © 2019. All rights reserved.