| Modifier and Type | Method and Description |
|---|---|
List<OcrBox> |
OcrSnippet.getSnippetRegions()
Get the region of the page that the snippes is located in
|
protected List<OcrBox> |
OcrPassageFormatter.mergeBoxes(List<OcrBox> boxes)
Merge adjacent OCR boxes into a single one, taking line breaks into account
|
protected abstract List<OcrBox> |
OcrPassageFormatter.parseWords(String ocrFragment,
String startPage)
Parse word boxes from an OCR fragment.
|
| Modifier and Type | Method and Description |
|---|---|
void |
OcrSnippet.addHighlightRegion(List<OcrBox> region)
Add a new highlighted region in the snippet.
|
protected void |
OcrPassageFormatter.addHighlightsToSnippet(List<List<OcrBox>> hlBoxes,
OcrSnippet snippet) |
protected List<OcrBox> |
OcrPassageFormatter.mergeBoxes(List<OcrBox> boxes)
Merge adjacent OCR boxes into a single one, taking line breaks into account
|
| Constructor and Description |
|---|
OcrSnippet(String text,
List<OcrBox> snippetRegions)
Create a new snippet on the given region on the page along with its plaintext.
|
| Modifier and Type | Method and Description |
|---|---|
protected List<OcrBox> |
AltoPassageFormatter.parseWords(String ocrFragment,
String startPage) |
| Modifier and Type | Method and Description |
|---|---|
protected List<OcrBox> |
HocrPassageFormatter.parseWords(String ocrFragment,
String startPage) |
| Modifier and Type | Method and Description |
|---|---|
protected List<OcrBox> |
MiniOcrPassageFormatter.parseWords(String ocrFragment,
String startPage) |
| Modifier and Type | Method and Description |
|---|---|
protected void |
MiniOcrPassageFormatter.addHighlightsToSnippet(List<List<OcrBox>> hlBoxes,
OcrSnippet snippet) |
| Modifier and Type | Method and Description |
|---|---|
int |
OcrBox.compareTo(OcrBox o) |
Copyright © 2019. All rights reserved.