| Package | Description |
|---|---|
| de.l3s.icrawl.contentanalysis | Analysis of crawled documents |

| Modifier and Type | Method and Description |
|---|---|
| `static WebPageDateExtractor.WebPageDate` | `WebPageDateExtractor.extractModifiedDate(org.jsoup.nodes.Document dom)` Extract the likely modification date from a parsed document. |
| `static WebPageDateExtractor.WebPageDate` | `WebPageDateExtractor.getModifiedDate(String url, org.jsoup.nodes.Document document, Long httpModifiedTime, org.apache.hadoop.mapreduce.Mapper.Context context)` |

Copyright © 2017. All rights reserved.