Uses of Interface
de.jungblut.crawl.extraction.Extractor
-
Packages that use Extractor Package Description de.jungblut.crawl de.jungblut.crawl.extraction -
-
Uses of Extractor in de.jungblut.crawl
Methods in de.jungblut.crawl with parameters of type Extractor Modifier and Type Method Description voidCrawler. setup(int fetches, Extractor<T> extractor, ResultWriter<T> writer)Setups this crawler.voidMultithreadedCrawler. setup(int fetches, Extractor<T> extractor, ResultWriter<T> writer)voidSequentialCrawler. setup(int fetches, Extractor<T> extractor, ResultWriter<T> writer)Constructors in de.jungblut.crawl with parameters of type Extractor Constructor Description FetchThread(java.util.List<java.lang.String> url, Extractor<T> extractor)MultithreadedCrawler(int threadPoolSize, int batchSize, int fetches, Extractor<T> extractor, ResultWriter<T> writer)Constructs a new Multithreaded Crawler.MultithreadedCrawler(int fetches, Extractor<T> extractor, ResultWriter<T> writer)Constructs a new Multithreaded Crawler with 32 threads working on 10 url batches at each time.SequentialCrawler(int fetches, Extractor<T> extractor, ResultWriter<T> writer) -
Uses of Extractor in de.jungblut.crawl.extraction
Classes in de.jungblut.crawl.extraction that implement Extractor Modifier and Type Class Description classArticleContentExtratorExtractor for news articles.classHtmlExtratorExtractor for raw html.classOutlinkExtractorOutlink extractor, parses a page just for its outlinks.
-