Uses of Class
de.jungblut.crawl.FetchResult
-
Packages that use FetchResult Package Description de.jungblut.crawl de.jungblut.crawl.extraction -
-
Uses of FetchResult in de.jungblut.crawl
Classes in de.jungblut.crawl with type parameters of type FetchResult Modifier and Type Class Description classConsoleResultWriter<T extends FetchResult>Simple class that outputs to console.interfaceCrawler<T extends FetchResult>Basic Crawler Interface, all implements should implicit give a constructor with the same arguments like setup and redirect the call to it.classFetchResultPersister<T extends FetchResult>Asynchronous persister thread, taking a resultwriter and handles the logic behind asynchronous writing to disk or an arbitrary sink implemented by theResultWriter.classFetchThread<T extends FetchResult>Callablefetcher that extracts, for a given list of URLs and with a givenExtractor, the content from the list of urls.classMultithreadedCrawler<T extends FetchResult>Fast multithreaded crawler, will start a fixed threadpool of 32 threads each will be fed by 10 urls at once.interfaceResultWriter<T extends FetchResult>Result writing interface.classResultWriterAdapter<T extends FetchResult>Empty Adapter class for aResultWriter.classSequenceFileResultWriter<T extends FetchResult>Writes the result into a sequencefile "files/crawl/result.seq".classSequentialCrawler<T extends FetchResult>Sequential crawler, mainly for debugging or development.Methods in de.jungblut.crawl with parameters of type FetchResult Modifier and Type Method Description voidSequenceFileResultWriter. write(FetchResult result) -
Uses of FetchResult in de.jungblut.crawl.extraction
Classes in de.jungblut.crawl.extraction with type parameters of type FetchResult Modifier and Type Interface Description interfaceExtractor<T extends FetchResult>Simple extraction logic interface for a site and a result.Subclasses of FetchResult in de.jungblut.crawl.extraction Modifier and Type Class Description static classArticleContentExtrator.ContentFetchResultArticle content fetch result.static classHtmlExtrator.HtmlFetchResultArticle content fetch result.Methods in de.jungblut.crawl.extraction that return FetchResult Modifier and Type Method Description FetchResultOutlinkExtractor. extract(java.lang.String realUrl)
-