Package de.l3s.boilerpipe.extractors
Class KeepEverythingWithMinKWordsExtractor
- java.lang.Object
-
- de.l3s.boilerpipe.extractors.ExtractorBase
-
- de.l3s.boilerpipe.extractors.KeepEverythingWithMinKWordsExtractor
-
- All Implemented Interfaces:
BoilerpipeExtractor,BoilerpipeFilter
public final class KeepEverythingWithMinKWordsExtractor extends ExtractorBase
A full-text extractor which extracts the largest text component of a page. For news articles, it may perform better than theDefaultExtractor, but usually worse thanArticleExtractor.
-
-
Constructor Summary
Constructors Constructor Description KeepEverythingWithMinKWordsExtractor(int kMin)
-
Method Summary
Modifier and Type Method Description booleanprocess(TextDocument doc)Processes the given documentdoc.
-
-
-
Method Detail
-
process
public boolean process(TextDocument doc) throws BoilerpipeProcessingException
Description copied from interface:BoilerpipeFilterProcesses the given documentdoc.- Parameters:
doc- TheTextDocumentthat is to be processed.- Returns:
trueif changes have been made to theTextDocument.- Throws:
BoilerpipeProcessingException
-
-