Class DocumentTitleMatchClassifier

  • All Implemented Interfaces:
    BoilerpipeFilter

    public final class DocumentTitleMatchClassifier
    extends java.lang.Object
    implements BoilerpipeFilter
    Marks TextBlocks which contain parts of the HTML <TITLE> tag, using some heuristics which are quite specific to the news domain.