Class DocumentTitleMatchClassifier

  • All Implemented Interfaces:
    BoilerpipeFilter

    public final class DocumentTitleMatchClassifier
    extends java.lang.Object
    implements BoilerpipeFilter
    Marks TextBlocks that contain parts of the text of the HTML <TITLE> tag, using heuristics that are specific to the news domain.
    • Method Summary

      Modifier and Type                  Method                       Description
      java.util.Set<java.lang.String>    getPotentialTitles()
      boolean                            process(TextDocument doc)    Processes the given document doc.
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
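
The class description above mentions title-matching heuristics without spelling them out. The following is a minimal, self-contained sketch of one plausible form such a heuristic could take; it is not the boilerpipe source and does not use the boilerpipe API. The class name TitleMatchSketch, the methods potentialTitles and matchingBlocks, the separator list, and the "keep the longest part" rule are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Pattern;

// Illustrative sketch only: a hypothetical stand-in, not the library's code.
final class TitleMatchSketch {

    // Assumed separators that news sites often use to append a site name.
    private static final String[] SEPARATORS = {" | ", " - ", " : "};

    // Lowercases, trims, and collapses runs of whitespace to a single space.
    static String normalize(String s) {
        return s.trim().replaceAll("\\s+", " ").toLowerCase(Locale.ROOT);
    }

    // Builds candidate titles: the full normalized title plus, for each
    // separator that occurs in it, the longest part after splitting.
    static Set<String> potentialTitles(String rawTitle) {
        Set<String> candidates = new LinkedHashSet<>();
        if (rawTitle == null) {
            return candidates;
        }
        String title = normalize(rawTitle);
        if (title.isEmpty()) {
            return candidates;
        }
        candidates.add(title);
        for (String sep : SEPARATORS) {
            if (!title.contains(sep)) {
                continue;
            }
            String longest = "";
            for (String part : title.split(Pattern.quote(sep))) {
                String p = part.trim();
                if (p.length() > longest.length()) {
                    longest = p;
                }
            }
            if (!longest.isEmpty()) {
                candidates.add(longest);
            }
        }
        return candidates;
    }

    // Returns the indices of blocks whose normalized text equals a candidate.
    static List<Integer> matchingBlocks(List<String> blocks, Set<String> candidates) {
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < blocks.size(); i++) {
            if (candidates.contains(normalize(blocks.get(i)))) {
                hits.add(i);
            }
        }
        return hits;
    }
}
```

In this sketch, a title such as "Storm hits coast - Example News" yields two candidates (the full string and "storm hits coast"), so a block containing only the headline is matched even though the page title also carries the site name. The real classifier works on TextDocument and TextBlock objects and may use different separators and matching rules.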