Class CommonTagActions


  • public abstract class CommonTagActions
    extends java.lang.Object
    Defines an action that is to be performed whenever a particular tag occurs during HTML parsing.
    • Field Detail

      • TA_IGNORABLE_ELEMENT

        public static final TagAction TA_IGNORABLE_ELEMENT
        Marks this tag as "ignorable", i.e. all its inner content is silently skipped.
      • TA_ANCHOR_TEXT

        public static final TagAction TA_ANCHOR_TEXT
        Marks this tag as "anchor" (this should usually only be set for the <A> tag). Anchor tags may not be nested. There is a bug in certain versions of NekoHTML which still allows nested tags. If boilerpipe encounters such nestings, a SAXException is thrown.
      • TA_BODY

        public static final TagAction TA_BODY
        Marks this tag the body element (this should usually only be set for the <BODY> tag).
      • TA_INLINE_WHITESPACE

        public static final TagAction TA_INLINE_WHITESPACE
        Marks this tag a simple "inline" element, which generates whitespace, but no new block.
      • TA_INLINE_NO_WHITESPACE

        public static final TagAction TA_INLINE_NO_WHITESPACE
        Marks this tag a simple "inline" element, which neither generates whitespace, nor a new block.
      • TA_BLOCK_LEVEL

        public static final TagAction TA_BLOCK_LEVEL
        Explicitly marks this tag a simple "block-level" element, which always generates whitespace
      • TA_FONT

        public static final TagAction TA_FONT
        Special TagAction for the <FONT> tag, which keeps track of the absolute and relative font size.