Package de.l3s.boilerpipe.sax
Class CommonTagActions
- java.lang.Object
-
- de.l3s.boilerpipe.sax.CommonTagActions
-
public abstract class CommonTagActions extends java.lang.ObjectDefines an action that is to be performed whenever a particular tag occurs during HTML parsing.
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classCommonTagActions.BlockTagLabelActionCommonTagActionsfor block-level elements, which triggers someLabelActionon the generatedTextBlock.static classCommonTagActions.Chainedstatic classCommonTagActions.InlineTagLabelAction
-
Field Summary
Fields Modifier and Type Field Description static TagActionTA_ANCHOR_TEXTMarks this tag as "anchor" (this should usually only be set for the<A>tag).static TagActionTA_BLOCK_LEVELExplicitly marks this tag a simple "block-level" element, which always generates whitespacestatic TagActionTA_BODYMarks this tag the body element (this should usually only be set for the<BODY>tag).static TagActionTA_FONTSpecial TagAction for the<FONT>tag, which keeps track of the absolute and relative font size.static TagActionTA_IGNORABLE_ELEMENTMarks this tag as "ignorable", i.e.static TagActionTA_INLINEDeprecated.UseTA_INLINE_WHITESPACEinsteadstatic TagActionTA_INLINE_NO_WHITESPACEMarks this tag a simple "inline" element, which neither generates whitespace, nor a new block.static TagActionTA_INLINE_WHITESPACEMarks this tag a simple "inline" element, which generates whitespace, but no new block.
-
-
-
Field Detail
-
TA_IGNORABLE_ELEMENT
public static final TagAction TA_IGNORABLE_ELEMENT
Marks this tag as "ignorable", i.e. all its inner content is silently skipped.
-
TA_ANCHOR_TEXT
public static final TagAction TA_ANCHOR_TEXT
Marks this tag as "anchor" (this should usually only be set for the<A>tag). Anchor tags may not be nested. There is a bug in certain versions of NekoHTML which still allows nested tags. If boilerpipe encounters such nestings, a SAXException is thrown.
-
TA_BODY
public static final TagAction TA_BODY
Marks this tag the body element (this should usually only be set for the<BODY>tag).
-
TA_INLINE_WHITESPACE
public static final TagAction TA_INLINE_WHITESPACE
Marks this tag a simple "inline" element, which generates whitespace, but no new block.
-
TA_INLINE
@Deprecated public static final TagAction TA_INLINE
Deprecated.UseTA_INLINE_WHITESPACEinstead
-
TA_INLINE_NO_WHITESPACE
public static final TagAction TA_INLINE_NO_WHITESPACE
Marks this tag a simple "inline" element, which neither generates whitespace, nor a new block.
-
TA_BLOCK_LEVEL
public static final TagAction TA_BLOCK_LEVEL
Explicitly marks this tag a simple "block-level" element, which always generates whitespace
-
TA_FONT
public static final TagAction TA_FONT
Special TagAction for the<FONT>tag, which keeps track of the absolute and relative font size.
-
-