Class BlockProximityFusion
java.lang.Object
  de.l3s.boilerpipe.filters.heuristics.BlockProximityFusion

All Implemented Interfaces:
  BoilerpipeFilter
public final class BlockProximityFusion extends java.lang.Object implements BoilerpipeFilter
Fuses adjacent blocks if their distance (in blocks) does not exceed a certain limit. This probably makes sense only in cases where an upstream filter has already removed some blocks.
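The fusion rule described above can be illustrated with a small, self-contained sketch. It is not the boilerpipe implementation: the Block class, the fuse method, and the definition of "distance" as the number of missing blocks between two neighbours are assumptions made for illustration only, and the contentOnly and sameTagLevelOnly options are left out.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in types; NOT the boilerpipe TextBlock/TextDocument API.
final class ProximityFusionSketch {

    /** A simplified block: text plus the range of original block indices it covers. */
    static final class Block {
        final int start;   // index of the first original block covered
        final int end;     // index of the last original block covered
        final String text;

        Block(int start, int end, String text) {
            this.start = start;
            this.end = end;
            this.text = text;
        }
    }

    /**
     * Merges each block into its predecessor when the number of blocks
     * missing between them is at most maxDistance (assumed distance measure).
     * Returns a new list; the input list is not modified.
     */
    static List<Block> fuse(List<Block> blocks, int maxDistance) {
        List<Block> result = new ArrayList<>();
        for (Block current : blocks) {
            if (!result.isEmpty()) {
                Block prev = result.get(result.size() - 1);
                int gap = current.start - prev.end - 1; // blocks removed in between
                if (gap <= maxDistance) {
                    // Replace the predecessor with the merged block.
                    result.set(result.size() - 1,
                            new Block(prev.start, current.end, prev.text + "\n" + current.text));
                    continue;
                }
            }
            result.add(current);
        }
        return result;
    }

    public static void main(String[] args) {
        // Original positions 0, 2 and 6: an upstream step removed blocks 1 and 3..5.
        List<Block> input = List.of(
                new Block(0, 0, "intro"),
                new Block(2, 2, "body"),
                new Block(6, 6, "footer"));
        for (Block b : fuse(input, 1)) {
            System.out.println(b.start + "-" + b.end + ": " + b.text.replace("\n", " | "));
        }
    }
}
```

With a limit of 1, the blocks at positions 0 and 2 are merged because only one block is missing between them, while the block at position 6 stays separate. If no blocks had been removed upstream, every gap would be 0 and all blocks would merge under this simplified rule, which is consistent with the remark that the filter is mainly useful after an upstream filter has run.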
-
-
Field Summary
Fields
Modifier and Type              Field                                        Description
static BlockProximityFusion    MAX_DISTANCE_1
static BlockProximityFusion    MAX_DISTANCE_1_CONTENT_ONLY
static BlockProximityFusion    MAX_DISTANCE_1_CONTENT_ONLY_SAME_TAGLEVEL
static BlockProximityFusion    MAX_DISTANCE_1_SAME_TAGLEVEL
-
Constructor Summary
Constructors
Constructor                                                                                   Description
BlockProximityFusion(int maxBlocksDistance, boolean contentOnly, boolean sameTagLevelOnly)    Creates a new BlockProximityFusion instance.
-
Method Summary
Modifier and Type    Method                       Description
boolean              process(TextDocument doc)    Processes the given document doc.
-
-
-
Field Detail
-
MAX_DISTANCE_1
public static final BlockProximityFusion MAX_DISTANCE_1
-
MAX_DISTANCE_1_SAME_TAGLEVEL
public static final BlockProximityFusion MAX_DISTANCE_1_SAME_TAGLEVEL
-
MAX_DISTANCE_1_CONTENT_ONLY
public static final BlockProximityFusion MAX_DISTANCE_1_CONTENT_ONLY
-
MAX_DISTANCE_1_CONTENT_ONLY_SAME_TAGLEVEL
public static final BlockProximityFusion MAX_DISTANCE_1_CONTENT_ONLY_SAME_TAGLEVEL
-
-
Constructor Detail
-
BlockProximityFusion
public BlockProximityFusion(int maxBlocksDistance, boolean contentOnly, boolean sameTagLevelOnly)
Creates a new BlockProximityFusion instance.
Parameters:
maxBlocksDistance - The maximum distance in blocks.
contentOnly -
-
-
Method Detail
-
process
public boolean process(TextDocument doc) throws BoilerpipeProcessingException
Description copied from interface: BoilerpipeFilter
Processes the given document doc.
Specified by:
process in interface BoilerpipeFilter
Parameters:
doc - The TextDocument that is to be processed.
Returns:
true if changes have been made to the TextDocument.
Throws:
BoilerpipeProcessingException
-
-