Interface SMAggregate
-
- All Superinterfaces:
SimilarityMeasure
- All Known Subinterfaces:
SMAggregateAverage,SMAggregateEuclidian,SMAggregateKMaximum,SMAggregateKMinimum,SMAggregateMaximum,SMAggregateMinimum,SMAggregateMinkowski,SMAggregateWeighted
- All Known Implementing Classes:
SMAggregateAverageImpl,SMAggregateEuclidianImpl,SMAggregateImpl,SMAggregateKMaximumImpl,SMAggregateKMinimumImpl,SMAggregateMaximumImpl,SMAggregateMinimumImpl,SMAggregateMinkowskiImpl,SMAggregateWeightedImpl
public interface SMAggregate extends SimilarityMeasure
Abstract interface that collects all similarity measures forAggregateClasses.Global similarity measures are defined by applying an aggregation function Φ to the local similarity values. The simple similarity measures for numeric attributes can be generalized easily to aggregation functions. Such aggregation functions are defined by determining
- a basic aggregation function and
- a weight model that determines weights ω = (ω1,\ldots, ωi) such that 0 ≤ ωi ≤ 1 and ∑1n ωi = 1
The default weight is 1.0 for all attributes. To ensure that ∑1n ωi = 1 all weights will be normalized automatically during runtime.
The aggregate measures can be defined in the xml file sim.xml. Therefore, it's necessary that an aggregate class was created in the xml file model.xml, which is referenced in the definition of the measure. It also needs an arbitrary name. In the inner tag, weights for the single attributes can be defined. The aggregate classes Average, Euclidian and Minkowski need weights anyway, otherwise the similarity will always be 1.0. The other classes will have the same weight for each attribute, if no weights are defined.
For example, an aggregate measure can look like:
<AggregateMinimum name="AggregateMinimumDataflowWeighted" class="DataflowElement" default="false"> <AggWeight att="name" weight="0.5"/> </AggregateMinimum>- Author:
- Rainer Maximini
-
-
Field Summary
Fields Modifier and Type Field Description static StringCOMPONENTComponent: "cake.data.similarity" (LogComponentIdentifier.COMPONENT)static StringCOMPONENT_KEYComponent-Key: "02" (LogComponentIdentifier.COMPONENT_KEY_SMAGGREGATE)static booleanDEFAULT_IGNORE_NULL_ATTRIBUTES_IN_QUERYThe default for ignoring null attribute values as void is true.static StringLOG_ATTRIBUTE_NAME_NOT_FOUNDAttribute-name not found in aggregate class in current measure.static StringLOG_ATTRIBUTE_NOT_FOUNDAttribute not found in aggregate class in current measure.static StringPROPERTY_USER_WEIGHTThe query case can contain user weights $w_u$ that are stored in the properties, accessable with this key.-
Fields inherited from interface de.uni_trier.wi2.procake.similarity.SimilarityMeasure
LOG_ORDER_NAME_NOT_FOUND
-
-
Method Summary
All Methods Instance Methods Abstract Methods Modifier and Type Method Description booleanisIgnoreNullAttributesInQuery()voidsetIgnoreNullAttributesInQuery(boolean ignoreNullAttributesInQuery)-
Methods inherited from interface de.uni_trier.wi2.procake.similarity.SimilarityMeasure
compute, getDataClass, getName, getSystemName, isForceOverride, isReusable, setForceOverride
-
-
-
-
Field Detail
-
DEFAULT_IGNORE_NULL_ATTRIBUTES_IN_QUERY
static final boolean DEFAULT_IGNORE_NULL_ATTRIBUTES_IN_QUERY
The default for ignoring null attribute values as void is true.- See Also:
- Constant Field Values
-
COMPONENT
static final String COMPONENT
Component: "cake.data.similarity" (LogComponentIdentifier.COMPONENT)- See Also:
- Constant Field Values
-
COMPONENT_KEY
static final String COMPONENT_KEY
Component-Key: "02" (LogComponentIdentifier.COMPONENT_KEY_SMAGGREGATE)- See Also:
- Constant Field Values
-
LOG_ATTRIBUTE_NAME_NOT_FOUND
static final String LOG_ATTRIBUTE_NAME_NOT_FOUND
Attribute-name not found in aggregate class in current measure.- Component:
COMPONENT - Key: "0200"
- this
- Class:
AggregateClass - Attributename
- See Also:
- Constant Field Values
- Component:
-
LOG_ATTRIBUTE_NOT_FOUND
static final String LOG_ATTRIBUTE_NOT_FOUND
Attribute not found in aggregate class in current measure.- Component:
COMPONENT - Key: "0201"
- this
- Class:
AggregateClass - Attribute
- See Also:
- Constant Field Values
- Component:
-
PROPERTY_USER_WEIGHT
static final String PROPERTY_USER_WEIGHT
The query case can contain user weights $w_u$ that are stored in the properties, accessable with this key. The weight $w$ for an attrbibute is the mulitplikation of $w_u$ and $w_c$, the weight defined for the class.- See Also:
- Constant Field Values
-
-