Type Parameters:
I1 - The type of the first input DataSet of the CoGroup transformation.
I2 - The type of the second input DataSet of the CoGroup transformation.
OUT - The type of the result of the CoGroup transformation.

public class CoGroupOperator<I1,I2,OUT> extends TwoInputUdfOperator<I1,I2,OUT,CoGroupOperator<I1,I2,OUT>>

A DataSet that is the result of a CoGroup transformation.

See Also: DataSet

| Modifier and Type | Class and Description |
|---|---|
| static class | CoGroupOperator.CoGroupOperatorSets<I1,I2> Intermediate step of a CoGroup transformation. |
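
As a rough illustration of what a CoGroup transformation computes, the following plain-Java sketch groups two inputs by key and calls a user function once per key with both groups, passing an empty group when a key occurs in only one input. It does not use Flink: CoGroupSketch, GroupPairFunction, and demo are hypothetical names chosen for this example, and the real CoGroupFunction interface differs in detail.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.TreeSet;
import java.util.function.Function;

// Plain-Java illustration of CoGroup semantics; none of these names are Flink API.
class CoGroupSketch {

    // Hypothetical stand-in for a CoGroup user function: it sees both groups of one key.
    interface GroupPairFunction<I1, I2, OUT> {
        void coGroup(List<I1> first, List<I2> second, List<OUT> out);
    }

    static <K extends Comparable<K>, I1, I2, OUT> List<OUT> coGroup(
            List<I1> input1, List<I2> input2,
            Function<I1, K> key1, Function<I2, K> key2,
            GroupPairFunction<I1, I2, OUT> function) {
        // Group each input by its own key extractor.
        Map<K, List<I1>> groups1 = new TreeMap<>();
        for (I1 v : input1) {
            groups1.computeIfAbsent(key1.apply(v), x -> new ArrayList<>()).add(v);
        }
        Map<K, List<I2>> groups2 = new TreeMap<>();
        for (I2 v : input2) {
            groups2.computeIfAbsent(key2.apply(v), x -> new ArrayList<>()).add(v);
        }

        // Visit every key that occurs in at least one input, in sorted order.
        TreeSet<K> keys = new TreeSet<>(groups1.keySet());
        keys.addAll(groups2.keySet());

        List<OUT> out = new ArrayList<>();
        for (K k : keys) {
            // A key missing from one input yields an empty group for that side.
            function.coGroup(
                    groups1.getOrDefault(k, new ArrayList<>()),
                    groups2.getOrDefault(k, new ArrayList<>()),
                    out);
        }
        return out;
    }

    // Example: records are "id:payload" strings, keyed by the numeric id.
    static List<String> demo() {
        List<String> names = Arrays.asList("1:alice", "2:bob", "2:carol");
        List<String> orders = Arrays.asList("2:book", "3:pen");
        Function<String, Integer> idOf = s -> Integer.parseInt(s.substring(0, s.indexOf(':')));
        // Emit "<size of first group>/<size of second group>" for each key.
        return coGroup(names, orders, idOf, idOf,
                (first, second, out) -> out.add(first.size() + "/" + second.size()));
    }
}
```

Calling CoGroupSketch.demo() produces one entry per key 1, 2, and 3, which shows that keys present in only one input are still passed to the function.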

| Constructor and Description |
|---|
| CoGroupOperator(DataSet<I1> input1, DataSet<I2> input2, Keys<I1> keys1, Keys<I2> keys2, CoGroupFunction<I1,I2,OUT> function, TypeInformation<OUT> returnType, List<org.apache.commons.lang3.tuple.Pair<Integer,Order>> groupSortKeyOrderFirst, List<org.apache.commons.lang3.tuple.Pair<Integer,Order>> groupSortKeyOrderSecond, Partitioner<?> customPartitioner, String defaultName) |
| CoGroupOperator(DataSet<I1> input1, DataSet<I2> input2, Keys<I1> keys1, Keys<I2> keys2, CoGroupFunction<I1,I2,OUT> function, TypeInformation<OUT> returnType, Partitioner<?> customPartitioner, String defaultName) |

| Modifier and Type | Method and Description |
|---|---|
| protected Keys<I1> | getKeys1() |
| protected Keys<I2> | getKeys2() |
| Partitioner<?> | getPartitioner() Gets the custom partitioner used by this CoGroup, or null, if none is set. |
| protected CoGroupOperatorBase<?,?,OUT,?> | translateToDataFlow(Operator<I1> input1, Operator<I2> input2) Translates this Java API operator into a common API operator with two inputs. |
| CoGroupOperator<I1,I2,OUT> | withPartitioner(Partitioner<?> partitioner) Sets a custom partitioner for the CoGroup operation. |
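
To illustrate what a custom partitioner passed to withPartitioner does, here is a minimal plain-Java sketch. It assumes a partitioner is a function from a single key value and a partition count to a partition index, and it applies the same partitioner to the keys of both inputs so that equal keys land in the same partition. KeyPartitioner, route, and BY_MODULO are hypothetical names for this example, not Flink API.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java illustration; these names are hypothetical stand-ins, not Flink API.
class PartitionerSketch {

    // Assumed shape: map one key value to a partition index in [0, numPartitions).
    interface KeyPartitioner<K> {
        int partition(K key, int numPartitions);
    }

    // Example partitioner for integer keys: non-negative remainder of the key.
    static final KeyPartitioner<Integer> BY_MODULO =
            (key, n) -> Math.floorMod(key.intValue(), n);

    // Places each key into the partition chosen by the partitioner.
    static <K> List<List<K>> route(List<K> keys, KeyPartitioner<K> p, int numPartitions) {
        List<List<K>> partitions = new ArrayList<>();
        for (int i = 0; i < numPartitions; i++) {
            partitions.add(new ArrayList<>());
        }
        for (K key : keys) {
            partitions.get(p.partition(key, numPartitions)).add(key);
        }
        return partitions;
    }
}
```

Routing the keys 1, 2, 5, 8 of one input and 2, 4, 5 of another through BY_MODULO with three partitions sends the shared keys 2 and 5 to partition 2 on both sides. Because this sketch's partitioner receives exactly one key value, it also hints at why such a partitioner fits single-field keys rather than composite keys.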

Methods inherited from class TwoInputUdfOperator: extractSemanticAnnotationsFromUdf, getBroadcastSets, getParameters, getSemanticProperties, setSemanticProperties, withBroadcastSet, withConstantSetFirst, withConstantSetSecond, withParameters

Methods inherited from class TwoInputOperator: getInput1, getInput1Type, getInput2, getInput2Type

Methods inherited from class Operator: getName, getParallelism, getResultType, name, setParallelism

Methods inherited from class DataSet: aggregate, checkSameExecutionContext, clean, coGroup, cross, crossWithHuge, crossWithTiny, distinct, distinct, distinct, distinct, filter, first, flatMap, getExecutionEnvironment, getType, groupBy, groupBy, groupBy, iterate, iterateDelta, join, join, joinWithHuge, joinWithTiny, map, mapPartition, max, maxBy, min, minBy, output, partitionByHash, partitionByHash, partitionByHash, partitionCustom, partitionCustom, partitionCustom, print, printToErr, project, rebalance, reduce, reduceGroup, runOperation, sum, union, write, write, writeAsCsv, writeAsCsv, writeAsCsv, writeAsCsv, writeAsFormattedText, writeAsFormattedText, writeAsText, writeAsText

public CoGroupOperator(DataSet<I1> input1, DataSet<I2> input2, Keys<I1> keys1, Keys<I2> keys2, CoGroupFunction<I1,I2,OUT> function, TypeInformation<OUT> returnType, Partitioner<?> customPartitioner, String defaultName)

public CoGroupOperator(DataSet<I1> input1, DataSet<I2> input2, Keys<I1> keys1, Keys<I2> keys2, CoGroupFunction<I1,I2,OUT> function, TypeInformation<OUT> returnType, List<org.apache.commons.lang3.tuple.Pair<Integer,Order>> groupSortKeyOrderFirst, List<org.apache.commons.lang3.tuple.Pair<Integer,Order>> groupSortKeyOrderSecond, Partitioner<?> customPartitioner, String defaultName)

public CoGroupOperator<I1,I2,OUT> withPartitioner(Partitioner<?> partitioner)

Sets a custom partitioner for the CoGroup operation.

NOTE: A custom partitioner can only be used with single-field CoGroup keys, not with composite CoGroup keys.

Parameters:
partitioner - The custom partitioner to be used.

public Partitioner<?> getPartitioner()

Gets the custom partitioner used by this CoGroup, or null, if none is set.

Returns: The custom partitioner, or null, if none is set.

protected CoGroupOperatorBase<?,?,OUT,?> translateToDataFlow(Operator<I1> input1, Operator<I2> input2)

Description copied from class: TwoInputOperator

Translates this Java API operator into a common API operator with two inputs.

Specified by: translateToDataFlow in class TwoInputOperator<I1,I2,OUT,CoGroupOperator<I1,I2,OUT>>

Parameters:
input1 - The first input of the operation, as a common API operator.
input2 - The second input of the operation, as a common API operator.

Copyright © 2015 The Apache Software Foundation. All rights reserved.