public class GpuColumnBatch extends Object
| Constructor and Description |
|---|
| GpuColumnBatch(ai.rapids.cudf.Table table, org.apache.spark.sql.types.StructType schema) |
| Modifier and Type | Method and Description |
|---|---|
| long | getColumn(int index) |
| ai.rapids.cudf.ColumnVector | getColumnVector(int index) |
| int | getNumColumns() |
| long | getNumRows() |
| static ai.rapids.cudf.DType | getRapidsType(org.apache.spark.sql.types.DataType type) |
| org.apache.spark.sql.types.StructType | getSchema() |
| long[] | groupByColumnWithAggregation(int groupIndex, int oneIndex, boolean checkEqual)<br>This is used to group the CUDF Dataset by column 'groupIndex', and merge all the rows into one row in each group based on column 'oneIndex'. |
| int[] | groupByColumnWithCountHost(int groupIndex) |
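
The description of groupByColumnWithAggregation (group by one column, keep one value per group from another column, optionally check that the values within a group agree) can be illustrated with a small host-side sketch in plain Java. This is not the GpuColumnBatch implementation and does not use cuDF. The class name `GroupOneValueSketch`, the method `oneValuePerGroup`, the use of plain arrays for columns, the first-seen output order, and throwing an exception when `checkEqual` finds a mismatch are all assumptions made for illustration; how the real method orders its result or reports a mismatch is not stated in this page.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical host-side illustration; NOT the GpuColumnBatch implementation.
final class GroupOneValueSketch {

    // groupIds plays the role of column 'groupIndex', values the role of column 'oneIndex'.
    // Returns one value per distinct group id, in order of first appearance (an assumption).
    static long[] oneValuePerGroup(int[] groupIds, long[] values, boolean checkEqual) {
        if (groupIds.length != values.length) {
            throw new IllegalArgumentException("columns must have the same number of rows");
        }
        Map<Integer, Long> firstValue = new LinkedHashMap<>();
        for (int row = 0; row < groupIds.length; row++) {
            // putIfAbsent returns the value already stored for this group, or null if none.
            Long seen = firstValue.putIfAbsent(groupIds[row], values[row]);
            // Assumption: a mismatch is reported by throwing; the real behavior is unknown.
            if (checkEqual && seen != null && seen != values[row]) {
                throw new IllegalStateException(
                    "group " + groupIds[row] + " holds differing values");
            }
        }
        return firstValue.values().stream().mapToLong(Long::longValue).toArray();
    }

    public static void main(String[] args) {
        int[] groupIds = {1, 1, 2, 2, 2};
        long[] values = {10L, 10L, 20L, 20L, 20L};
        System.out.println(Arrays.toString(oneValuePerGroup(groupIds, values, true)));
    }
}
```

With `checkEqual` set to false, the sketch simply keeps the first value it sees for each group and ignores later rows of that group.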
public GpuColumnBatch(ai.rapids.cudf.Table table,
org.apache.spark.sql.types.StructType schema)
public org.apache.spark.sql.types.StructType getSchema()
public long getNumRows()
public int getNumColumns()
public ai.rapids.cudf.ColumnVector getColumnVector(int index)
public long getColumn(int index)
public int[] groupByColumnWithCountHost(int groupIndex)
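
This page gives no description for groupByColumnWithCountHost beyond its signature, so the following is only a guess at the kind of result an `int[]` of per-group counts could hold. The sketch counts rows per group on the host in plain Java, assuming that rows of the same group are stored contiguously. The class name `GroupCountSketch`, the method `countContiguousGroups`, and the contiguity assumption are hypothetical and are not taken from the library.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical host-side illustration; NOT the GpuColumnBatch implementation.
final class GroupCountSketch {

    // Assumption: rows belonging to the same group are adjacent in the column.
    // Returns the length of each run of equal group ids, in row order.
    static int[] countContiguousGroups(int[] groupIds) {
        List<Integer> counts = new ArrayList<>();
        int runLength = 0;
        for (int row = 0; row < groupIds.length; row++) {
            // A change in group id closes the current run.
            if (row > 0 && groupIds[row] != groupIds[row - 1]) {
                counts.add(runLength);
                runLength = 0;
            }
            runLength++;
        }
        if (runLength > 0) {
            counts.add(runLength); // close the final run
        }
        return counts.stream().mapToInt(Integer::intValue).toArray();
    }

    public static void main(String[] args) {
        int[] groupIds = {7, 7, 3, 3, 3, 9};
        System.out.println(Arrays.toString(countContiguousGroups(groupIds))); // prints [2, 3, 1]
    }
}
```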
public long[] groupByColumnWithAggregation(int groupIndex,
int oneIndex,
boolean checkEqual)
Parameters:
groupIndex - The index of column to group by
oneIndex - The index of column to get one value in each group
checkEqual - Whether to check all the values in one group are the same
public static ai.rapids.cudf.DType getRapidsType(org.apache.spark.sql.types.DataType type)
Copyright © 2019. All rights reserved.