org.apache.crunch.impl.spark
Class SparkPipeline
java.lang.Object
  org.apache.crunch.impl.dist.DistributedPipeline
    org.apache.crunch.impl.spark.SparkPipeline
- All Implemented Interfaces:
- Pipeline
public class SparkPipeline
- extends DistributedPipeline
Methods inherited from class org.apache.crunch.impl.dist.DistributedPipeline
cleanup, createIntermediateOutput, createTempPath, enableDebug, getConfiguration, getFactory, getMaterializeSourceTarget, getName, getNextAnonymousStageId, read, read, readTextFile, setConfiguration, write, write, writeTextFile
SparkPipeline
public SparkPipeline(String sparkConnect,
String appName)
SparkPipeline
public SparkPipeline(org.apache.spark.api.java.JavaSparkContext sparkContext,
String appName)
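The two constructors differ only in how the Spark connection is supplied. The following is a minimal sketch, not taken from the Crunch documentation: the class name `SparkPipelineConstruction`, the application name, and the `"local"` master string are assumptions chosen for illustration, and the code needs the Crunch and Spark libraries on the classpath.

```java
import org.apache.crunch.Pipeline;
import org.apache.crunch.impl.spark.SparkPipeline;
import org.apache.spark.api.java.JavaSparkContext;

public class SparkPipelineConstruction {

  // Hypothetical application name, used only for this illustration.
  static final String APP_NAME = "crunch-spark-sketch";

  // Variant 1: pass a Spark connection string.
  // "local" is an assumed value; substitute the master for your cluster.
  static Pipeline fromMaster() {
    return new SparkPipeline("local", APP_NAME);
  }

  // Variant 2: wrap a JavaSparkContext that the caller created and configured.
  static Pipeline fromContext(JavaSparkContext sparkContext) {
    return new SparkPipeline(sparkContext, APP_NAME);
  }
}
```

The second variant may suit applications that already manage a `JavaSparkContext` and want the pipeline to use it.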
materialize
public <T> Iterable<T> materialize(PCollection<T> pcollection)
cache
public <T> void cache(PCollection<T> pcollection,
CachingOptions options)
run
public PipelineResult run()
runAsync
public PipelineExecution runAsync()
done
public PipelineResult done()
- Specified by:
- done in interface Pipeline
- Overrides:
- done in class DistributedPipeline
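The methods listed above are typically used together: build a PCollection, optionally cache it, materialize it to read results in the client, then call done. The sketch below shows one possible sequence under stated assumptions: the class `SparkPipelineLifecycle`, the `normalize` helper, the `NormalizeFn` function, the `"local"` master string, and the input path taken from `args[0]` are all hypothetical, and `CachingOptions.DEFAULT` is assumed to be an acceptable default. It requires the Crunch and Spark libraries on the classpath.

```java
import java.util.Locale;

import org.apache.crunch.CachingOptions;
import org.apache.crunch.MapFn;
import org.apache.crunch.PCollection;
import org.apache.crunch.PipelineResult;
import org.apache.crunch.impl.spark.SparkPipeline;
import org.apache.crunch.types.writable.Writables;

public class SparkPipelineLifecycle {

  // Pure helper (hypothetical) so the transformation can be checked without Spark.
  static String normalize(String line) {
    return line.trim().toLowerCase(Locale.ROOT);
  }

  // A static nested class avoids capturing an enclosing instance when the
  // function is serialized and shipped to workers.
  static class NormalizeFn extends MapFn<String, String> {
    @Override
    public String map(String line) {
      return normalize(line);
    }
  }

  public static void main(String[] args) {
    String inputPath = args[0]; // assumed: path to a text file

    SparkPipeline pipeline = new SparkPipeline("local", "lifecycle-sketch");

    // readTextFile is inherited from DistributedPipeline.
    PCollection<String> lines = pipeline.readTextFile(inputPath);
    PCollection<String> normalized =
        lines.parallelDo(new NormalizeFn(), Writables.strings());

    // Ask the pipeline to cache the collection before it is reused.
    pipeline.cache(normalized, CachingOptions.DEFAULT);

    // materialize returns an Iterable over the collection's elements.
    for (String line : pipeline.materialize(normalized)) {
      System.out.println(line);
    }

    // done finishes any remaining work and releases pipeline resources.
    PipelineResult result = pipeline.done();
    System.exit(result.succeeded() ? 0 : 1);
  }
}
```

Calling `run()` instead of `done()` would leave the pipeline available for further work, and `runAsync()` returns a `PipelineExecution` handle rather than blocking for the result.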
Copyright © 2013 The Apache Software Foundation. All Rights Reserved.