public final class SpiderEngine extends Object
| 限定符和类型 | 方法和说明 |
|---|---|
SpiderEngine |
addHBase(Process hbase)
增加HBase下载流程
|
void |
build()
构建爬虫参数
|
static SpiderEngine |
create() |
SpiderEngine |
setClusterNode(ClusterNode clusterNode)
设置cluster节点初始化信息,在分布式情况下调用
|
SpiderEngine |
setConfiguration(Configuration configuration)
设置configuration配置
|
SpiderEngine |
setDynamicField(DynamicField dynamicField)
设置动态字段映射
|
SpiderEngine |
setDynamicFields(List<DynamicField> dynamicFields)
设置动态字段映射列表
|
SpiderEngine |
setResourceSync(ResourceSync resourceSync)
设置资源同步器
|
SpiderEngine |
setSeed(Seed seed)
设置种子Seed
|
SpiderEngine |
setSeeds(List<Seed> seeds)
设置种子Seed列表
|
void |
startUp(Seed seed,
WorkerStatusOpt workerStatusOpt,
boolean isProbeMaster)
启动一个种子任务
|
public static SpiderEngine create()
public SpiderEngine addHBase(Process hbase)
hbase - Processpublic SpiderEngine setClusterNode(ClusterNode clusterNode)
clusterNode - cluster节点public void build()
public SpiderEngine setSeed(Seed seed)
seed - Seedpublic SpiderEngine setSeeds(List<Seed> seeds)
seeds - Listpublic SpiderEngine setConfiguration(Configuration configuration)
configuration - Configurationpublic SpiderEngine setResourceSync(ResourceSync resourceSync)
resourceSync - ResourceSyncpublic SpiderEngine setDynamicField(DynamicField dynamicField)
dynamicField - DynamicFieldpublic SpiderEngine setDynamicFields(List<DynamicField> dynamicFields)
dynamicFields - Listpublic void startUp(Seed seed, WorkerStatusOpt workerStatusOpt, boolean isProbeMaster)
seed - SeedworkerStatusOpt - 分布式下的工作状态接口isProbeMaster - 集群环境下本机是否为probe masterCopyright © 2018. All rights reserved.