Package org.apache.solr.hadoop

MapReduceIndexerTool and related code.

Interface Summary
HdfsFileFieldNames Solr field names for metadata of an HDFS file.
 

Class Summary
DataInputInputStream An InputStream that wraps a DataInput.
DataOutputOutputStream OutputStream implementation that wraps a DataOutput.
HeartBeater Runs a background thread that checks every 60 seconds whether a progress report is needed.
LineRandomizerMapper MR Mapper that randomizes a list of URLs.
LineRandomizerReducer MR Reducer that randomizes a list of URLs.
MapReduceIndexerTool Public API for a MapReduce batch job driver that creates a set of Solr index shards from a set of input files and writes the indexes into HDFS, in a flexible, scalable and fault-tolerant manner.
PathArgumentType ArgumentType subclass for HDFS Path type, using fluent style API.
PathParts Extracts various components of an HDFS Path.
SolrCloudPartitioner MapReduce partitioner that partitions the Mapper output such that each SolrInputDocument gets sent to the SolrCloud shard that it would have been sent to if the document were ingested via the standard SolrCloud Near Real Time (NRT) API.
SolrInputDocumentWritable  
SolrMapper<KEYIN,VALUEIN>  
SolrOutputFormat<K,V>  
SolrReducer This class loads the mapper's SolrInputDocuments into one EmbeddedSolrServer per reducer.
TreeMergeMapper For the core logic, see TreeMergeOutputFormat.
TreeMergeOutputFormat See IndexMergeTool.
UnbufferedDataInputInputStream  
Utils  
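
The DataInputInputStream entry in the summary above describes an InputStream that wraps a DataInput. The following is a minimal sketch of that adapter idea using only java.io. The class name DataInputAdapter is hypothetical and the code is not taken from the Solr sources, so the real class may differ (for example, it may also override the bulk read methods).

```java
import java.io.ByteArrayInputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;

/**
 * Hypothetical adapter (an illustration, not the Solr implementation):
 * exposes a DataInput through the InputStream API.
 */
class DataInputAdapter extends InputStream {
    private final DataInput in;

    DataInputAdapter(DataInput in) {
        this.in = in;
    }

    @Override
    public int read() throws IOException {
        try {
            // readUnsignedByte() yields 0..255, the same range InputStream.read() uses.
            return in.readUnsignedByte();
        } catch (EOFException e) {
            // DataInput reports end of input with an exception; InputStream uses -1.
            return -1;
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] data = {1, 2, (byte) 0xFF};
        DataInput source = new DataInputStream(new ByteArrayInputStream(data));
        InputStream stream = new DataInputAdapter(source);
        int b;
        while ((b = stream.read()) != -1) {
            System.out.println(b); // prints 1, 2, 255 on separate lines
        }
    }
}
```

Only the single-byte read() is overridden here; the inherited read(byte[], int, int) falls back to it, which is simple but slower than delegating to DataInput.readFully for bulk reads.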
 

Enum Summary
SolrCounters  
 

Package org.apache.solr.hadoop Description

MapReduceIndexerTool and related code.



Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.