Package org.apache.solr.hadoop.dedup

Dedupe related code.

See:
          Description

Interface Summary
UpdateConflictResolver Interface that enables deduplication and ordering of a series of document updates for the same unique document key.
 

Class Summary
NoChangeUpdateConflictResolver UpdateConflictResolver implementation that returns the solr documents in the same order as they are received on input, i.e.
RejectingUpdateConflictResolver UpdateConflictResolver implementation that rejects multiple documents with the same key with an exception.
RetainMostRecentUpdateConflictResolver UpdateConflictResolver implementation that ignores all but the most recent document version, based on a configurable numeric Solr field, which defaults to the file_last_modified timestamp.
SolrInputDocumentComparator Default mechanism of determining which of two Solr documents with the same key is the more recent version.
SolrInputDocumentComparator.TimeStampComparator  
SortingUpdateConflictResolver UpdateConflictResolver implementation that orders colliding updates ascending from least recent to most recent (partial) update, based on a configurable numeric Solr field, which defaults to the file_last_modified timestamp.
 

Package org.apache.solr.hadoop.dedup Description

Dedupe related code.



Copyright © 2000-2014 Apache Software Foundation. All Rights Reserved.