| Class | Description |
|---|---|
| IntIntPairWritable | |
| TextDoublePairWritable | |
| TextIntIntIntWritable | |
| TextIntPairWritable | |
| TextTextPairWritable | |
| TfIdfCalculatorJob |
Job that will calculate tf-idf based on the output of the
WordCorpusFrequencyJob. |
| TfIdfCalculatorJob.DocumentVectorizerReducer |
Calculate the sparse vector with TF-IDF.
|
| WordCorpusFrequencyJob |
MapReduce job that calculates the word frequency over all documents by
inverting document->words and writing the sum of the assigned documents per
word and its document.
|
| WordCorpusFrequencyJob.DocumentSumReducer |
Sums up all the documents per token index by docID.
|
| WordCorpusFrequencyJob.TokenMapper |
Write a token with its document id.
|
| WordCountJob |
MapReduce job that calculates the token frequency by an improved word count.
|
| WordCountJob.WordFrequencyMapper |
Group the tokens in memory for each chunk, write it in the cleanup step.
|
| WordCountJob.WordFrequencyReducer |
Group the tokens by reducing the mappers output and summing the sums for
each token.
|
| Enum | Description |
|---|---|
| WordCorpusFrequencyJob.WordCorpusCounter |
Copyright © 2016. All rights reserved.