| Package | Description |
|---|---|
| de.jungblut.nlp | |
| de.jungblut.nlp.mr |
| Class and Description |
|---|
| DocumentSimilarity
Simply distance measure wrapper for debug string similarity measuring.
|
| MarkovChain
Markov chain, that can "learn" the state transition probabilities by a given
input and returns the probability for a given sequence of states.
|
| MinHash
Linear MinHash algorithm to find near duplicates faster or to speedup nearest
neighbour searches.
|
| MinHash.HashType |
| Tokenizer
Standard tokenizer interface.
|
| Class and Description |
|---|
| Tokenizer
Standard tokenizer interface.
|
Copyright © 2016. All rights reserved.