@ThreadSafe public class DocumentVectorSimilarity extends Object implements Serializable
| Modifier | Constructor and Description |
|---|---|
protected |
DocumentVectorSimilarity(Map<Locale,DocumentVector> referenceVectors,
Map<Locale,de.l3s.icrawl.contentanalysis.LanguageModel.KeywordMatcher> matchers,
Locale defaultLanguage,
Map<Locale,Double> correctionFactors) |
|
DocumentVectorSimilarity(Map<String,Locale> referenceDocumentsToLanguage,
Set<String> keywords,
Set<NamedEntity> entities,
int maxTerms,
boolean useDF,
Locale defaultLanguage,
LanguageModels languageModels)
Create a SpecSimilarity Object that is serializable and stores a
reference to the input's document collection.
|
| Modifier and Type | Method and Description |
|---|---|
static DocumentVectorSimilarity |
fromVectors(Map<Locale,DocumentVector> referenceVectors,
Map<Locale,Set<String>> keywords,
Locale defaultLanguage,
LanguageModels languageModels,
Map<Locale,Double> correctionFactors) |
Map<Locale,Double> |
getCorrectionFactors() |
Map<Locale,de.l3s.icrawl.contentanalysis.LanguageModel.KeywordMatcher> |
getMatchers() |
Map<Locale,DocumentVector> |
getReferenceVectors() |
double |
getSimilarity(Locale language,
String text) |
void |
setLanguageModels(LanguageModels languageModels) |
String |
toString() |
public DocumentVectorSimilarity(Map<String,Locale> referenceDocumentsToLanguage, Set<String> keywords, Set<NamedEntity> entities, int maxTerms, boolean useDF, Locale defaultLanguage, LanguageModels languageModels)
referenceDocumentsToLanguage - the document collection (each mapping to its language)public static DocumentVectorSimilarity fromVectors(Map<Locale,DocumentVector> referenceVectors, Map<Locale,Set<String>> keywords, Locale defaultLanguage, LanguageModels languageModels, Map<Locale,Double> correctionFactors)
public void setLanguageModels(LanguageModels languageModels)
public Map<Locale,DocumentVector> getReferenceVectors()
public Map<Locale,de.l3s.icrawl.contentanalysis.LanguageModel.KeywordMatcher> getMatchers()
Copyright © 2017. All rights reserved.