public class CoreNlpLanguageModel extends LanguageModel
TOKEN_LENGTH_LIMIT| Constructor and Description |
|---|
CoreNlpLanguageModel(edu.stanford.nlp.pipeline.WordsToSentencesAnnotator splitter,
edu.stanford.nlp.pipeline.TokenizerAnnotator tokenizer,
edu.stanford.nlp.pipeline.POSTaggerAnnotator tagger,
org.tartarus.snowball.SnowballProgram stemmer) |
| Modifier and Type | Method and Description |
|---|---|
static LanguageModel |
englishModel() |
static LanguageModel |
germanModel() |
String |
getNodeKey(String text,
String pos)
Prepare a stable key for a graph node (stemmed, lemmatized)
from a token.
|
boolean |
isAdjective(String pos)
Determine whether the given PoS tag is an adjective.
|
boolean |
isNoun(String pos)
Determine whether the given PoS tag is a noun.
|
Sentence |
parseSentence(String text)
Tokenize and tag the sentence text.
|
String[] |
splitParagraph(String text)
Split sentences within the paragraph text.
|
String |
stemToken(String token)
Perform stemming on the given token.
|
buildLanguage, isRelevant, scrubTokenpublic CoreNlpLanguageModel(edu.stanford.nlp.pipeline.WordsToSentencesAnnotator splitter,
edu.stanford.nlp.pipeline.TokenizerAnnotator tokenizer,
edu.stanford.nlp.pipeline.POSTaggerAnnotator tagger,
org.tartarus.snowball.SnowballProgram stemmer)
public static LanguageModel englishModel()
public static LanguageModel germanModel()
public String[] splitParagraph(String text)
LanguageModelsplitParagraph in class LanguageModelpublic Sentence parseSentence(String text)
LanguageModelparseSentence in class LanguageModelpublic String getNodeKey(String text, String pos)
LanguageModelgetNodeKey in class LanguageModelpublic boolean isNoun(String pos)
LanguageModelisNoun in class LanguageModelpublic boolean isAdjective(String pos)
LanguageModelisAdjective in class LanguageModelpublic String stemToken(String token)
LanguageModelstemToken in class LanguageModelCopyright © 2017. All rights reserved.