|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectopennlp.tools.sentdetect.SentenceDetectorME
public class SentenceDetectorME
A sentence detector for splitting up raw text into sentences.
A maximum entropy model is used to evaluate the characters ".", "!", and "?" in a string to determine if they signify the end of a sentence.
| Field Summary | |
|---|---|
static String |
NO_SPLIT
Constant indicates no sentence split. |
static String |
SPLIT
Constant indicates a sentence split. |
| Constructor Summary | |
|---|---|
SentenceDetectorME(SentenceModel model)
Initializes the current instance. |
|
SentenceDetectorME(SentenceModel model,
Factory factory)
|
|
| Method Summary | |
|---|---|
double[] |
getSentenceProbabilities()
Returns the probabilities associated with the most recent calls to sentDetect(). |
String[] |
sentDetect(String s)
Detect sentences in a String. |
Span[] |
sentPosDetect(String s)
Detect the position of the first words of sentences in a String. |
static SentenceModel |
train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations)
|
static SentenceModel |
train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations,
int cutoff,
int iterations)
Deprecated. use train(String, ObjectStream, boolean, Dictionary, TrainingParameters)
instead and pass in a TrainingParameters object. |
static SentenceModel |
train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations,
TrainingParameters mlParams)
|
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final String SPLIT
public static final String NO_SPLIT
| Constructor Detail |
|---|
public SentenceDetectorME(SentenceModel model)
model - the SentenceModel
public SentenceDetectorME(SentenceModel model,
Factory factory)
| Method Detail |
|---|
public String[] sentDetect(String s)
sentDetect in interface SentenceDetectors - The string to be processed.
public Span[] sentPosDetect(String s)
sentPosDetect in interface SentenceDetectors - The string to be processed.
public double[] getSentenceProbabilities()
public static SentenceModel train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations,
TrainingParameters mlParams)
throws IOException
IOException
@Deprecated
public static SentenceModel train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations,
int cutoff,
int iterations)
throws IOException
train(String, ObjectStream, boolean, Dictionary, TrainingParameters)
instead and pass in a TrainingParameters object.
IOException
public static SentenceModel train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations)
throws IOException
IOException
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||