Package opennlp.tools.doccat
Class DoccatCrossValidator
- java.lang.Object
-
- opennlp.tools.doccat.DoccatCrossValidator
-
public class DoccatCrossValidator extends Object
Cross validator forDocumentCategorizer.
-
-
Constructor Summary
Constructors Constructor Description DoccatCrossValidator(String languageCode, TrainingParameters mlParams, DoccatFactory factory, DoccatEvaluationMonitor... listeners)Instantiates aDoccatCrossValidatorwith the givengenerators.
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidevaluate(ObjectStream<DocumentSample> samples, int nFolds)Starts the evaluation.doublegetDocumentAccuracy()longgetDocumentCount()
-
-
-
Constructor Detail
-
DoccatCrossValidator
public DoccatCrossValidator(String languageCode, TrainingParameters mlParams, DoccatFactory factory, DoccatEvaluationMonitor... listeners)
Instantiates aDoccatCrossValidatorwith the givengenerators.- Parameters:
languageCode- An ISO conform language code.mlParams- TheTrainingParametersfor the context of cross validation.factory- TheDoccatFactoryfor creating related objects.listeners- theevaluation listeners.
-
-
Method Detail
-
evaluate
public void evaluate(ObjectStream<DocumentSample> samples, int nFolds) throws IOException
Starts the evaluation.- Parameters:
samples- TheObjectStreamofsamplesto train and test with.nFolds- Number of folds. It must be greater than zero.- Throws:
IOException- Thrown if IO errors occurred.
-
getDocumentAccuracy
public double getDocumentAccuracy()
- Returns:
- Retrieves the accuracy for all iterations.
-
getDocumentCount
public long getDocumentCount()
- Returns:
- Retrieves the number of words which where validated over all iterations. The result is the amount of folds multiplied by the total number of words.
-
-