public class NigamActiveLearning extends NBClassifier
Nigam Active Learning is a Naive Bayes classifier that can use unlabeled documents during text classification.
Please refer to the following paper for details:
Nigam, K., McCallum, A., Thrun, S., Mitchell, T. Text Classification from Labeled and Unlabeled Documents using EM. Machine Learning, Volume 39, Issue 2-3 (May-June 2000), pp. 103-134.
Copyright: Copyright (c) 2005
Company: IST, Drexel University
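The idea in the cited paper can be illustrated with a small, self-contained sketch: a multinomial Naive Bayes model is first estimated from the labeled documents, then EM alternates between assigning soft class memberships to the unlabeled documents (E-step) and re-estimating the model from all documents (M-step). This is not this class's actual implementation. The class name `SemiSupervisedNBSketch`, the term-count array representation, and the `unlabeledWeight` parameter are assumptions made for illustration; whether `unlabeledWeight` corresponds to the constructor's `unlabeledRate` is not confirmed by this page.

```java
import java.util.Arrays;

/** Illustrative semi-supervised multinomial Naive Bayes trained with EM (hypothetical names). */
final class SemiSupervisedNBSketch {
    private final int classNum;
    private final int vocabSize;
    private final double[] logPrior;      // log P(c)
    private final double[][] logTermProb; // log P(w | c)

    public SemiSupervisedNBSketch(int classNum, int vocabSize) {
        this.classNum = classNum;
        this.vocabSize = vocabSize;
        this.logPrior = new double[classNum];
        this.logTermProb = new double[classNum][vocabSize];
    }

    /** Fits on labeled docs, then refines the model with EM over the unlabeled docs. */
    public void train(int[][] labeled, int[] labels, int[][] unlabeled,
                      double unlabeledWeight, int iterations) {
        int nL = labeled.length;
        int[][] docs = new int[nL + unlabeled.length][];
        double[][] resp = new double[docs.length][classNum]; // soft class memberships
        for (int i = 0; i < nL; i++) {
            docs[i] = labeled[i];
            resp[i][labels[i]] = 1.0; // labeled docs keep a fixed, hard membership
        }
        System.arraycopy(unlabeled, 0, docs, nL, unlabeled.length);
        mStep(docs, resp); // initial model: unlabeled memberships are still all zero
        for (int it = 0; it < iterations; it++) {
            for (int j = 0; j < unlabeled.length; j++) { // E-step on unlabeled docs
                double[] p = posterior(unlabeled[j]);
                for (int c = 0; c < classNum; c++) {
                    resp[nL + j][c] = unlabeledWeight * p[c]; // assumed down-weighting
                }
            }
            mStep(docs, resp); // M-step on labeled and unlabeled docs together
        }
    }

    /** M-step: re-estimates priors and term probabilities with Laplace smoothing. */
    private void mStep(int[][] docs, double[][] resp) {
        double[] classMass = new double[classNum];
        double[][] termMass = new double[classNum][vocabSize];
        double total = 0.0;
        for (int d = 0; d < docs.length; d++) {
            for (int c = 0; c < classNum; c++) {
                double r = resp[d][c];
                classMass[c] += r;
                total += r;
                for (int w = 0; w < vocabSize; w++) {
                    termMass[c][w] += r * docs[d][w];
                }
            }
        }
        for (int c = 0; c < classNum; c++) {
            logPrior[c] = Math.log((1.0 + classMass[c]) / (classNum + total));
            double termTotal = 0.0;
            for (int w = 0; w < vocabSize; w++) {
                termTotal += termMass[c][w];
            }
            for (int w = 0; w < vocabSize; w++) {
                logTermProb[c][w] = Math.log((1.0 + termMass[c][w]) / (vocabSize + termTotal));
            }
        }
    }

    /** Returns P(c | doc) under the current model, computed in log space for stability. */
    public double[] posterior(int[] termCounts) {
        double[] p = new double[classNum];
        double max = Double.NEGATIVE_INFINITY;
        for (int c = 0; c < classNum; c++) {
            p[c] = logPrior[c];
            for (int w = 0; w < vocabSize; w++) {
                p[c] += termCounts[w] * logTermProb[c][w];
            }
            max = Math.max(max, p[c]);
        }
        double sum = 0.0;
        for (int c = 0; c < classNum; c++) {
            p[c] = Math.exp(p[c] - max);
            sum += p[c];
        }
        for (int c = 0; c < classNum; c++) {
            p[c] /= sum;
        }
        return p;
    }

    /** Returns the index of the most probable class (lowest index wins ties). */
    public int classify(int[] termCounts) {
        double[] p = posterior(termCounts);
        int best = 0;
        for (int c = 1; c < classNum; c++) {
            if (p[c] > p[best]) best = c;
        }
        return best;
    }

    public static void main(String[] args) {
        int[][] labeled = {{3, 0, 0, 0}, {0, 0, 3, 0}};
        int[] labels = {0, 1};
        int[][] unlabeled = {{2, 2, 0, 0}, {1, 3, 0, 0}, {0, 0, 2, 2}, {0, 0, 1, 3}};
        SemiSupervisedNBSketch nb = new SemiSupervisedNBSketch(2, 4);
        nb.train(labeled, labels, unlabeled, 1.0, 5);
        // Term 3 never occurs in the labeled docs; EM picks it up from the unlabeled ones.
        System.out.println(Arrays.toString(nb.posterior(new int[] {0, 0, 0, 2})));
    }
}
```

In the toy data, term 3 never appears in a labeled document, so a model trained on labeled data alone cannot tell the classes apart for a document containing only that term. After EM over the unlabeled documents, where term 3 co-occurs with term 2, the same document is assigned to class 1.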
Fields inherited from class NBClassifier: classPrior, lastClassProb, model
Fields inherited from class AbstractClassifier: arrLabel, classNum, doctermMatrix, featureSelector, indexReader, validatingDocSet

| Constructor and Description |
|---|
| NigamActiveLearning(IndexReader indexReader, double unlabeledRate) |
| NigamActiveLearning(String modelFile) |
| Modifier and Type | Method and Description |
|---|---|
| DocClassSet | classify(DocClassSet trainingDocSet, DocClass testingDocs): This method trains the classifier with the training document set and then uses the trained model to classify the testing documents. |
| int | classify(IRDoc curDoc): Classifies one particular document. |
| void | setUnlabeledData(DocClass docSet) |
| void | setUnlabeledData(IndexReader newIndexReader, DocClass docSet) |
| void | train(DocClassSet trainingDocSet): This method trains the classifier with the training document set. |
Methods inherited from class NBClassifier: classify, getClassPrior, rank, saveModel
Methods inherited from class AbstractClassifier: classify, classify, getClassLabel, getDocTermMatrix, getFeatureSelector, getIndexReader, getRow, setFeatureSelector, train, trainFeatureSelector
public NigamActiveLearning(String modelFile)
public NigamActiveLearning(IndexReader indexReader, double unlabeledRate)
public void setUnlabeledData(IndexReader newIndexReader, DocClass docSet)
public void setUnlabeledData(DocClass docSet)
public DocClassSet classify(DocClassSet trainingDocSet, DocClass testingDocs)
Specified by: classify in interface Classifier
Overrides: classify in class AbstractClassifier
Parameters:
trainingDocSet - training document set
testingDocs - testing document set
public void train(DocClassSet trainingDocSet)
Specified by: train in interface Classifier
Overrides: train in class NBClassifier
Parameters:
trainingDocSet - training document set
public int classify(IRDoc curDoc)
Specified by: classify in interface Classifier
Overrides: classify in class NBClassifier
Parameters:
curDoc - the index of the document is stored in the IRDoc object
Copyright © 2018 JULIE Lab, Germany. All rights reserved.