edu.cmu.minorthird.ui
Class Recommended.SemiCRFAnnotatorLearner

java.lang.Object
  extended by edu.cmu.minorthird.text.learn.AnnotatorLearner
      extended by edu.cmu.minorthird.text.learn.SegmentAnnotatorLearner
          extended by edu.cmu.minorthird.ui.Recommended.SemiCRFAnnotatorLearner
Enclosing class:
Recommended

public static class Recommended.SemiCRFAnnotatorLearner
extends SegmentAnnotatorLearner

Learns a semi-Markov extension of conditional random fields (CRFs). Like the voted-perceptron semi-Markov model (SMM) classes, this learner allows features to describe properties of multi-token spans, rather than only properties of single tokens.

Reference: Sunita Sarawagi and William W. Cohen, Semi-Markov Conditional Random Fields for Information Extraction, in Advances in Neural Information Processing Systems (NIPS), 2004.
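
To make the distinction between token-level and span-level features concrete, here is a minimal, self-contained sketch. It does not use the MinorThird API: the class SpanFeatureSketch, its methods, and the feature names are all hypothetical and exist only for illustration. It shows that a span-level extractor can emit features such as the number of tokens in the span or the identity of the whole phrase, which cannot be computed from any single token alone.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; not part of the MinorThird API.
class SpanFeatureSketch {

    // Token-level features: computed from one token in isolation.
    static Map<String, Double> tokenFeatures(String token) {
        Map<String, Double> f = new LinkedHashMap<>();
        boolean cap = !token.isEmpty() && Character.isUpperCase(token.charAt(0));
        f.put("token.isCapitalized", cap ? 1.0 : 0.0);
        f.put("token.length", (double) token.length());
        return f;
    }

    // Span-level features: computed from tokens[start, end) taken as a whole.
    static Map<String, Double> spanFeatures(List<String> tokens, int start, int end) {
        Map<String, Double> f = new LinkedHashMap<>();
        List<String> span = tokens.subList(start, end);

        // The span length is only defined for a segment, not for a single token.
        f.put("span.numTokens", (double) span.size());

        // True only if every token in the span starts with an upper-case letter.
        boolean allCaps = true;
        for (String t : span) {
            allCaps &= !t.isEmpty() && Character.isUpperCase(t.charAt(0));
        }
        f.put("span.allCapitalized", allCaps ? 1.0 : 0.0);

        // The whole phrase as one feature, e.g. for a dictionary lookup.
        String phrase = String.join(" ", span).toLowerCase(Locale.ROOT);
        f.put("span.phrase=" + phrase, 1.0);
        return f;
    }

    public static void main(String[] args) {
        List<String> tokens =
            Arrays.asList("Visit", "Carnegie", "Mellon", "University", "today");
        System.out.println(tokenFeatures(tokens.get(1)));
        System.out.println(spanFeatures(tokens, 1, 4));
    }
}
```

In a token-level sequence model, each of "Carnegie", "Mellon", and "University" would be scored separately; a span-level model can score the three-token candidate as a single unit.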


Nested Class Summary
 
Nested classes/interfaces inherited from class edu.cmu.minorthird.text.learn.SegmentAnnotatorLearner
SegmentAnnotatorLearner.SegmentAnnotator
 
Field Summary
 
Fields inherited from class edu.cmu.minorthird.text.learn.SegmentAnnotatorLearner
annotationType, dataset, fe, learner, maxWindowSize
 
Constructor Summary
Recommended.SemiCRFAnnotatorLearner()
          Creates a learner with default settings, under which extracted entities must be at most 4 tokens long.
Recommended.SemiCRFAnnotatorLearner(int maxIters, int maxLen)
           
 
Method Summary
 
Methods inherited from class edu.cmu.minorthird.text.learn.SegmentAnnotatorLearner
getAnnotationType, getAnnotator, getCompressDataset, getCompressDatasetHelp, getDisplayDatasetBeforeLearning, getDisplayDatasetBeforeLearningHelp, getHistorySize, getSemiMarkovLearner, getSemiMarkovLearnerHelp, getSpanFeatureExtractor, hasNextQuery, nextQuery, reset, setAnnotationType, setAnswer, setCompressDataset, setDisplayDatasetBeforeLearning, setDocumentPool, setSemiMarkovLearner, setSpanFeatureExtractor
 
Methods inherited from class edu.cmu.minorthird.text.learn.AnnotatorLearner
getAnnotationTypeHelp, getSpanFeatureExtractorHelp
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

Recommended.SemiCRFAnnotatorLearner

public Recommended.SemiCRFAnnotatorLearner()
Creates a learner with default settings, under which extracted entities must be at most 4 tokens long.


Recommended.SemiCRFAnnotatorLearner

public Recommended.SemiCRFAnnotatorLearner(int maxIters,
                                           int maxLen)
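
The effect of a maximum entity length can be sketched with a simplified segment-level dynamic program. This is not the MinorThird implementation, and it assumes (the page does not confirm it) that maxLen is the bound on entity length in tokens. The class SegmentDecoderSketch, the SegmentScorer interface, and the toy scorer are hypothetical. The sketch also omits the label-transition scores a full semi-Markov CRF would use, and it treats non-entity tokens as unit-length segments. What it does show is why the bound matters: for each end position, only entity candidates of length 1 to maxLen are considered, so decoding cost grows with maxLen.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of the MinorThird API.
class SegmentDecoderSketch {

    // Scores the candidate segment tokens[start, end) as entity or non-entity.
    interface SegmentScorer {
        double score(int start, int end, boolean isEntity);
    }

    // Returns the entity segments {start, end} of the best-scoring segmentation.
    static List<int[]> decode(int n, int maxLen, SegmentScorer scorer) {
        double[] best = new double[n + 1];        // best score of a segmentation of tokens[0, end)
        int[] backStart = new int[n + 1];         // start of the last segment in that segmentation
        boolean[] backEntity = new boolean[n + 1];

        for (int end = 1; end <= n; end++) {
            // Option 1: the last token forms a unit-length non-entity segment.
            best[end] = best[end - 1] + scorer.score(end - 1, end, false);
            backStart[end] = end - 1;
            backEntity[end] = false;

            // Option 2: an entity of length 1..maxLen ends at this position.
            for (int len = 1; len <= maxLen && len <= end; len++) {
                int start = end - len;
                double s = best[start] + scorer.score(start, end, true);
                if (s > best[end]) {
                    best[end] = s;
                    backStart[end] = start;
                    backEntity[end] = true;
                }
            }
        }

        // Follow the back-pointers from the end of the sequence.
        List<int[]> entities = new ArrayList<>();
        for (int end = n; end > 0; end = backStart[end]) {
            if (backEntity[end]) {
                entities.add(0, new int[] {backStart[end], end});
            }
        }
        return entities;
    }

    // Toy scorer: +1 per capitalized token, -2 per other token,
    // minus a fixed cost of 0.5 per entity segment; non-entities score 0.
    static SegmentScorer capitalizationScorer(String[] tokens) {
        return (start, end, isEntity) -> {
            if (!isEntity) return 0.0;
            double s = -0.5;
            for (int i = start; i < end; i++) {
                s += Character.isUpperCase(tokens[i].charAt(0)) ? 1.0 : -2.0;
            }
            return s;
        };
    }

    public static void main(String[] args) {
        String[] tokens = {"we", "visited", "Carnegie", "Mellon", "University", "today"};
        for (int[] seg : decode(tokens.length, 4, capitalizationScorer(tokens))) {
            System.out.println("entity tokens [" + seg[0] + ", " + seg[1] + ")");
        }
    }
}
```

With this toy scorer, a bound of 4 lets the three capitalized tokens be extracted as one entity, while a bound of 2 forces the decoder to split them into shorter segments.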