java.lang.Object
edu.cmu.minorthird.text.AbstractTextBase
edu.cmu.minorthird.text.MutableTextBase
edu.cmu.minorthird.text.BasicTextBase
public class BasicTextBase
Maintains information about the contents of a set of documents. Specifically, it holds the character sequences (TextTokens) drawn from those documents, typically produced by tokenization.
Field Summary |
---|
Fields inherited from class edu.cmu.minorthird.text.AbstractTextBase |
---|
tokenizer |
Constructor Summary | |
---|---|
BasicTextBase()
Default constructor creates a new TextBase with the default Tokenizer. |
|
BasicTextBase(Tokenizer t)
Constructor that specifies a custom Tokenizer to be used with this TextBase. |
Method Summary | |
---|---|
Span |
documentSpan(java.lang.String documentId)
Returns a Span instance that encloses all of the tokens in the document specified by documentId. |
java.util.Iterator<Span> |
documentSpanIterator()
Returns an iterator over Span instances that includes a document span for every document in this TextBase. |
Document |
getDocument(java.lang.String documentId)
Returns the Document instance that corresponds to the specified documentId or null if no document exists with the specified documentId. |
void |
loadDocument(java.lang.String documentId, java.lang.String documentString)
Adds a document to this TextBase with documentId as its identifier and with text specified by documentString. |
void |
loadDocument(java.lang.String documentId, java.lang.String documentString, int charOffset)
Adds a document to this TextBase with documentId as its identifier and with text specified by documentString. |
static void |
main(java.lang.String[] args)
|
void |
setDocumentGroupId(java.lang.String documentId, java.lang.String documentGroupId)
Sets the document group id for the specified documentId to the specified document group id. |
int |
size()
Returns the number of documents currently in this TextBase. |
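
The contract in the method summary above (load a document under an id, look it up by id with null for an unknown id, count the loaded documents) can be illustrated with a small self-contained stand-in. This is only a sketch: `ToyTextBase` and its `documentTokens` helper are hypothetical and are not MinorThird code, the whitespace split is an assumption standing in for the library's default Tokenizer, and `getDocument` returns the raw text here rather than a `Document` instance.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in that mirrors a few documented signatures.
// It is NOT the MinorThird BasicTextBase implementation.
class ToyTextBase {
    // documentId -> original text, kept in load order
    private final Map<String, String> textById = new LinkedHashMap<>();
    // documentId -> tokens produced when the document was loaded
    private final Map<String, List<String>> tokensById = new LinkedHashMap<>();

    // Adds a document under documentId and tokenizes it.
    // Assumption: tokens are whitespace-separated runs of characters.
    void loadDocument(String documentId, String documentString) {
        List<String> tokens = new ArrayList<>();
        for (String piece : documentString.trim().split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        textById.put(documentId, documentString);
        tokensById.put(documentId, tokens);
    }

    // Returns the stored text, or null if no document has this id.
    String getDocument(String documentId) {
        return textById.get(documentId);
    }

    // Hypothetical helper: the tokens of one document, empty if the id is unknown.
    List<String> documentTokens(String documentId) {
        List<String> tokens = tokensById.get(documentId);
        return tokens == null ? Collections.emptyList() : Collections.unmodifiableList(tokens);
    }

    // Returns the number of documents loaded so far.
    int size() {
        return textById.size();
    }

    public static void main(String[] args) {
        ToyTextBase base = new ToyTextBase();
        base.loadDocument("doc1", "The quick brown fox");
        base.loadDocument("doc2", "jumps over the lazy dog");
        System.out.println("documents: " + base.size());
        System.out.println("doc1 tokens: " + base.documentTokens("doc1"));
        System.out.println("unknown id: " + base.getDocument("nope"));
    }
}
```

With the real library, the corresponding calls would be `loadDocument`, `getDocument`, `documentSpan` and `size` on a `BasicTextBase` instance, with tokenization delegated to the configured Tokenizer.
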
Methods inherited from class edu.cmu.minorthird.text.AbstractTextBase |
---|
getTokenizer |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public BasicTextBase()
public BasicTextBase(Tokenizer t)
Method Detail |
---|
public void loadDocument(java.lang.String documentId, java.lang.String documentString)
loadDocument in class MutableTextBase
public void loadDocument(java.lang.String documentId, java.lang.String documentString, int charOffset)
loadDocument in class MutableTextBase
public void setDocumentGroupId(java.lang.String documentId, java.lang.String documentGroupId)
setDocumentGroupId in class MutableTextBase
public int size()
size in interface TextBase
size in class MutableTextBase
public Document getDocument(java.lang.String documentId)
getDocument in interface TextBase
getDocument in class MutableTextBase
public Span documentSpan(java.lang.String documentId)
documentSpan in interface TextBase
documentSpan in class MutableTextBase
public java.util.Iterator<Span> documentSpanIterator()
documentSpanIterator in interface TextBase
documentSpanIterator in class MutableTextBase
public static void main(java.lang.String[] args)