public interface TextBase
Maintains information about the contents of a set of documents. Specifically, a TextBase holds the character sequences (TextTokens) drawn from those documents, typically produced by tokenization.
Method Summary | |
---|---|
Span | documentSpan(java.lang.String documentId): Looks up the document Span for the given documentId. |
java.util.Iterator<Span> | documentSpanIterator(): Returns an iterator over the documents in this TextBase. |
Document | getDocument(java.lang.String docID): Returns the Document with the given ID. |
Tokenizer | getTokenizer(): Returns the Tokenizer used on the documents in this text base. |
int | size(): Returns the number of documents contained in this TextBase. |
Method Detail |
---|
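Tokenizer getTokenizer()
Returns the Tokenizer used on the documents in this text base.
int size()
Returns the number of documents contained in this TextBase.
Document getDocument(java.lang.String docID)
Returns the Document with the given ID.
java.util.Iterator<Span> documentSpanIterator()
Returns an iterator over the documents in this TextBase.
Span documentSpan(java.lang.String documentId)
Looks up the document Span for the given documentId.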
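As an illustration of how size(), documentSpanIterator() and documentSpan() might be used together, here is a minimal sketch. It does not use the library's own classes. The nested Span and TextBase interfaces are stand-ins declared only so the example compiles on its own. The accessors getDocumentId() and asString(), the inMemory() factory, and the documentIds() helper are all hypothetical. The stand-in returns null for an unknown ID, which is an assumption; the documentation above does not say what the real implementation does.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class TextBaseSketch {

    // Hypothetical stand-in for the library's Span type; only what this sketch needs.
    public interface Span {
        String getDocumentId();
        String asString();
    }

    // Subset of the TextBase methods documented above, redeclared so the sketch is self-contained.
    public interface TextBase {
        int size();
        Iterator<Span> documentSpanIterator();
        Span documentSpan(String documentId);
    }

    // Hypothetical in-memory implementation: one Span per document, kept in insertion order.
    public static TextBase inMemory(Map<String, String> docs) {
        final Map<String, Span> spans = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : docs.entrySet()) {
            final String id = e.getKey();
            final String text = e.getValue();
            spans.put(id, new Span() {
                public String getDocumentId() { return id; }
                public String asString() { return text; }
            });
        }
        return new TextBase() {
            public int size() { return spans.size(); }
            public Iterator<Span> documentSpanIterator() { return spans.values().iterator(); }
            // Assumption: null for an unknown ID; the documented behavior is not specified.
            public Span documentSpan(String documentId) { return spans.get(documentId); }
        };
    }

    // Walks the document spans and collects their IDs.
    public static List<String> documentIds(TextBase base) {
        List<String> ids = new ArrayList<>(base.size());
        Iterator<Span> it = base.documentSpanIterator();
        while (it.hasNext()) {
            ids.add(it.next().getDocumentId());
        }
        return ids;
    }

    public static void main(String[] args) {
        Map<String, String> docs = new LinkedHashMap<>();
        docs.put("doc1", "Hello world");
        docs.put("doc2", "Second document");
        TextBase base = inMemory(docs);

        System.out.println(base.size());
        System.out.println(documentIds(base));
        System.out.println(base.documentSpan("doc2").asString());
    }
}
```

Code written against the real interface would follow the same pattern: iterate with documentSpanIterator() when every document is needed, and call documentSpan() when the ID is already known.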