org.knowceans.corpus
Interface IRandomAccessTermCorpus

All Superinterfaces:
ITermCorpus
All Known Implementing Classes:
AmqCorpus, LuceneMapCorpus, TermCorpus

public interface IRandomAccessTermCorpus
extends ITermCorpus

IRandomAccessCorpus provides term corpus functionality that allows direct access to all fields in the corpus, i.e., the complete lists / map of indices can be read.

Author:
gregor

Method Summary
 java.util.ArrayList<java.lang.String> getDocNames()
          Get a list of all document names / ids.
 java.util.ArrayList<java.util.Map<java.lang.Integer,java.lang.Integer>> getDocTerms()
          Get list of document term maps (index->freq)
 int getNwords()
          Get the number of words (term observations) in the corpus.
 org.knowceans.map.IBijectiveMap<java.lang.String,java.lang.Integer> getTermIndex()
          Get a bijective map term / id
 
Methods inherited from interface org.knowceans.corpus.ITermCorpus
getDocTerms, getDocWords, getNdocs, getNterms, lookup, lookup, lookupDoc, lookupDoc
 

Method Detail

getDocNames

java.util.ArrayList<java.lang.String> getDocNames()
Get a list of all document names / ids.

Returns:

getDocTerms

java.util.ArrayList<java.util.Map<java.lang.Integer,java.lang.Integer>> getDocTerms()
Get list of document term maps (index->freq)

Returns:

getTermIndex

org.knowceans.map.IBijectiveMap<java.lang.String,java.lang.Integer> getTermIndex()
Get a bijective map term / id

Returns:

getNwords

int getNwords()
Get the number of words (term observations) in the corpus.

Returns: