org.knowceans.corpus
Interface ITermCorpusFiltered

All Known Subinterfaces:
IRandomAccessTermCorpusFiltered
All Known Implementing Classes:
AmqCorpus, TermCorpus

public interface ITermCorpusFiltered

ITermCorpusFiltered is an interface for corpora that provide information on filtered terms. In a filtered corpus, the value nterms and the maps and vectors returned for terms refer to the unfiltered terms, since these are the figures relevant to processing algorithms. This interface provides additional getters for the filtered terms, whose names simply carry the suffix *Filtered.
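The contract described above can be illustrated with a minimal in-memory sketch. It is not the implementation used by AmqCorpus or TermCorpus. The interface is re-declared locally from the signatures on this page so that the example compiles on its own; ToyFilteredCorpus, its fields and the sample data are hypothetical. It also assumes that "filtered" means terms removed by the filter, which this page does not state explicitly.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Local copy of the three signatures documented on this page.
interface ITermCorpusFiltered {
    Map<Integer, Integer> getDocTermsFiltered(int doc);
    int getNtermsFiltered();
    int getNwordsFiltered();
}

// Hypothetical toy corpus: docs[m] holds the term ids of document m.
class ToyFilteredCorpus implements ITermCorpusFiltered {
    private final int[][] docs;
    // Assumption: ids in this set are the terms removed by the filter.
    private final Set<Integer> filtered;

    ToyFilteredCorpus(int[][] docs, Set<Integer> filtered) {
        this.docs = docs;
        this.filtered = filtered;
    }

    @Override
    public Map<Integer, Integer> getDocTermsFiltered(int doc) {
        // Count occurrences of each filtered term id in one document.
        Map<Integer, Integer> freq = new HashMap<>();
        for (int term : docs[doc]) {
            if (filtered.contains(term)) {
                freq.merge(term, 1, Integer::sum);
            }
        }
        return freq;
    }

    @Override
    public int getNtermsFiltered() {
        // Distinct filtered term ids that occur anywhere in the corpus.
        Set<Integer> seen = new HashSet<>();
        for (int[] doc : docs) {
            for (int term : doc) {
                if (filtered.contains(term)) {
                    seen.add(term);
                }
            }
        }
        return seen.size();
    }

    @Override
    public int getNwordsFiltered() {
        // Total occurrences (tokens) of filtered terms in the corpus.
        int n = 0;
        for (int[] doc : docs) {
            for (int term : doc) {
                if (filtered.contains(term)) {
                    n++;
                }
            }
        }
        return n;
    }
}

class ToyFilteredCorpusDemo {
    public static void main(String[] args) {
        int[][] docs = { {0, 1, 1, 2}, {2, 3, 3} };
        Set<Integer> filtered = new HashSet<>(Arrays.asList(1, 3));
        ITermCorpusFiltered corpus = new ToyFilteredCorpus(docs, filtered);
        System.out.println(corpus.getNtermsFiltered());
        System.out.println(corpus.getNwordsFiltered());
        System.out.println(corpus.getDocTermsFiltered(0));
    }
}
```

The sketch recomputes the counts on every call for brevity; an implementation over a large corpus would more likely cache them when the filter is applied.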

Author:
gregor

Method Summary
 java.util.Map<java.lang.Integer,java.lang.Integer> getDocTermsFiltered(int doc)
          Get the filtered terms of a document as a frequency map (term id -> frequency).
 int getNtermsFiltered()
          Number of terms in corpus that are filtered.
 int getNwordsFiltered()
          Number of words in corpus that are filtered.
 

Method Detail

getNtermsFiltered

int getNtermsFiltered()
Number of terms in corpus that are filtered.

Returns:
the number of filtered terms

getNwordsFiltered

int getNwordsFiltered()
Number of words in corpus that are filtered.

Returns:
the number of filtered words

getDocTermsFiltered

java.util.Map<java.lang.Integer,java.lang.Integer> getDocTermsFiltered(int doc)
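Get the filtered terms of a document as a frequency map (term id -> frequency).

Parameters:
doc - the document to query
Returns:
map from term id to the frequency of that term in the document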
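A brief sketch of how the returned frequency map might be consumed. The map below is a hand-built stand-in for the result of getDocTermsFiltered(doc), with made-up ids and counts; the class DocTermsFilteredUsage and the helper totalFrequency are hypothetical names, not part of org.knowceans.corpus.

```java
import java.util.Map;
import java.util.TreeMap;

class DocTermsFilteredUsage {
    // Sums all frequencies in an id -> frequency map. For a map returned by
    // getDocTermsFiltered(doc), this is presumably the number of filtered
    // word tokens in that document.
    static int totalFrequency(Map<Integer, Integer> termFreqs) {
        int total = 0;
        for (int f : termFreqs.values()) {
            total += f;
        }
        return total;
    }

    public static void main(String[] args) {
        // Stand-in for corpus.getDocTermsFiltered(doc); values are invented.
        Map<Integer, Integer> termFreqs = new TreeMap<>();
        termFreqs.put(7, 3);
        termFreqs.put(12, 1);

        // Iterate over term id / frequency pairs.
        for (Map.Entry<Integer, Integer> e : termFreqs.entrySet()) {
            System.out.println("term " + e.getKey() + ": " + e.getValue());
        }
        System.out.println("total: " + totalFrequency(termFreqs));
    }
}
```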