Package org.knowceans.corpus.util

Class Summary
ActorNames ActorNames is a static class that provides normalisation functions for person names to identify actors.
CorpusIo InputOutput reads and outputs binary matrices
EnStemmer English stemmer based on a Snowball-based Porter algorithm.
LdaToAscii  
LocalSamplers Diverse sampling methods, including beta, gamma, multinomial, and Dirichlet distributions as well as Dirichlet processes, using Sethurahman's stick-breaking construction and Chinese restaurant process.
ReverseUnicodeMapFactory loads a text file with unicode to html conversion maps the html as key and stores the unicode as value
Stemmer German stemmer based on a Snowball algorithm.
StopWordFilter StopWordFilter
UnicodeMapFactory loads a text file with unicode to html conversion maps the unicode as key and stores the html-code