org.knowceans.corpus
Class ADocument

java.lang.Object
  extended by org.knowceans.corpus.ADocument
Direct Known Subclasses:
DpaDocument, EpgDocument, IgdDocument, NipsDocument, ReutersDocument, SimpleDocument, XptDocument

public abstract class ADocument
extends java.lang.Object

ADocument is an abstract document that subclasses extend to reflect the special features of a particular corpus. TODO: change to a bean (?)

Author:
heinrich
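
As a rough illustration of the subclassing described above, the sketch below adds one corpus-specific field to a document type. It is a minimal sketch under stated assumptions, not the library's implementation: the nested ADocument is a stand-in that only mirrors the documented public fields (whether the real class initialises its vectors is not stated here), and NewswireDocument, dateline, example() and the sample values are hypothetical names introduced for illustration.

```java
import java.util.Vector;

final class SubclassSketch {

    // Stand-in for org.knowceans.corpus.ADocument, reduced to the documented
    // public fields so the sketch compiles without the library.
    // Assumption: the vectors are created eagerly; the real class may differ.
    static abstract class ADocument {
        public String key;
        public Vector<String> ueb = new Vector<String>();
        public Vector<String> txt = new Vector<String>();
        public Vector<Integer> sentenceIndex = new Vector<Integer>();
    }

    // Hypothetical subclass that adds one corpus-specific feature.
    static class NewswireDocument extends ADocument {
        public String dateline;

        @Override
        public String toString() {
            // Vector.toString() renders its elements as "[a, b, ...]".
            return key + " [" + dateline + "] " + ueb;
        }
    }

    // Builds a small sample document using only the documented fields
    // plus the hypothetical dateline.
    static NewswireDocument example() {
        NewswireDocument doc = new NewswireDocument();
        doc.key = "msg-0001";
        doc.dateline = "Berlin";
        doc.ueb.add("Sample title");
        doc.txt.add("First paragraph.");
        return doc;
    }

    public static void main(String[] args) {
        System.out.println(example()); // prints: msg-0001 [Berlin] [Sample title]
    }
}
```

With the real library on the classpath, the stand-in class would be dropped and the subclass would extend org.knowceans.corpus.ADocument directly.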

Field Summary
 java.lang.String key
          Message UID (XML element <key>).
 java.util.Vector<java.lang.Integer> sentenceIndex
          Indices into txt where sentences start.
 java.util.Vector<java.lang.String> txt
          Body text with paragraph markup (XML element <txt>).
 java.util.Vector<java.lang.String> ueb
          Title (XML element <ueb>).
 
Constructor Summary
ADocument()
           
 
Method Summary
 java.lang.String toString()
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
 

Field Detail

key

public java.lang.String key
Message UID (XML element <key>).


ueb

public java.util.Vector<java.lang.String> ueb
Title (XML element <ueb>).


txt

public java.util.Vector<java.lang.String> txt
Body text with paragraph markup (XML element <txt>).


sentenceIndex

public java.util.Vector<java.lang.Integer> sentenceIndex
Indices into txt where sentences start.
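
One possible reading of this field is sketched below: each entry of sentenceIndex is taken to be the position in txt of the first element of a sentence, so a sentence runs from its own start index up to the next start index (or to the end of txt). This interpretation, the assumption that txt holds one token per element, and the helper names sentences() and demo() are not confirmed by the documentation above and are introduced only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Vector;

final class SentenceIndexSketch {

    // Hypothetical helper, not part of ADocument: splits txt into sentences,
    // assuming sentenceIndex holds ascending start positions into txt.
    static List<List<String>> sentences(Vector<String> txt,
                                        Vector<Integer> sentenceIndex) {
        List<List<String>> result = new ArrayList<List<String>>();
        for (int i = 0; i < sentenceIndex.size(); i++) {
            int start = sentenceIndex.get(i);
            // A sentence ends where the next one starts, or at the end of txt.
            int end = (i + 1 < sentenceIndex.size())
                    ? sentenceIndex.get(i + 1)
                    : txt.size();
            // Copy the view so the result does not depend on later changes to txt.
            result.add(new ArrayList<String>(txt.subList(start, end)));
        }
        return result;
    }

    // Sample data: seven tokens forming two sentences that start at 0 and 4.
    static List<List<String>> demo() {
        Vector<String> txt = new Vector<String>(
                Arrays.asList("The", "cat", "sat", ".", "It", "slept", "."));
        Vector<Integer> sentenceIndex = new Vector<Integer>(Arrays.asList(0, 4));
        return sentences(txt, sentenceIndex);
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints: [[The, cat, sat, .], [It, slept, .]]
    }
}
```

If txt instead holds one paragraph per element, the same slicing would group paragraphs rather than tokens, so the granularity should be checked against the concrete subclass in use.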

Constructor Detail

ADocument

public ADocument()

Method Detail

toString

public java.lang.String toString()
Overrides:
toString in class java.lang.Object