Configuring the term splitter

The term splitter interface defines numerous methods, many of which behave identically regardless of the splitting algorithm employed. To simplify implementing new term splitters, Sybase Search includes an abstract base class that you can extend to inherit much of the required functionality:

com.isdduk.text.AbstractTermSplitter

The convenience base class does not implement any splitting algorithms. The various split methods defined by the term splitter interface are as follows (see the Javadocs for the full interface method listing):

com.isdduk.text.TermSplitter
     split(java.lang.String source) : com.isdduk.util.set.FastSet<java.lang.String>
     split(java.lang.String source, boolean validate) : com.isdduk.text.StringList
     splitFrequencies(char[] source,
          com.isdduk.util.map.FastTermMap insertInto) : void
     splitFrequencies(java.lang.CharSequence source,
          com.isdduk.util.map.FastTermMap insertInto) : void
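
The division of labor between the base class and a concrete splitter can be illustrated with a small, self-contained sketch. It does not use the Sybase Search classes: SimpleTermSplitter, SimpleAbstractTermSplitter, WhitespaceTermSplitter, and the tokens method are hypothetical stand-ins, and java.util collections take the place of FastSet and FastTermMap, whose APIs are not described here. The validating split variant is left out. Treat it as an outline of the pattern under those assumptions, not as the actual AbstractTermSplitter contract.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for a subset of com.isdduk.text.TermSplitter.
// java.util types replace FastSet and FastTermMap (an assumption for illustration).
interface SimpleTermSplitter {
    Set<String> split(String source);
    void splitFrequencies(char[] source, Map<String, Integer> insertInto);
    void splitFrequencies(CharSequence source, Map<String, Integer> insertInto);
}

// Plays the role of the abstract base class: shared plumbing, no splitting algorithm.
abstract class SimpleAbstractTermSplitter implements SimpleTermSplitter {

    // The only method a concrete splitter has to supply (hypothetical name).
    protected abstract List<String> tokens(CharSequence source);

    @Override
    public Set<String> split(String source) {
        // Distinct terms, kept in order of first appearance.
        return new LinkedHashSet<>(tokens(source));
    }

    @Override
    public void splitFrequencies(CharSequence source, Map<String, Integer> insertInto) {
        // Add one to the count of each term found in the source.
        for (String term : tokens(source)) {
            insertInto.merge(term, 1, Integer::sum);
        }
    }

    @Override
    public void splitFrequencies(char[] source, Map<String, Integer> insertInto) {
        // Delegate to the CharSequence variant.
        splitFrequencies(new String(source), insertInto);
    }
}

// A concrete splitter: lowercases the input and splits on runs of whitespace.
class WhitespaceTermSplitter extends SimpleAbstractTermSplitter {
    @Override
    protected List<String> tokens(CharSequence source) {
        List<String> out = new ArrayList<>();
        for (String term : source.toString().toLowerCase(Locale.ROOT).split("\\s+")) {
            if (!term.isEmpty()) { // skip the empty token produced by leading whitespace
                out.add(term);
            }
        }
        return out;
    }
}
```

In this sketch a new splitting algorithm only overrides tokens; the set-building and frequency-counting methods are inherited unchanged, which is the kind of reuse the abstract base class is meant to provide.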