Document Clustering using Combinatorial Topology

This document clustering/data mining software suite is composed of various tools.
WORDNET needs to be installed if you want to use any of them.
You will also need FLEX if you want to edit the tokenizer(dmlex).
The lexer is available as a stand alone program to produce the inverted file from a set of documents.
A sample input file is available in the tar ball.

Files available for download as of 4.10.2007:

Tokenizer: