Changes for version 0.11 - 2003-04-22
- deprecated setparse() and replaced $self->{parsefcn} crud with a simple member function "parse" that can be subclassed as needed. This simplified a lot of crud elsewhere.
- created a "classify" function to make classification tables on an existing, trained Classifier (avoiding destructive validation in "crossvalidate")
- tore out and replaced save/load functionality from scratch to use Storable to a single file rather than dumping it to a set of MLDBM files (seems to have improved size/speed of resulting saved files, too)
- new save/load subsystem also fixes bugs loading saved Classifier objects (thanks to Brad Davis for finding this bug in the first place)
- added a very rudimentary procmail-based spam filter called "spamometer" in the examples directory
Modules
Perl extension for probabilistic mail classification
spam classification based on Paul Graham's algorithm
a trivial subclass example
Examples
- examples/corpora/README
- examples/corpora/sa-nonspam.mbox
- examples/corpora/sa-spam.mbox
- examples/graham-test.pl
- examples/spamometer/README
- examples/spamometer/new-spamometer.pl
- examples/spamometer/procmailrc
- examples/spamometer/tag-message.pl
- examples/spamometer/train-spamometer.pl
- examples/trivial-test.pl