AI-Categorize-0.07

Changes for version 0.07 - 2002-02-15

Major improvements to NaiveBayes - it now gives results on par with (slightly better than, actually) the Naive Bayes results in Yang's "Re-Examination" paper.
Corrected a floating-point underflow problem in NaiveBayes that occurred with long documents
When categorizing, NaiveBayes now correctly skips words that weren't in any training documents.
Added the threshold() accessor method to NaiveBayes
Fixed a crash that occurred when no stopwords were specified
Improved the formatting of the output for AI::Categorize::Evaluate
reuters-21578, features_kept => 0.1
- Summary *************************************
- Name miR miP miF1 error time *
- 01-NaiveBayes: 0.824 0.883 0.839 0.005 407 sec *
drmath-1.00, features_kept => 0.1
- Summary *************************************
- Name miR miP miF1 error time *
- 01-NaiveBayes: 0.323 0.397 0.341 0.016 145 sec *
- 01-kNN: 0.636 0.144 0.220 0.078 990 sec * <- features_kept => 0.2
- 01-kNN: 0.606 0.149 0.223 0.073 1221 sec * <- features_kept => 0.1

Automatically categorize documents based on content

Automate and compare AI::Categorize modules

Naive Bayes Algorithm For AI::Categorize

Base class for other algorithms

k-Nearest-Neighbor Algorithm For AI::Categorize

in Categorize.pm

in Categorize.pm

To install AI::Categorize, copy and paste the appropriate command in to your terminal.

cpanm AI::Categorize

perl -MCPAN -e shell
install AI::Categorize

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	go to github issues (only if github is preferred repository)

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)