NAME

Stemmer - Perl extension for stemming words using a modified version of the Stem routine from the SWISH search engine. (http://sunsite.berkeley.edu/SWISH-E/)

SYNOPSIS

use SWISH::Stemmer;
$stemmed_word = SwishStem( $word );

DESCRIPTION

This module provides access to the Stem() function used in SWISH-E to stem words. This module is needed if you wish to highlight words in source documents.

Not that this module includes the stemmer.c function from the SWISH-E distribution. You should make sure that the stemmer.c file contained here is closely matched to the stemmer.c module in the SWISH-E distribution.

Also, there is a SWISH-E library that will contain the Stem() and SwishStem() functions. So you may be able to access that library instead of using this module. Check the SWISH-E discussion list for more info.

Purpose:    Implementation of the Porter stemming algorithm documented 
            in: Porter, M.F., "An Algorithm For Suffix Stripping," 
            Program 14 (3), July 1980, pp. 130-137.

Provenance: Written by B. Frakes and C. Cox, 1986.
            Changed by C. Fox, 1990.
               - made measure function a DFA
               - restructured structs
               - renamed functions and variables
               - restricted function and variable scopes
            Changed by C. Fox, July, 1991.
               - added ANSI C declarations 
               - branch tested to 90% coverage

Notes:      This code will make little sense without the the Porter
            article.  The stemming function converts its input to
            lower case.

AUTHOR

Bill Moseley used the stemmer.c from the Swish-e distribution for this module.

SEE ALSO

perl(1)