NAME
Text::WordGrams - Calculates statistics on word ngrams.
VERSION
Version 0.01
SYNOPSIS
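use Text::WordGrams;

# Count word n-grams in $text; returns a hash reference
my $data = word_grams( $text );

# Options, if any, go in a hash reference passed as the first argument
my $trigrams = word_grams( { size => 3, ignore_case => 1 }, $text );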
FUNCTIONS
word_grams
Returns a reference to a hash table of word n-gram counts for the specified string. Options, if needed, are passed as a hash reference in the first argument.
Options include:
- ignore_case
Set this option to ignore case in the text.
- size
Set this option to the n-gram size you want. The value must be greater than or equal to two. Keep in mind that the larger the size you ask for, the larger the hash becomes. Future releases might include a DB File version to reduce memory consumption.
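To make the effect of the two options concrete, here is a minimal pure-Perl sketch of word n-gram counting. It is not the module's implementation: count_word_grams is a hypothetical helper that only mirrors the calling convention described above (options hash reference first, text second). The tokenizing rule (runs of \w characters), the key format (words joined by a single space) and the fallback size of 2 are all assumptions made for illustration.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT Text::WordGrams itself.
# Assumptions: words are runs of \w characters; each hash key is the
# n-gram's words joined by one space; size falls back to 2 if not given.
sub count_word_grams {
    my ($opts, $text) = @_;
    my $size = $opts->{size} // 2;
    die "size must be greater than or equal to two\n" if $size < 2;

    # ignore_case: fold the whole text to lower case before counting
    $text = lc $text if $opts->{ignore_case};
    my @words = $text =~ /(\w+)/g;

    my %count;
    # Slide a window of $size words over the text; the range is empty
    # when the text has fewer than $size words.
    for my $i (0 .. $#words - $size + 1) {
        $count{ join ' ', @words[ $i .. $i + $size - 1 ] }++;
    }
    return \%count;
}

my $grams = count_word_grams( { size => 2, ignore_case => 1 },
                              'The cat saw the cat' );
printf "%-10s %d\n", $_, $grams->{$_} for sort keys %$grams;
```

With ignore_case set, "The cat" and "the cat" fall under the same key, so that bigram is counted twice while the other two bigrams are counted once each. Each increase in size tends to produce more distinct keys for the same text, which is why the hash grows with larger sizes.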
AUTHOR
Alberto Simões, <ambs@cpan.org>
BUGS
Please report any bugs or feature requests to bug-text-wordgrams@rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Text-WordGrams. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.
COPYRIGHT & LICENSE
Copyright 2005 Alberto Simões, all rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.