Revision history for Perl extension Text::Ngrams.
0.01 Wed Jun 4 20:52:49 ADT 2003
- initial version
0.02 Fri Jun 6 16:00:00 ADT 2003
- fixed installation of ngrams.pl man page
0.03 Mon Jun 9 07:05:00 ADT 2003
- fixed test cases 3, 5, and 7 (problems found thanks to
cpan-testers, Josts Smokehouse)
0.04 Jun 11 2003
- added parameter 'limit' to new
- added non-ordered print
- added parameter 'onlyfirst' to print
0.05 Wed Sep 3 13:00:43 ADT 2003
- fixed README
1.0 Thu Dec 18 13:09:45 AST 2003
- added parameter 'normalize' to ngrams.pl and to_string
method in Ngrams.pm
1.1 Sat May 29 16:15:34 ADT 2004
- corrected typo submitted by Shlomo Yona
1.2 Thu Jul 29 15:05:28 ADT 2004
- documentation improvement
- introduced first and last n-gram in the output
- made _reduce_to_limit more efficient (--limit parameter)
thanks to observations by David Allen
1.3 Mon Nov 1 12:52:02 AST 2004
- Improved for multi-line token processing
- resolved efficiency issue with binary file (not reading line
by line any more)
- introduced parameters skipinsert and allow_iproc
- thanks to Andrija Tomovic, Roger Zhang for suggestions
1.4 Wed Nov 3 08:45:27 AST 2004
- Fixed minor testing bugs, a warning
1.5 Wed Nov 3 14:04:30 AST 2004
- minor testing bug (rounding error)
1.6 Thu Dec 2 23:00:42 AST 2004
- added utf8 type n-grams
1.7 Wed Dec 8 11:01:07 AST 2004
- fixed a documentation typo (thanks to Kevin J. Ziese)
- solving "our deprecated" warning
1.8 Thu Dec 15 11:45:38 AST 2005
- debuging
- introduced spartan option in to_string
- function get_ngrams added
- changed output format to include space between tokens
(no token separator)
1.9 Mon Nov 6 07:20:42 AST 2006
- bug fixed in get_ngrams
- added option spartan to script ngrams.pl
2.001 Sat Oct 25 00:10:40 ADT 2008
- fixed bug in to_string for writing to a file with given name
2.002 Sat Oct 25 09:31:17 ADT 2008
- Improved documentation.
- changed perl to $^X in test 06, which may cause problems
with testing