Changes for version 0.25 - 2010-05-21

  • Really dropped Text::ExtractWords dependency;
  • Some code rewrite for UTF-8 support (now is the default);
  • Dropped some knew languages given lack of training corpora;
  • Added some new languages;
  • Added at least two tests per language (t/02 and t/06);

Documentation

identifies the language files are written in
creates language modules for Lingua::Identify

Modules

Language identification
Meta-information on Bulgarian
Meta-information on Danish
Meta-information on German
Meta-information on English
Meta-information on Spanish
Meta-information on Finnish
Meta-information on French
Meta-information on Croatian
Meta-information on Hungarian
Meta-information on Indonesian
Meta-information on Italian
Meta-information on Latin
Meta-information on Dutch
Module for tests
Meta-information on Polish
Meta-information on Portuguese
Meta-information on Romanian
Meta-information on Russian
Meta-information on Slovene
Meta-information on Albanian
Meta-information on Swedish
Meta-information on Turkish