NAME
Lingua::RU::OpenCorpora::Tokenizer::List - represents a data file
DESCRIPTION
This module provides an API to access files that are used by tokenizer.
It's useful to know that this module actually has 2 versions: the code version and the data version. These versions do not depend on each other.
METHODS
new($name [, $args])
Constructor.
Takes one required argument: list name. List name is one of these: exceptions, prefixes and hyphens.
Optionally you can pass a hashref with additional arguments:
- data_dir
-
Path to the directory where vectors file is stored. Defaults to distribution directory (see File::ShareDir).
in_list($value)
Checks if given value is in the list.
Returns true or false correspondingly.
SEE ALSO
Lingua::RU::OpenCorpora::Tokenizer::Vectors
Lingua::RU::OpenCorpora::Tokenizer::Updater
Lingua::RU::OpenCorpora::Tokenizer
AUTHOR
OpenCorpora team http://opencorpora.org
LICENSE
This program is free software, you can redistribute it under the same terms as Perl itself.