NAME
Lingua::RU::OpenCorpora::Tokenizer::Vectors - represents a file with vectors
DESCRIPTION
This module inherits most of its code from Lingua::RU::OpenCorpora::Tokenizer::List.
The code lives in a separate class because the vectors file has a slightly different format and must be processed in a slightly different manner.
METHODS
new([$args])
Constructor.
Takes an optional hashref with arguments:
- data_dir
Path to the directory where the vectors file is stored. Defaults to the distribution directory (see File::ShareDir).
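A minimal sketch of the two ways to call the constructor, without arguments and with an explicit data_dir. The try_new() helper is hypothetical and exists only so the script runs even where the module or its data is missing; '/path/to/data' is a placeholder, not a real location.

```perl
use strict;
use warnings;

# Hypothetical helper for this sketch: returns a Vectors object, or undef
# if anything dies along the way (for example, the module is not installed).
sub try_new {
    my @args = @_;
    return eval {
        require Lingua::RU::OpenCorpora::Tokenizer::Vectors;
        Lingua::RU::OpenCorpora::Tokenizer::Vectors->new(@args);
    };
}

# No arguments: the vectors file is looked up in the distribution directory.
my $default = try_new();

# Explicit directory, passed in a hashref; the path is a placeholder.
my $custom = try_new({ data_dir => '/path/to/data' });

for my $pair ([default => $default], [custom => $custom]) {
    my ($name, $obj) = @$pair;
    print "$name: ", (defined $obj ? "loaded" : "not available"), "\n";
}
```

In ordinary code the eval wrapper is optional; calling new directly is enough when a failure should simply propagate.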
in_list($vector)
Given a vector, checks whether a probability value is defined for it.
Returns the probability if one is defined, undef otherwise.
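Because the return value is either a probability or undef, callers will usually want to test it with defined rather than for truth, in case a stored probability is 0. The sketch below illustrates that idiom with a hypothetical stand-in class (My::FakeVectors) that follows the same in_list contract; the vectors and probabilities are made up for illustration and do not come from a real vectors file.

```perl
use strict;
use warnings;

# Hypothetical stand-in with the in_list() contract described above:
# the probability if one is defined for the vector, undef otherwise.
# The real class reads its data from the vectors file instead.
package My::FakeVectors;
sub new     { my ($class, $data) = @_; return bless { data => $data }, $class }
sub in_list { my ($self, $vector) = @_; return $self->{data}{$vector} }

package main;

# Made-up vectors and probabilities, for illustration only.
my $vectors = My::FakeVectors->new({ 1 => 0.85, 2 => 0 });

# Describe a lookup result; a probability of 0 still counts as known.
sub describe {
    my ($vector) = @_;
    my $p = $vectors->in_list($vector);
    return defined $p ? "known, p=$p" : "unknown";
}

print describe($_), "\n" for 1, 2, 3;
```

With a real Vectors object the describe() logic would stay the same; only the construction of $vectors changes.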
SEE ALSO
Lingua::RU::OpenCorpora::Tokenizer::List
Lingua::RU::OpenCorpora::Tokenizer::Updater
Lingua::RU::OpenCorpora::Tokenizer
AUTHOR
OpenCorpora team http://opencorpora.org
LICENSE
This program is free software; you can redistribute it under the same terms as Perl itself.