NAME
Word::Segmenter::Chinese::Lite - Split Chinese into words
SYNOPSIS
use Word::Segmenter::Chinese::Lite qw(wscl_seg);
my @result = wscl_seg("中华人民共和国成立了oyeah");
print join("/", @result), "\n";  # show the segments separated by "/"
DESCRIPTION
Only UTF-8 string input is supported.
TODOS
1. Optimize dictionary loading speed.
2. Support custom dictionaries.
3. Add overlapping-bigram, bigram, and 1-gram algorithms.
4. Support specifying the maximum word length.
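For readers unfamiliar with the terms in item 3, the following is a minimal sketch of what 1-gram, bigram, and overlapping-bigram splitting usually mean. It is an illustration only, not this module's implementation: the helper names unigrams, bigrams, and overlapping_bigrams are hypothetical, and the sketch assumes decoded character strings (via "use utf8") rather than whatever internal representation wscl_seg uses.

```perl
use strict;
use warnings;
use utf8;                               # string literals below are UTF-8 characters
binmode(STDOUT, ':encoding(UTF-8)');    # encode characters again on output

# Hypothetical helpers for illustration; none of these exist in
# Word::Segmenter::Chinese::Lite.

# 1-gram: every character is its own token.
sub unigrams {
    my ($text) = @_;
    return split //, $text;
}

# Bigram: consecutive, non-overlapping pairs; a trailing odd
# character becomes a single-character token.
sub bigrams {
    my ($text) = @_;
    my @chars = split //, $text;
    my @out;
    while (@chars) {
        push @out, join '', splice(@chars, 0, 2);
    }
    return @out;
}

# Overlapping bigram: a two-character window moved one character at a time.
sub overlapping_bigrams {
    my ($text) = @_;
    my @chars = split //, $text;
    return map { $chars[$_] . $chars[$_ + 1] } 0 .. $#chars - 1;
}

my $text = '中华人民共';
print join('/', unigrams($text)), "\n";
print join('/', bigrams($text)), "\n";
print join('/', overlapping_bigrams($text)), "\n";
```

None of these strategies needs a dictionary, which is presumably why they are listed as additions alongside the existing dictionary-based segmentation.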
METHODS
wscl_seg()
The main function.
Input: a UTF-8 string to be split.
Output: a list.
EXPORT
No functions are exported by default; import wscl_seg explicitly, as in the SYNOPSIS.
AUTHOR
Chen Gang, <yikuyiku.com@gmail.com>
COPYRIGHT AND LICENSE
Copyright (C) 2014 by Chen Gang
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.16.2 or, at your option, any later version of Perl 5 you may have available.