NAME
Lingua::JA::Summarize::Extract - summary generator for Japanese
SYNOPSIS
use strict;
use warnings;
use utf8;
use Lingua::JA::Summarize::Extract;
my $text = '日本語の文章を適当に書く。';
my $summary = Lingua::JA::Summarize::Extract->extract($text);
print $summary->as_string;
print "$summary";
# cuts short to 20 length
$summary->length(20);
print "$summary";
# mecab charset
my $extractor = Lingua::JA::Summarize::Extract->new({ mecab_charset => 'utf8' });
DESCRIPTION
Lingua::JA::Summarize::Extract is a summary generator for Japanese text. The extraction method can be changed with the plug-in mechanism.
METHODS
- new([options])
-
a object is made by using the options.
- extract(text[, options])
-
text is summarized. blessed by using options if called direct. return to Lingua::JA::Summarize::Extract::ResultSet object.
OPTIONS
the content of processing can be changed by passing the constructor the options.
- plugins
-
the processing of split of word and line and the scoring etc. can be done by using another modules. please pass it by the ARRAY reference.
- rate
-
the weight at scoring can be changed.
thing to refer to POD of each plugin when you want to examine other options.
THANKS TO
Tatsuhiko Miyagawa
AUTHOR
Kazuhiro Osawa <ko@yappo.ne.jp>
SEE ALSO
http://gensen.dl.itc.u-tokyo.ac.jp/, http://www.ryo.com/getsen/
LICENSE
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
1 POD Error
The following errors were encountered while parsing the POD:
- Around line 86:
Non-ASCII character seen before =encoding in ''日本語の文章を適当に書く。';'. Assuming UTF-8