NAME

Lingua::JA::OkapiBM25 - Okapi-BM25 algorithm module which derived from Lingua::JA::TFIDF

SYNOPSIS

use Lingua::JA::OkapiBM25;
use Data::Dumper; 

my $calc   = Lingua::JA::OkapiBM25->new(%config);
my $result = $calc->bm25($text);
print Dumper $result->list;

DESCRIPTION

* This software is still in alpha release *

Okapi-BM25 algorithm module which derived from Lingua::JA::TFIDF

Sorry, this module for Japanese only.

OkapiBM25 is ...

In information retrieval, Okapi BM25 is a ranking function used by search engines to rank matching documents according to their relevance to a given search query. It is based on the probabilistic retrieval framework developed in the 1970s and 1980s by Stephen E. Robertson, Karen Spärck Jones, and others.

The name of the actual ranking function is BM25. To set the right context, however, it usually referred to as "Okapi BM25", since the Okapi information retrieval system, implemented at London's City University in the 1980s and 1990s, was the first system to implement this function.

BM25, and its newer variants, e.g. BM25F (a version of BM25 that can take document structure and anchor text into account), represent state-of-the-art retrieval functions used in document retrieval, such as Web search.

--from wikipedia ( http://en.wikipedia.org/wiki/Okapi_BM25 )

METHODS

new

bm25

AUTHOR

Takeshi Miki <miki@cpan.org>

LICENSE

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

SEE ALSO

1 POD Error

The following errors were encountered while parsing the POD:

Around line 74:

Non-ASCII character seen before =encoding in 'Spärck'. Assuming UTF-8