There is an ongoing outage on the primary CPAN mirror. It is possible to work around the issue by using MetaCPAN as a mirror.

NAME

Plucene::SearchEngine::Index::HTML - Backend for simply parsing HTML

DESCRIPTION

This backend analysis a HTML file for the following Plucene fields:

text

The text part of the HTML

A list of links in the HTML

Additionally, any META tags are turned into Plucene fields.