Changes for version 0.22

  • Updated the XPath queries used to parse HTML files.
  • Updated the documentation.
  • Improved the parsing speed of the HTML pages.

Documentation

Script to create a corpus for summarization testing.

Modules

Creates corpora for summarization testing.