NAME

WWW::ContentRetrieval::Extract - Content Extractor

SYNOPSIS

 use WWW::ContentRetrieval::Extract;

 $e = WWW::ContentRetrieval::Extract->new({
     TEXT    => $t,                      # webpage text
     DESC    => $desc->{foo},            # site foo
     THISURL => 'http://bazz.buzz.org/', # url of TEXT
 });

 print Dumper $e->extract;

DESCRIPTION

WWW::ContentRetrieval::Extract extracts data according to a given description file.

METHODS

new

$e = new ({
   TEXT    => page's content,
   THISURL => URL of the text,
   DESC    => data description
});

extract

$e->extract returns an array of hashes. You may use Data::Dumper to see it

STANDALONES

WWW::ContentRetrieval::Extract::lookup( text, node_identifier )

WWW::ContentRetrieval::Extract::lookup( WWW::ContentRetrieval::bldTree($t), "0.0.0");

It looks up the given text for the some node identifier, and returns an anonymous hash with entries "tag" and "text".

COPYRIGHT

xern <xern@cpan.org>

This module is free software; you can redistribute it or modify it under the same terms as Perl itself.

To install WWW::ContentRetrieval, copy and paste the appropriate command in to your terminal.

cpanm

cpanm WWW::ContentRetrieval

CPAN shell

perl -MCPAN -e shell
install WWW::ContentRetrieval

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	Go to GitHub issues (only if GitHub is preferred repository)

	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)