NAME

WWW::ContentRetrieval::Extract - Content Extractor

SYNOPSIS

 use WWW::ContentRetrieval::Extract;
 use Data::Dumper;                       # provides Dumper()

 my $e = WWW::ContentRetrieval::Extract->new({
     TEXT    => $t,                      # webpage text
     DESC    => $desc->{foo},            # description for site foo
     THISURL => 'http://bazz.buzz.org/', # url of TEXT
 });

 print Dumper $e->extract;

DESCRIPTION

WWW::ContentRetrieval::Extract extracts data from a page's text according to a given data description.

METHODS

new

$e = WWW::ContentRetrieval::Extract->new({
   TEXT    => $text,   # page's content
   THISURL => $url,    # URL of the text
   DESC    => $desc,   # data description
});

See WWW::ContentRetrieval for how to write a description.

extract

$e->extract() returns an array of hashes.

You may use Data::Dumper to inspect the result.
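
The following is a minimal sketch of inspecting such a result. It does not call the module: @records is a hypothetical stand-in for what extract might return, and the field names title and url are invented for illustration, since the real keys depend on your description. It assumes a list of hash references; if your version hands back an array reference instead, dereference it first.

```perl
use strict;
use warnings;
use Data::Dumper;

# Hypothetical stand-in for the result of $e->extract.
# The keys 'title' and 'url' are assumptions, not part of the module.
my @records = (
    { title => 'First item',  url => 'http://bazz.buzz.org/1' },
    { title => 'Second item', url => 'http://bazz.buzz.org/2' },
);

# Sort hash keys so the dump is stable from run to run.
$Data::Dumper::Sortkeys = 1;

# Dump the whole structure at once.
print Dumper(\@records);

# Or walk it record by record, field by field.
my @lines;
for my $rec (@records) {
    for my $field (sort keys %$rec) {
        push @lines, "$field: $rec->{$field}";
    }
}
print "$_\n" for @lines;
```

Dumper is convenient for a first look; the explicit loop is closer to what a script would do once the field names are known.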

SEE ALSO

WWW::ContentRetrieval, WWW::ContentRetrieval::Spider

COPYRIGHT

xern <xern@cpan.org>

This module is free software; you can redistribute it or modify it under the same terms as Perl itself.