NAME

WWW::CheckSite - OO interface to an iterator that checks a website

SYNOPSIS

use WWW::CheckSite;

my $wcs = WWW::CheckSite->new(
    uri    => 'http://www.test-smoke.org/',
    prefix => 'tsorg',
    save   => 1,
);

$wcs->validate;

$wcs->write_report;

Or using saved data (skip the real validation):

my $wcs = WWW::CheckSite->load(
    uri    => 'http://www.test-smoke.org/',
    prefix => 'tsorg',
);

$wcs->write_report;

DESCRIPTION

This module implents a spider, that checks the pages on a website. For each page the links and images on that page are checked for availability. After that, the page is validated by W3.ORG.

When the spider is done, one can have a report in HTML written.

WARNING: Although the spider respects /robots.txt on the target site, the validator does not! Use this tool only on your own sites.

METHODS

WWW::CheckSite->new( %args )

Initialize a new instance. Options supported:

  • uri => the base uri to check [mandatory]

  • prefix => the name of the project [mandatory]

  • dir => target directory (curdir())

  • save => true/false (false)

  • strictrules => true/false (false)

  • validate => by_none/by_uri/by_upload (by_none)

WWW::CheckSite->load( %args )

Initialize the object from datafile. Supported options:

  • dir => target/source directory

  • prefix => the prefix used for this dataset [mandatory]

$wcs->validate

The validate() method collects all the data.

$wcs->write_report

Generate the reports.

$wcs->_die;

Do a Carp::croak().

NO METHODS

create_report()

AUTHOR

Abe Timmerman, <abeltje@cpan.org>

BUGS

Please report any bugs or feature requests to bug-www-checksite@rt.cpan.org, or through the web interface at http://rt.cpan.org. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.

COPYRIGHT & LICENSE

Copyright MMV Abe Timmerman, All Rights Reserved.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.