NAME
WWW::Search::WebCrawler - class for searching WebCrawler
SYNOPSIS
use WWW::Search;
my $oSearch = new WWW::Search('WebCrawler');
my $sQuery = WWW::Search::escape_query("+sushi restaurant +Columbus Ohio");
$oSearch->native_query($sQuery);
while (my $oResult = $oSearch->next_result()) {
  print $oResult->url, "\n";
}
DESCRIPTION
This class is a WebCrawler specialization of WWW::Search. It handles making and interpreting searches on WebCrawler (http://www.WebCrawler.com).
This class exports no public interface; all interaction should be done through WWW::Search objects.
SEE ALSO
To make new back-ends, see WWW::Search.
HOW DOES IT WORK?
native_setup_search is called (from WWW::Search::setup_search) before we do anything. It initializes our private variables (which all begin with underscore) and sets up a URL to the first results page in {_next_url}.
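As a rough illustration of the setup step, the sketch below initializes a few underscore-prefixed fields and stores a first-page URL in {_next_url}. The base URL, the option names (query, count), the page size, and both function names are placeholders invented for this example; they are not the values or routines this module actually uses.

```perl
use strict;
use warnings;

# Hypothetical helper: turn a hash of options into a CGI query string.
# Keys are sorted so the result is deterministic.
sub build_query_string {
    my (%opts) = @_;
    return join '&', map { "$_=$opts{$_}" } sort keys %opts;
}

# Sketch of a setup step: initialize private (underscore-prefixed)
# fields and record the URL of the first results page.
sub sketch_setup_search {
    my ($self, $escaped_query) = @_;
    $self->{_hits_per_page} = 25;                                      # assumed page size
    $self->{_base_url}      = 'http://search.example/cgi-bin/search';  # placeholder URL
    my %opts = (
        query => $escaped_query,            # assumed option name
        count => $self->{_hits_per_page},   # assumed option name
    );
    $self->{_next_url} = $self->{_base_url} . '?' . build_query_string(%opts);
    return $self;
}

my $search = sketch_setup_search({}, 'sushi+restaurant');
print $search->{_next_url}, "\n";
```

The query passed in is assumed to be escaped already, as in the SYNOPSIS, so the helper does no escaping of its own.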
native_retrieve_some is called (from WWW::Search::retrieve_some) whenever more hits are needed. It calls WWW::Search::http_request to fetch the page specified by {_next_url}. It then parses this page, appending any search hits it finds to {cache}. If it finds a "next" button in the text, it sets {_next_url} to point to the page for the next set of results; otherwise it sets it to undef to indicate we're done.
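The paging behaviour described above can be sketched in a few lines. This is a stand-alone illustration, not the module's code: the canned pages, the HTML patterns, and the names fake_fetch and sketch_retrieve_some are all assumptions, and a hash lookup stands in for the real HTTP request so the example runs without a network.

```perl
use strict;
use warnings;

# Canned pages standing in for fetched results (hypothetical markup).
# Page 1 carries a "next" link; page 2 does not.
my %PAGES = (
    'page1' => join("\n",
        '<a href="http://a.example/">A</a>',
        '<a href="http://b.example/">B</a>',
        '<a class="next" href="page2">next</a>'),
    'page2' => '<a href="http://c.example/">C</a>',
);

# Stand-in for the HTTP fetch; returns an empty page for unknown URLs.
sub fake_fetch {
    my ($url) = @_;
    return exists $PAGES{$url} ? $PAGES{$url} : '';
}

# Fetch the page at {_next_url}, append hits to {cache}, then point
# {_next_url} at the following page, or undef when there is none.
sub sketch_retrieve_some {
    my ($self) = @_;
    return 0 unless defined $self->{_next_url};
    my $page = fake_fetch($self->{_next_url});
    my ($hits, $next) = (0, undef);
    for my $line (split /\n/, $page) {
        if ($line =~ /class="next" href="([^"]+)"/) {
            $next = $1;                       # found the "next" button
        }
        elsif ($line =~ /href="([^"]+)"/) {
            push @{ $self->{cache} }, $1;     # an ordinary search hit
            $hits++;
        }
    }
    $self->{_next_url} = $next;
    return $hits;
}

my $search = { _next_url => 'page1', cache => [] };
while (defined $search->{_next_url}) {
    sketch_retrieve_some($search);
}
print "$_\n" for @{ $search->{cache} };
```

Using undef in {_next_url} as the end-of-results signal means the caller needs no separate "done" flag: it simply stops asking once the field is undefined.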
BUGS
Please tell the author if you find any!
TESTING
This module adheres to the WWW::Search test suite mechanism. See $TEST_CASES below.
AUTHOR
As of 1998-03-16, WWW::Search::WebCrawler is maintained by Martin Thurn (MartinThurn@iname.com).
WWW::Search::WebCrawler was originally written by Martin Thurn, based on WWW::Search::HotBot.
LEGALESE
THIS SOFTWARE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.
VERSION HISTORY
If it's not listed here, then it wasn't a meaningful or released version.
2.02, 1999-10-05
now uses hash_to_cgi_string()
2.01, 1999-07-13
1.13, 1999-03-29
Remove extraneous HTML from description (thanks to Jim Smyser jsmyser@bigfoot.com)
1.11, 1998-10-09
Now uses split_lines function
1.9, 1998-08-20
New format of www.webcrawler.com output.
1.7
\n changed to \012 for MacPerl compatibility
1.5, 1998-05-29
New format of www.webcrawler.com output.
1.3
First publicly-released version.