NAME
SWISH::Prog::Native::Indexer - wrapper around Swish-e binary
SYNOPSIS
    use SWISH::Prog::Native::Indexer;
    my $indexer = SWISH::Prog::Native::Indexer->new(
        invindex => SWISH::Prog::Native::InvIndex->new,
        config   => SWISH::Prog::Config->new,
        count    => 0,
        clobber  => 1,
        flush    => 10000,
        started  => time(),
    );
    $indexer->start;
    for my $doc (@list_of_docs) {
        $indexer->process($doc);
    }
    $indexer->finish;
DESCRIPTION
The Native Indexer is a wrapper around the swish-e version 2.x binary tool.
METHODS
new
Create an indexer object. All of the following parameters are also accessor methods.
- invindex
  A SWISH::Prog::Native::InvIndex object.
- config
  A SWISH::Prog::Config object.
- exe
  The path to the swish-e executable. If empty, swish-e is looked up in $ENV{PATH}.
- verbose
  Takes the same arguments as the swish-e -v option.
- warnings
  Takes the same arguments as the swish-e -W option.
- opts
  String of options passed directly to the swish-e program.
init
Initialize object. Called by new().
swish_check
Returns true if the exe() executable works, false otherwise.
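The documentation does not say how swish_check() decides that the executable works. As a rough illustration of the $ENV{PATH} lookup described for exe(), the sketch below searches the PATH directories for a program by name. find_in_path() is a hypothetical helper, not part of SWISH::Prog, and it only tests that a file exists and is executable; it does not run swish-e.

```perl
use strict;
use warnings;
use File::Spec;

# Hypothetical helper, not part of SWISH::Prog: return the full path of the
# first executable file called $name in the PATH directories, or undef.
sub find_in_path {
    my ($name) = @_;
    for my $dir ( File::Spec->path ) {
        my $candidate = File::Spec->catfile( $dir, $name );
        return $candidate if -f $candidate && -x _;
    }
    return;
}

my $exe = find_in_path('swish-e');
print defined $exe ? "found $exe\n" : "swish-e not found in PATH\n";
```

A path found this way could then be passed as the exe parameter to new(), assuming you want to pin the binary rather than rely on the default lookup.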
start( [cmd] )
Start the indexer on its merry way. Stores the filehandle in fh().
Returns the $indexer object.
You will usually want to omit cmd and let start() construct the command for you.
fh
Get or set the open() filehandle for the swish-e process. CAUTION: don't set unless you know what you're doing.
You can print() to the filehandle using the SWISH::Prog index() method. Or do it directly like:
    print { $indexer->fh } "your headers and body here";
The filehandle is close()'d by the finish() method.
finish
Close the open fh() filehandle and check for any errors.
Called by the magic DESTROY method so $indexer will finish() whenever it goes out of scope.
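The start(), fh(), finish() and DESTROY behaviour described above can be sketched with a toy class that pipes to a child process. This is not the module's actual implementation: My::PipeIndexer is a hypothetical name, a child perl that drains STDIN stands in for swish-e, and swallowing errors inside DESTROY is a choice made for this sketch only.

```perl
use strict;
use warnings;

package My::PipeIndexer;    # hypothetical class, for illustration only

sub new {
    my ( $class, %args ) = @_;
    return bless { cmd => $args{cmd}, fh => undef }, $class;
}

# Open a write pipe to the child process and remember the filehandle.
sub start {
    my ($self) = @_;
    open( my $fh, '|-', @{ $self->{cmd} } )
        or die "can't start @{ $self->{cmd} }: $!";
    $self->{fh} = $fh;
    return $self;
}

sub fh { return $_[0]->{fh} }

# Close the pipe, which waits for the child, and die if the child failed.
# Returns 1 after closing, 0 if there was nothing left to close.
sub finish {
    my ($self) = @_;
    my $fh = delete $self->{fh} or return 0;
    close($fh);
    die "child failed (wait status $?)\n" if $?;
    return 1;
}

# Runs when the last reference goes away, so an unfinished pipe is still closed.
sub DESTROY {
    my ($self) = @_;
    local ( $@, $!, $? );
    eval { $self->finish };
    return;
}

package main;

# Stand-in for swish-e: a child perl that just reads STDIN to the end.
my $indexer = My::PipeIndexer->new( cmd => [ $^X, '-e', '1 while <STDIN>' ] );
$indexer->start;
print { $indexer->fh } "your headers and body here";
$indexer->finish;
```

Calling finish() explicitly, as in the SYNOPSIS, lets you see a failure at a known point in your program rather than at whatever moment the object happens to be destroyed.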
merge( @InvIndex objects )
merge() will merge the given SWISH::Prog::Native::InvIndex objects with the index named in the calling Indexer object.
Returns the $indexer object on success, croaks on failure.
process( doc )
process() will parse and index doc. doc should be a SWISH::Prog::Doc instance.
Will croak() on failure.
add( doc )
Add doc to the index.
Note that this is slower than merge(). If you have multiple doc objects, create a new Indexer object and process() them all, then merge() the two InvIndex objects.
    my $indexer = SWISH::Prog::Native::Indexer->new(invindex => 'tmpmerge');
    $indexer->start;
    for my $doc (@list_of_docs) {
        $indexer->process($doc);
    }
    $indexer->finish;
    $indexer->merge( 'path/to/other/index' );
AUTHOR
Peter Karman, <perl@peknet.com>
BUGS
Please report any bugs or feature requests to bug-swish-prog at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=SWISH-Prog. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.
SUPPORT
You can find documentation for this module with the perldoc command.
perldoc SWISH::Prog
You can also look for information at:
Mailing list
RT: CPAN's request tracker
AnnoCPAN: Annotated CPAN documentation
CPAN Ratings
Search CPAN
COPYRIGHT AND LICENSE
Copyright 2008-2009 by Peter Karman
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.