NAME

PDF::GetImages - get images from pdf document

SYNOPSIS

use PDF::GetImages 'pdfimages';

my $images = pdfimages('/abs/path/tofile.pdf');

DESCRIPTION

Get images out of a pdf document. This code makes use of pdfimages which is part of xpdf. In case CAM::PDF scripts don't work for you, you may want to try using this to extract images from PDF documents. See "DEPENDENCIES AND REQUIREMENTS"

pdfimages()

argument is abs path to pdf doc optional argument is a dir to which to send images extracted returns abs paths to images extracted images are extracted by default to same dir pdf is in

If this is not a pdf, the file does not exist, or no images are extracted, warns and returns empty array ref []

DEBUG

$PDF::GetImages::DEBUG = 1;

FORCE_JPG

By default pdfimages will spit out pbm or ppm image format files which are huge and unruly. If you want to make sure the images output are jpg..

$PDF::GetImages::FORCE_JPG= 1 ;

You must have imagemagick convert installed for this to work.

DEPENDENCIES AND REQUIREMENTS

This module requires Unix family operating system to be installed. You must have xpdf package and Image Magick convert installed. Presently we are using cli pdfgetimages. You must have xpdf installed on your system.

AUTHOR

Leo Charre leocharre at cpan dot org

COPYRIGHT

Copyright (c) 2008 Leo Charre. All rights reserved.

LICENSE

This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, i.e., under the terms of the "Artistic License" or the "GNU General Public License".

DISCLAIMER

This package is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

See the "GNU General Public License" for more details.

SEE ALSO

http://www.imagemagick.org/, xpdf, CAM::PDF