NAME
PDF::GetImages - get images from pdf document
SYNOPSIS
use PDF::GetImages 'pdfimages';
my $images = pdfimages('/abs/path/tofile.pdf');
DESCRIPTION
Extracts images from a PDF document. This module uses pdfimages, which is part of xpdf. If the CAM::PDF scripts don't work for you, you may want to try this module to extract images from PDF documents. See "DEPENDENCIES AND REQUIREMENTS".
pdfimages()
The argument is the absolute path to the PDF document. An optional second argument is a directory in which to place the extracted images. By default, images are extracted to the same directory the PDF is in.
Returns an array ref of absolute paths to the extracted images.
If the file is not a PDF, does not exist, or yields no images, pdfimages() warns and returns an empty array ref [].
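The calling convention described above can be sketched as follows. This is a minimal illustration, not the module's own example: both paths are hypothetical placeholders, and the output directory is assumed to exist already.

```perl
use strict;
use warnings;
use PDF::GetImages 'pdfimages';

# Hypothetical paths, for illustration only.
my $pdf     = '/abs/path/to/file.pdf';
my $out_dir = '/abs/path/to/output';    # assumed to exist already

# The second argument is optional; without it, images land next to the PDF.
my $images = pdfimages($pdf, $out_dir);

if (@$images) {
    # Each element is the absolute path of one extracted image.
    print "extracted: $_\n" for @$images;
}
else {
    # An empty array ref covers a missing file, a non-PDF,
    # or a PDF from which no images were extracted.
    warn "no images extracted from $pdf\n";
}
```

Because failure is signalled by an empty array ref rather than an exception, testing `@$images` is enough to tell whether anything was extracted.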
DEBUG
$PDF::GetImages::DEBUG = 1;
FORCE_JPG
By default, pdfimages outputs pbm or ppm image files, which are huge and unwieldy. To make sure the output images are jpg, set:
$PDF::GetImages::FORCE_JPG = 1;
You must have ImageMagick's convert installed for this to work.
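Since FORCE_JPG is a package variable, setting it affects every later call in the program. One way to limit the effect to a single call is Perl's `local`, sketched below. The path is a hypothetical placeholder, and the check assumes that converted files end in .jpg or .jpeg, which this documentation does not state.

```perl
use strict;
use warnings;
use PDF::GetImages 'pdfimages';

# Enable jpg conversion for this one call only; "local" restores
# the previous value when the enclosing block exits.
my $images = do {
    local $PDF::GetImages::FORCE_JPG = 1;
    pdfimages('/abs/path/to/file.pdf');    # hypothetical path
};

# Assumption: converted files carry a .jpg or .jpeg extension.
my @not_jpg = grep { !/\.jpe?g$/i } @$images;
warn "not converted to jpg: $_\n" for @not_jpg;
```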
DEPENDENCIES AND REQUIREMENTS
This module requires a Unix-family operating system. You must have the xpdf package and ImageMagick's convert installed. Presently the module uses the pdfimages command-line tool from xpdf.
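Before using the module, it may help to confirm that both external tools are reachable. The sketch below uses only core Perl; `find_in_path` is a hypothetical helper written for this example and is not part of PDF::GetImages.

```perl
use strict;
use warnings;
use File::Spec;

# Hypothetical helper: return the first executable file named $name
# found in the directories listed in PATH, or undef if there is none.
sub find_in_path {
    my ($name) = @_;
    for my $dir (File::Spec->path) {
        my $candidate = File::Spec->catfile($dir, $name);
        return $candidate if -f $candidate && -x _;
    }
    return;
}

# pdfimages ships with xpdf; convert ships with ImageMagick.
for my $tool (qw(pdfimages convert)) {
    my $found = find_in_path($tool);
    print defined $found
        ? "$tool: $found\n"
        : "$tool: not found on PATH\n";
}
```

This only checks that a file with the right name is executable. It does not verify that the tool is the expected program or version.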
AUTHOR
Leo Charre leocharre at cpan dot org
COPYRIGHT
Copyright (c) 2008 Leo Charre. All rights reserved.
LICENSE
This package is free software; you can redistribute it and/or modify it under the same terms as Perl itself, i.e., under the terms of the "Artistic License" or the "GNU General Public License".
DISCLAIMER
This package is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
See the "GNU General Public License" for more details.
SEE ALSO
http://www.imagemagick.org/, xpdf, CAM::PDF