NAME

SWISH::Filters::PDF2XML - Perl extension for filtering PDF documents

DESCRIPTION

This is a plug-in module for SWISH::Filter that uses the CAM::PDF package to convert PDF documents to XML. Any info tags found in the PDF document are emitted as meta tags in the XML output.

You may pass to SWISH::Filter's filter method, through the user_data parameter, the name of a PDF info tag to use as the XML <title>:

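# Name the PDF info tag whose value should become the XML <title>.
my %user_data;
$user_data{pdf}{title_tag} = 'title';

# $filter is an existing SWISH::Filter object; $filename is the PDF to convert.
my $was_filtered = $filter->filter(
    document  => $filename,
    user_data => \%user_data,
);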

If the PDF contains an info tag named "title", its value is used as the XML <title>. If no title_tag is passed, "title" is used as the default tag.
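The selection rule described above can be sketched in plain Perl. This is only an illustration under stated assumptions, not the module's actual code: the helper names escape_xml and info_to_xml_head, the sample info tags, and the exact <meta> layout are all hypothetical, and the real filter may match tag names or format its output differently.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of SWISH::Filters::PDF2XML):
# escape the characters that are special in XML text and attributes.
sub escape_xml {
    my ($text) = @_;
    $text =~ s/&/&amp;/g;
    $text =~ s/</&lt;/g;
    $text =~ s/>/&gt;/g;
    $text =~ s/"/&quot;/g;
    return $text;
}

# Hypothetical helper: turn a hash of PDF info tags into a <title>
# line plus one <meta> line per tag. The output layout is assumed.
sub info_to_xml_head {
    my ( $info, $user_data ) = @_;
    $user_data ||= {};

    # Fall back to 'title' when the caller supplies no title_tag.
    my $title_tag = $user_data->{pdf}{title_tag};
    $title_tag = 'title' unless defined $title_tag;

    my @lines;
    if ( defined $info->{$title_tag} ) {
        push @lines,
            '<title>' . escape_xml( $info->{$title_tag} ) . '</title>';
    }

    # Every info tag, including the one used as the title, becomes a meta tag.
    for my $name ( sort keys %$info ) {
        push @lines,
            sprintf( '<meta name="%s" content="%s"/>',
                escape_xml($name), escape_xml( $info->{$name} ) );
    }
    return join "\n", @lines;
}

# Made-up info tags standing in for what a PDF might contain.
my %info = (
    title   => 'Annual Report',
    author  => 'A. Writer',
    subject => 'Finance & Budget',
);

# Use the "subject" info tag as the title instead of the default.
print info_to_xml_head( \%info, { pdf => { title_tag => 'subject' } } ), "\n";
```

Passing an empty hash reference as the second argument exercises the default case, where the "title" info tag supplies the <title>.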

AUTHOR

Peter Karman

SEE ALSO

SWISH::Filter