NAME

File::Extract - Extract Text From Arbitrary File Types

SYNOPSIS

use File::Extract;
my $e = File::Extract->new();
my $r = $e->extract($filename);

my $e = File::Extract->new(encodings => [...]);

my $class = "MyExtractor";
File::Extract->register($class);

DESCRIPTION

File::Extract is a framework to extract text data out of arbitrary file types, useful to collect data for indexing.

CLASS METHODS

register($class)

Registers a new text-extractor. The specified class needs to implement two functions:

mime_type(void): Returns the MIME type that $class can extract files from.
extract($file): Extracts the text from $file. Returns a File::Extract::Result object.

METHODS

encodings: List of encodings that you expect your files to be in. This is used to re-encode and normalize the contents of the file via Encode::Guess.
output_encoding: The final encoding that you the extracted test to be in. The default encoding is UTF8.

new(%args)

extract($file)

AUTHOR

2 POD Errors

The following errors were encountered while parsing the POD:

Around line 125:: '=item' outside of any '=over'
Around line 135:: You forgot a '=back' before '=head2'

To install File::Extract, copy and paste the appropriate command in to your terminal.

cpanm

cpanm File::Extract

CPAN shell

perl -MCPAN -e shell
install File::Extract

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	go to github issues (only if github is preferred repository)

	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)