NAME
CharsetDetector - Detect charset
SYNOPSIS
use CharsetDetector;
my $binary = "...";
my $charset_str = "charset = ...";
my $charset = CharsetDetector::detect($binary);
$charset = CharsetDetector::detect1($binary, 5);
# Test the charset of $binary
# '' for undef
# 'iso-8859-1' for ''
$charset = CharsetDetector::detect($binary, 5);
$charset = CharsetDetector::detect1($binary, 5);
# Test the charset of substr($binary, 0, 5)
$charset = CharsetDetector::detect($charset_str);
# Test the charset in $charset_str
$charset = CharsetDetector::detect_debug($binary);
print $CharsetDetector::log_txt;
# you can see the log of testing $binary
The synopsis above only lists the major methods and parameters.
Basic Function
detect - detect charset
$charset = CharsetDetector::detect($binary [, $max_len]);
$charset = CharsetDetector::detect($charset_str [, $max_len]);
# $charset_str is like "charset=..."
# if input is '', output is 'iso-8859-1'
# if input is undef, output is ''
detect1 - detect charset only in encoding
$charset = CharsetDetector::detect1($binary [, $max_len]);
# if input is '', output is 'iso-8859-1'
# if input is undef, output is ''
detect_debug - detect charset and then you can see the log
$charset = CharsetDetector::detect_debug($binary [, $max_len]);
print $CharsetDetector::log_txt;
# if input is '', output is 'iso-8859-1'
# if input is undef, output is ''
COPYRIGHT
The CharsetDetector module is Copyright (c) 2003-2006 QIAN YU. All rights reserved.
You may distribute under the terms of either the GNU General Public License or the Artistic License, as specified in the Perl README file.