NAME

Lingua::HE::MacHebrew - transcoding between Mac OS Hebrew encoding and Unicode

SYNOPSIS

(1) using function names exported by default:

use Lingua::HE::MacHebrew;
$wchar = decodeMacHebrew($octet);
$octet = encodeMacHebrew($wchar);

(2) using function names exported on request:

use Lingua::HE::MacHebrew qw(decode encode);
$wchar = decode($octet);
$octet = encode($wchar);

(3) using function names fully qualified:

 use Lingua::HE::MacHebrew ();
 $wchar = Lingua::HE::MacHebrew::decode($octet);
 $octet = Lingua::HE::MacHebrew::encode($wchar);

# $wchar : a string in Perl's Unicode format
# $octet : a string in Mac OS Hebrew encoding

DESCRIPTION

This module provides decoding from/encoding to Mac OS Hebrew encoding (denoted MacHebrew hereafter).

Features

bidi support: Functions provided here should cope with Unicode accompanied with some directional formatting codes: i.e. PDF (or U+202C), LRO (or U+202D), and RLO (or U+202E).
expansion/contraction: e.g. decode("\xC0") returns "\x{F86A}\x{05DC}\x{05B9}" and encode("\x{F86A}\x{05DC}\x{05B9}") returns "\xC0".

Functions

$wchar = decode($octet)

$wchar = decodeMacHebrew($octet)

Converts MacHebrew to Unicode.

decodeMacHebrew() is an alias for decode() exported by default.

$octet = encode($wchar)

$octet = encode($handler, $wchar)

$octet = encodeMacHebrew($wchar)

$octet = encodeMacHebrew($handler, $wchar)

Converts Unicode to MacHebrew.

encodeMacHebrew() is an alias for encode() exported by default.

If the $handler is not specified, any character that is not mapped to MacHebrew is deleted; if the $handler is a code reference, a string returned from that coderef is inserted there. if the $handler is a scalar reference, a string (a PV) in that reference (the referent) is inserted there.

The 1st argument for the $handler coderef is the Unicode code point (integer) of the unmapped character.

E.g.

sub hexNCR { sprintf("&#x%x;", shift) } # hexadecimal NCR
sub decNCR { sprintf("&#%d;" , shift) } # decimal NCR

print encodeMacHebrew("ABC\x{100}\x{10000}");
# "ABC"

print encodeMacHebrew(\"", "ABC\x{100}\x{10000}");
# "ABC"

print encodeMacHebrew(\"?", "ABC\x{100}\x{10000}");
# "ABC??"

print encodeMacHebrew(\&hexNCR, "ABC\x{100}\x{10000}");
# "ABC&#x100;&#x10000;"

print encodeMacHebrew(\&decNCR, "ABC\x{100}\x{10000}");
# "ABC&#256;&#65536;"

CAVEAT

Sorry, the author is not working on a Mac OS. Please let him know if you find something wrong.

Maybe bug?: The (default) paragraph direction is not resolved. Does Mac always surround by LRO..PDF or RLO..PDF the characters with bidirectional type to be overridden?

AUTHOR

SADAHIRO Tomoyuki <SADAHIRO@cpan.org>

This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

Map (external version) from Mac OS Hebrew character set to Unicode 2.1 and later (version: c02 2005-Apr-05): http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/HEBREW.TXT
Registry (external version) of Apple use of Unicode corporate-zone characters (version: c03 2005-Apr-04): http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/CORPCHAR.TXT
The Bidirectional Algorithm: http://www.unicode.org/reports/tr9/

To install Lingua::HE::MacHebrew, copy and paste the appropriate command in to your terminal.

cpanm

cpanm Lingua::HE::MacHebrew

CPAN shell

perl -MCPAN -e shell
install Lingua::HE::MacHebrew

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	go to github issues (only if github is preferred repository)

	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)