NAME
MARC::Charset::EastAsian - MARC8/UTF8 mappings
SYNOPSIS
use MARC::Charset::EastAsian;
my $cs = MARC::Charset::EastAsian->new();
DESCRIPTION
MARC::Charset::EastAsian provides a mapping between the MARC8 EastAsian character set and Unicode (UTF8). It is normally used internally by MARC::Charset, so you probably don't need to use it directly.
Because there are so many EastAsian characters, lookup() reads from a tied Berkeley DB file rather than an in-memory table, which conserves memory. The database file was built and installed when you installed MARC::Charset.
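The tied-file idea can be illustrated with a minimal sketch. It is not this module's code: it assumes SDBM_File (bundled with Perl) as a stand-in for the Berkeley DB binding, writes to a temporary file, and uses a placeholder key and value rather than a real MARC8 to UTF8 mapping.

```perl
use strict;
use warnings;
use Fcntl;                       # O_RDWR, O_CREAT
use SDBM_File;                   # core stand-in for the Berkeley DB binding
use File::Temp qw(tempdir);

my $dir = tempdir(CLEANUP => 1);

# Tie a hash to an on-disk file: entries live in the file, not in memory.
tie my %table, 'SDBM_File', "$dir/demo", O_RDWR | O_CREAT, 0666
    or die "Cannot tie table: $!";

# Placeholder entry, not a real MARC8 to UTF8 mapping.
$table{"\x21\x30\x21"} = "placeholder";

# A lookup is an ordinary hash read; the tie layer fetches it from disk.
my $hit  = $table{"\x21\x30\x21"};
my $miss = $table{"\x7f\x7f\x7f"};

print defined $hit  ? "found: $hit\n"  : "not found\n";
print defined $miss ? "found: $miss\n" : "not found\n";

untie %table;
```

Because the hash is tied, the calling code never loads the full table; each read touches only the requested key.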
METHODS
new()
The constructor. Returns a MARC::Charset::EastAsian object.
name()
Returns the name of the character set.
lookup()
The workhorse method that does the lookup. Pass it a MARC8 character and you'll get back the corresponding UTF8 character.
combining()
Pass it a character and you'll get back a true value (1) if the character is a combining character, and false (undef) if it is not.
getCharSize()
Returns the number of bytes in each character of this character set.
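As a rough sketch of how getCharSize() and lookup() might be combined, the example below splits a byte string into fixed-width chunks and converts each one. To stay runnable without the installed database, it uses a hypothetical stand-in class (Demo::FixedWidthCharset) that only mimics the two method names; the width of 3 and the table entries are assumptions for the demo, not real mappings.

```perl
use strict;
use warnings;

# Hypothetical stand-in exposing the same method names as documented above;
# the real class reads its table from the installed Berkeley DB file.
package Demo::FixedWidthCharset;

sub new {
    my ($class, %args) = @_;
    return bless { size => $args{size}, table => $args{table} }, $class;
}
sub getCharSize { $_[0]{size} }
sub lookup      { $_[0]{table}{ $_[1] } }

package main;

# Placeholder entries, not real mappings.
my $cs = Demo::FixedWidthCharset->new(
    size  => 3,
    table => { "AAA" => "x", "BBB" => "y" },
);

# Split the input into chunks of getCharSize() bytes and convert each one.
sub convert {
    my ($charset, $bytes) = @_;
    my $size = $charset->getCharSize();
    my $out  = '';
    for (my $i = 0; $i + $size <= length $bytes; $i += $size) {
        my $chunk = substr $bytes, $i, $size;
        my $char  = $charset->lookup($chunk);
        $out .= defined $char ? $char : '?';   # '?' marks an unmapped chunk
    }
    return $out;
}

print convert($cs, "AAABBBCCC"), "\n";   # prints "xy?"
```

Asking the object for its character size, rather than hard-coding it, keeps the loop usable with character sets of other widths.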