NAME
Font::TTF::Cmap - Character map table
DESCRIPTION
Looks after the character map. The primary structure used for handling a cmap is the Font::TTF::Segarr which handles the segmented arrays of format 4 tables, and in a simpler form for format 0 tables.
Due to the complexity of working with segmented arrays, most of the handling of such arrays is via methods rather than via instance variables.
One important feature of a format 4 table is that it always contains a segment with a final address of 0xFFFF. If you are creating a table from scratch this is important (although Font::TTF::Segarr can work quite happily without it).
INSTANCE VARIABLES
The instance variables listed here are not preceeded by a space due to their emulating structural information in the font.
- Num
-
Number of subtables in this table
- Tables
-
An array of subtables ([0..Num-1])
Each subtables also has its own instance variables which are, again, not preceeded by a space.
- Platform
-
The platform number for this subtable
- Encoding
-
The encoding number for this subtable
- Format
-
Gives the stored format of this subtable
- Ver
-
Gives the version (or language) information for this subtable
- val
-
A hash keyed by the codepoint value (not a string) storing the glyph id
METHODS
$t->read
Reads the cmap into memory. Format 4 subtables read the whole subtable and fill in the segmented array accordingly.
Format 2 subtables are not read at all.
$t->ms_lookup($uni)
Given a Unicode value in the MS table (Platform 3, Encoding 1) locates that table and looks up the appropriate glyph number from it.
$t->find_ms
Finds the Microsoft Unicode table and sets the mstable
instance variable to it if found. Returns the table it finds.
$t->out($fh)
Writes out a cmap table to a filehandle. If it has not been read, then just copies from input file to output
$t->XML_element($context, $depth, $name, $val)
Outputs the elements of the cmap in XML. We only need to process val here
@map = $t->reverse([$num])
Returns a reverse map of the table of given number or the Microsoft cmap. I.e. given a glyph gives the Unicode value for it.
is_unicode($index)
Returns whether the table of a given index is known to be a unicode table (as specified in the specifications)
BUGS
No support for format 2 tables (MBCS)
Read only support for formats 8, 10 & 12. No ability to write these formats
AUTHOR
Martin Hosken Martin_Hosken@sil.org. See Font::TTF::Font for copyright and licensing.