NAME
NexTrieve::UTF8 - change encoding to UTF-8
SYNOPSIS
use NexTrieve;
$ntv = NexTrieve->new( | {method => value} );
$xml = $ntv->recode( $to,$xml,$from );
DESCRIPTION
The UTF8 package of the Perl support for NexTrieve. Does not create an object but is accessed indirectly through the "recode" method of NexTrieve.pm.
NO METHODS
This is only a support package: hence there is no object and hence there are no methods to call.
CONVERT TO UTF-8
The following subroutines are available from this package for performing conversions to UTF-8. Each of these subroutines do an in-place conversion of the input parameter, e.g. NexTrieve::UTF8::iso88591( $xml );. If any characters are found that are illegal for the encoding, they will be replaced by spaces.
ucs2be convert from ucs-2be to UTF-8
ucs2le convert from ucs-2le to UTF-8
ucs4be convert from ucs-4be to UTF-8
ucs4le convert from ucs-4le to UTF-8
iso88591 convert from iso-8859-1 to UTF-8
iso885910 convert from iso-8859-10 to UTF-8
iso885913 convert from iso-8859-13 to UTF-8
iso885914 convert from iso-8859-14 to UTF-8
iso885915 convert from iso-8859-15 to UTF-8
iso885916 convert from iso-8859-16 to UTF-8
iso88592 convert from iso-8859-2 to UTF-8
iso88593 convert from iso-8859-3 to UTF-8
iso88594 convert from iso-8859-4 to UTF-8
iso88595 convert from iso-8859-5 to UTF-8
iso88596 convert from iso-8859-6 to UTF-8
iso88597 convert from iso-8859-7 to UTF-8
iso88598 convert from iso-8859-8 to UTF-8
iso88599 convert from iso-8859-9 to UTF-8
windows1250 convert from windows-1250 to UTF-8
windows1251 convert from windows-1251 to UTF-8
windows1252 convert from windows-1252 to UTF-8
windows1253 convert from windows-1253 to UTF-8
windows1254 convert from windows-1254 to UTF-8
windows1255 convert from windows-1255 to UTF-8
windows1256 convert from windows-1256 to UTF-8
windows1257 convert from windows-1257 to UTF-8
windows1258 convert from windows-1258 to UTF-8
windows874 convert from windows-874 to UTF-8
xmacroman convert from x-mac-roman to UTF-8
REMOVE ILLEGAL CHARACTERS
The following subroutines are available from this packages for removing characters that are illegal in the indicated encoding. Each of these subroutines do an in-place conversion of the input parameter, e.g. NexTrieve::UTF8::iso88593( $xml ).
illegal_iso88593 remove characters illegal for iso-8859-3
illegal_iso88596 remove characters illegal for iso-8859-6
illegal_iso88597 remove characters illegal for iso-8859-7
illegal_iso88598 remove characters illegal for iso-8859-8
illegal_windows1250 remove characters illegal for windows-1250
illegal_windows1251 remove characters illegal for windows-1251
illegal_windows1252 remove characters illegal for windows-1252
illegal_windows1253 remove characters illegal for windows-1253
illegal_windows1254 remove characters illegal for windows-1254
illegal_windows1255 remove characters illegal for windows-1255
illegal_windows1257 remove characters illegal for windows-1257
illegal_windows1258 remove characters illegal for windows-1258
illegal_windows874 remove characters illegal for windows-874
Please note that in some encodings there are no characters that are considered to be illegal (such as iso-8859-1). In that case, the subroutine still exists but does not perform any operation.
AUTHOR
Elizabeth Mattijsen, <liz@dijkmat.nl>.
Please report bugs to <perlbugs@dijkmat.nl>.
SUPPORT
NexTrieve is no longer being supported.
COPYRIGHT
Copyright (c) 1995-2003 Elizabeth Mattijsen <liz@dijkmat.nl>. All rights reserved. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
SEE ALSO
The NexTrieve.pm and the other NexTrieve::xxx modules.