NAME

NexTrieve::UTF8 - change encoding to UTF-8

SYNOPSIS

use NexTrieve;
$ntv = NexTrieve->new( | {method => value} );
$xml = $ntv->recode( $to,$xml,$from );

DESCRIPTION

The UTF8 package of the Perl support for NexTrieve. It does not create an object; instead, it is accessed indirectly through the "recode" method of NexTrieve.pm.

NO METHODS

This is only a support package: there is no object, and hence there are no methods to call.

CONVERT TO UTF-8

The following subroutines are available from this package for performing conversions to UTF-8. Each of these subroutines does an in-place conversion of the input parameter, e.g. NexTrieve::UTF8::iso88591( $xml ). Any characters that are illegal for the source encoding are replaced by spaces.

ucs2be	convert from ucs-2be to UTF-8
ucs2le	convert from ucs-2le to UTF-8
ucs4be	convert from ucs-4be to UTF-8
ucs4le	convert from ucs-4le to UTF-8
iso88591	convert from iso-8859-1 to UTF-8
iso885910	convert from iso-8859-10 to UTF-8
iso885913	convert from iso-8859-13 to UTF-8
iso885914	convert from iso-8859-14 to UTF-8
iso885915	convert from iso-8859-15 to UTF-8
iso885916	convert from iso-8859-16 to UTF-8
iso88592	convert from iso-8859-2 to UTF-8
iso88593	convert from iso-8859-3 to UTF-8
iso88594	convert from iso-8859-4 to UTF-8
iso88595	convert from iso-8859-5 to UTF-8
iso88596	convert from iso-8859-6 to UTF-8
iso88597	convert from iso-8859-7 to UTF-8
iso88598	convert from iso-8859-8 to UTF-8
iso88599	convert from iso-8859-9 to UTF-8
windows1250	convert from windows-1250 to UTF-8
windows1251	convert from windows-1251 to UTF-8
windows1252	convert from windows-1252 to UTF-8
windows1253	convert from windows-1253 to UTF-8
windows1254	convert from windows-1254 to UTF-8
windows1255	convert from windows-1255 to UTF-8
windows1256	convert from windows-1256 to UTF-8
windows1257	convert from windows-1257 to UTF-8
windows1258	convert from windows-1258 to UTF-8
windows874	convert from windows-874 to UTF-8
xmacroman	convert from x-mac-roman to UTF-8
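
The "in-place" calling convention described above can be sketched in plain Perl. This is only an illustration under stated assumptions: sketch_iso88591 is a hypothetical stand-in written for this example, not the code of NexTrieve::UTF8, and it treats its argument as a string of bytes. It modifies $_[0], which is an alias for the caller's variable, so nothing needs to be returned.

```perl
use strict;
use warnings;

# Hypothetical stand-in, NOT the NexTrieve::UTF8 implementation.
# Converts a byte string in iso-8859-1 to UTF-8 bytes in place.
sub sketch_iso88591 {
    # $_[0] aliases the caller's variable, so this edits it directly.
    # Each byte 0x80..0xFF becomes the two-byte UTF-8 sequence for the
    # code point with the same number; ASCII bytes are left untouched.
    $_[0] =~ s/([\x80-\xff])/chr(0xc0 | (ord($1) >> 6)) . chr(0x80 | (ord($1) & 0x3f))/ge;
    return;
}

my $xml = "caf\xe9";            # "e acute" as the single iso-8859-1 byte 0xE9
sketch_iso88591($xml);          # no assignment needed: $xml itself changes
print unpack("H*", $xml), "\n"; # prints 636166c3a9
```

With the module installed, the documented equivalent is NexTrieve::UTF8::iso88591( $xml ), called the same way and without using a return value.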

REMOVE ILLEGAL CHARACTERS

The following subroutines are available from this package for removing characters that are illegal in the indicated encoding. Each of these subroutines does an in-place modification of the input parameter, e.g. NexTrieve::UTF8::illegal_iso88593( $xml ).

illegal_iso88593	remove characters illegal for iso-8859-3
illegal_iso88596	remove characters illegal for iso-8859-6
illegal_iso88597	remove characters illegal for iso-8859-7
illegal_iso88598	remove characters illegal for iso-8859-8
illegal_windows1250	remove characters illegal for windows-1250
illegal_windows1251	remove characters illegal for windows-1251
illegal_windows1252	remove characters illegal for windows-1252
illegal_windows1253	remove characters illegal for windows-1253
illegal_windows1254	remove characters illegal for windows-1254
illegal_windows1255	remove characters illegal for windows-1255
illegal_windows1257	remove characters illegal for windows-1257
illegal_windows1258	remove characters illegal for windows-1258
illegal_windows874	remove characters illegal for windows-874

Please note that some encodings (such as iso-8859-1) have no characters that are considered to be illegal. In that case, the subroutine still exists but does not perform any operation.
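
As a rough illustration of what such a clean-up involves, the sketch below handles windows-1252, in which the byte values 0x81, 0x8D, 0x8F, 0x90 and 0x9D have no character assigned. The subroutine sketch_illegal_windows1252 is a hypothetical stand-in, not the module's own code. It is also an assumption that illegal bytes are turned into spaces (as the conversion subroutines are documented to do) rather than deleted outright.

```perl
use strict;
use warnings;

# Hypothetical stand-in, NOT the NexTrieve::UTF8 implementation.
sub sketch_illegal_windows1252 {
    # The five byte values that windows-1252 leaves unassigned are each
    # replaced by a space (assumed behaviour). tr/// repeats the last
    # replacement character, so one space covers the whole search list.
    # $_[0] aliases the caller's variable, so the change is in place.
    $_[0] =~ tr/\x81\x8d\x8f\x90\x9d/ /;
    return;
}

my $text = "price\x81: 10\x80";    # 0x81 is unassigned, 0x80 is the euro sign
sketch_illegal_windows1252($text);
print $text =~ /\x81/ ? "still dirty\n" : "clean\n";   # prints "clean"
```

To delete the bytes instead of blanking them, the same idea works with an empty replacement list and the /d modifier of tr.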

AUTHOR

Elizabeth Mattijsen, <liz@dijkmat.nl>.

Please report bugs to <perlbugs@dijkmat.nl>.

SUPPORT

NexTrieve is no longer being supported.

COPYRIGHT

Copyright (c) 1995-2003 Elizabeth Mattijsen <liz@dijkmat.nl>. All rights reserved. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

SEE ALSO

The NexTrieve.pm and the other NexTrieve::xxx modules.