NAME
Win32::MultiLanguage - Interface to IMultiLanguage I18N routines
SYNOPSIS
use Win32::MultiLanguage;
# @@
DESCRIPTION
Win32::MultiLanguage is an experimental wrapper module for the Windows IMultiLanguage interfaces that comes with Internet Explorer version 4 and later. Mlang.dll implements routines for dealing with character encodings, code pages, and locales.
CONSTANTS
MLDETECTCP
- MLDETECTCP_NONE
-
Default setting will be used.
- MLDETECTCP_7BIT
-
Input stream consists of 7-bit data.
- MLDETECTCP_8BIT
-
Input stream consists of 8-bit data.
- MLDETECTCP_DBCS
-
Input stream consists of double-byte data.
- MLDETECTCP_HTML
-
Input stream is an HTML page.
MIMECONTF
- MIMECONTF_MAILNEWS
-
Code page is meant to display on mail and news clients.
- MIMECONTF_BROWSER
-
Code page is meant to display on browser clients.
- MIMECONTF_MINIMAL
-
Code page is meant to display in minimal view. This value is generally not used.
- MIMECONTF_IMPORT
-
Value that indicates that all of the import code pages should be enumerated.
- MIMECONTF_SAVABLE_MAILNEWS
-
Code page includes encodings for mail and news clients to save a document in.
- MIMECONTF_SAVABLE_BROWSER
-
Code page includes encodings for browser clients to save a document in.
- MIMECONTF_EXPORT
-
Value that indicates that all of the export code pages should be enumerated.
- MIMECONTF_PRIVCONVERTER
-
Value that indicates the encoding requires (or has) a private conversion engine. A client of IEnumCodePage doesn't use this value.
- MIMECONTF_VALID
-
Value that indicates the corresponding encoding is supported on the system.
- MIMECONTF_VALID_NLS
-
Value that indicates that only the language support file should be validated. Normally, both the language support file and the supporting font are checked.
- MIMECONTF_MIME_IE4
-
Value that indicates the Microsoft® Internet Explorer 4.0 MIME data from MLang's internal data should be used.
- MIMECONTF_MIME_LATEST
-
Value that indicates that the latest MIME data from MLang's internal data should be used.
- MIMECONTF_MIME_REGISTRY
-
Value that indicates that the MIME data stored in the registry should be used.
MLDETECTF
- MLDETECTF_MAILNEWS
-
Not currently supported.
- MLDETECTF_BROWSER
-
Not currently supported.
- MLDETECTF_VALID
-
Detection result must be valid for conversion and text rendering.
- MLDETECTF_VALID_NLS
-
Detection result must be valid for conversion.
- MLDETECTF_PRESERVE_ORDER
-
Preserve preferred code page order. This is meaningful only if you have set the @@puiPreferredCodePages parameter in
DetectOutboundCodePage
. - MLDETECTF_PREFERRED_ONLY
-
Only return one of the preferred code pages as the detection result. This is meaningful only if you have set the @@puiPreferredCodePages parameter in
DetectOutboundCodePage
. - MLDETECTF_FILTER_SPECIALCHAR
-
Filter out graphical symbols and punctuation.
ROUTINES
- DetectInputCodepage($octets [, $flags [, $codepage]])
-
Detects the code page of the given string $octets. An optional $flags parameter may be specified, a combination of
MLDETECTCP
constants as defined above, if not specifiedMLDETECTCP_NONE
will be used as default. An optional $codepage can also be specified, if this value is set to zero, this API returns all possible encodings. Otherwise, it lists only those encodings related to this parameter. The default is zero.It will return a reference to an array of hash references of which each represents a
DetectEncodingInfo
strucure with the following keysLangID => ..., # primary language identifier CodePage => ..., # detected Win32-defined code page DocPercent => ..., # Percentage in the detected language Confidence => ..., # degree to which the detected data is correct
See http://msdn.microsoft.com/workshop/misc/mlang/reference/structures/detectencodinginfo.asp for details.
- GetCodePageInfo($codepage, $langid)
-
...
- GetCodePageDescription($codepage, $locale)
-
...
- GetRfc1766FromLcid($locale)
-
...
- DetectOutboundCodePage($utf8 [, $flags [, \@cp ]])
-
...
- GetCharsetInfo($charset)
-
...
IMPLEMENTATION STATUS
Legend:
+ means is implemented
? means might get implemented
- means unlikely that this gets implemented
IMultiLanguage
+ GetCharsetInfo
+ GetRfc1766FromLcid
? GetNumberOfCodePageInfo
? GetFamilyCodePage
? EnumCodePages
? IsConvertible
? ConvertString
? GetLcidFromRfc1766
? EnumRfc1766
? GetRfc1766Info
- ConvertStringToUnicode
- ConvertStringFromUnicode
- ConvertStringReset
- CreateConvertCharset
IMultiLanguage2
+ GetCodePageInfo
+ DetectInputCodepage
+ GetCodePageDescription
? ValidateCodePage
? IsCodePageInstallable
? GetNumberOfScripts
? EnumScripts
? ValidateCodePageEx
- ConvertStringInIStream
- ConvertStringToUnicodeEx
- ConvertStringFromUnicodeEx
- DetectCodepageInIStream
- SetMimeDBSource
IMultiLanguage3
+ DetectOutboundCodePage
- DetectOutboundCodePageInIStream
SEE ALSO
WARNING
This is pre-alpha software.
AUTHOR AND COPYRIGHT
Copyright (C) 2004 by Bjoern Hoehrmann <bjoern@hoehrmann.de>.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.2 or, at your option, any later version of Perl 5 you may have available.
1 POD Error
The following errors were encountered while parsing the POD:
- Around line 151:
Non-ASCII character seen before =encoding in 'Microsoft®'. Assuming CP1252