0.01_05 2011-09-20
- Correct maximal subpart implementation
An initial subsequence of a ill-formed sequence is not maximal subpart.
<C0 80> -> <FFFD FFFD>
<ED A0 80> -> <FFFD FFFD FFFD>
<EF BF BF> -> <FFFD>
<F4 80 80> -> <FFFD>
<F4 90 80 80> -> <FFFD FFFD FFFD FFFD>
Unicode v6.0:
D93b Maximal subpart of an ill-formed subsequence: The longest code unit
subsequence starting at an unconvertible offset that is either:
a. the initial subsequence of a well-formed code unit sequence, or
b. a subsequence of length one.
0.01_04 2011-09-17
- croak if Perl's internal representation of wide characters is ill-formed.
- Fixed a bug in replacement handling.
- Added a test for replacement handling.
0.01_03 2011-09-16
- Removed the "Can't represent restricted code point" error, code points
above U+10FFFF is reported as "Can't represent super code point".
- Instead of just croaking use the 'utf8' warnings category and
leave the choise of error reporting to the user.
- Maximal subpart of an ill-formed subsequence is replaced with U+FFFD as
recomended by Unicode.
0.01_02 2011-09-13
- Changed wording in encoding exception messages from "Can't map \w+ code point"
to "Can't represent \w+ code point",
- Added a taint test.
- Added a leaks test.
0.01_01 2011-09-12
- Initial CPAN release.