Changes for version 1.003.520
- NOTE: For changes on linebreak library see linebreak/ChangeLog.
- Fix: Makefile.PL: fix for dmake+MSWin32 (hopefully).
- Fix: LineBreak.xs: UTF8_MAXBYTES exists only as of Perl 5.8.8. Use UTF8_MAXLEN instead.
- Fix: URGENT_BREAKING_FUNCS{FORCE}: Workaround for Perl5 RT #69422 (utf8_mg_pos_cache_update panic) on Perl 5.10.1 and (maybe) 5.10.0.
- Imp: LineBreak.xs: Check data type of self argument.
- Imp: preprocess() takes a Perl (utf8) string and returns an array. Faster.
- Imp: XS'ized break().
- New: Unicode::GCString::join().
- Added test for Format feature.
- Doc: Small fixes.
Changes for version 1.003_11
- Imp: Almost full XS version. Isolated pure-C code into the linebreak library.
- Chg: Update Unicode data to version 5.2.0.
- Doc: Small fix.
Changes for version 1.003_09
- Transitional release - NOTE that this release is VERY slow.
- Imp/Chg: Custom functions (for sizing, user breaking, urgent breaking and format) accept grapheme cluster strings.
- Cleanup: Separated C code.
- Doc: Added more examples.
- Doc: Added link to CPAN RT page.
- Doc: Added a bug to BUGS section.
Changes for version 1.003_06
- Fix: Restored LB9: Some CM characters may form a single grapheme cluster on their own, as their Grapheme_Cluster_Break property is Control.
- Imp: Almost XS'ized grapheme cluster processing. Approx. 25% faster.
Changes for version 1.003_05
- Chg: Include suffixing ``URL:'' to URI pattern.
- Imp: Efficient user-breaking. Approx. 10% faster.
Changes for version 1.003_03
- Chg: Deprecated non-XS version.
- Imp: XS'ized character data. Implemented hash index for searching. Approx. 10% faster in speed, approx. 40KB more in memory usage.
- Removed ranges of trivial entries (e.g. XX on lbmap), reserved for CJK ideographs etc. from character data maps.
- Updated Unicode data for 5.2.0beta to those of Aug 24.
- Doc: Fixed attributes for some UCS ranges. Corrected several typos.
Documentation
Line Folding for Plain Text (Japanese)
String as Sequence of UAX #29 Grapheme Clusters (Japanese)
UAX #14 Unicode Line Breaking Algorithm (Japanese)
Modules
Line Folding for Plain Text
String as Sequence of UAX #29 Grapheme Clusters
UAX #14 Unicode Line Breaking Algorithm
Provides
in lib/Unicode/LineBreak/Constants.pm
in lib/Unicode/LineBreak/Version.pm