1.09 Thu Dec 18 21:39:18 2014
- XS: a workaround for perl 5.6.x to handle noncharacters U+FFFF etc.
is abandoned. Perl 5.8.0 or later is recommended for handling these
noncharacters.
1.08 Sat Dec 6 20:12:55 2014
- DUCET is updated (for Unicode 7.0.0) as Collate/allkeys.txt.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- The default UCA_Version is 30.
- *.pl and *.pm are updated so that they have same the version number.
- XS: for the world without utf8n_to_uvuni().
1.06 Tue May 27 21:11:09 2014
- 0.67's improved discontiguous contractions is invalidated by default
and is supported as a parameter 'long_contraction.'
1.05 Sat May 24 16:30:42 2014
- XS: avoid unused expression 1; for no-op.
(see [rt.cpan.org #95866] compilation noise)
Do it for all packages that
* mention perl, or
* have a directory name starting with p5-*, or
* depend on a package starting with p5-
like last time, for 5.18, where this didn't lead to complaints.
Let me know if you have any this time.
1.04 Sat Dec 7 11:34:18 2013
- XS: a workaround for perl 5.6.x to handle U+FFFF correctly.
unpack_U() is implemented by using XS again as well as that in 1.02,
but now that is used only in the versions before perl 5.8.0.
1.03 Sun Dec 1 21:45:46 2013
- XS: now unpack_U() uses unpack('U*') in pure perl.
avoid XS for the internal "utf8" encoding of perl.
1.02 Sun Nov 10 18:39:37 2013
- POD: fix [rt.cpan.org #90170] about iso-8859-1 letters in pod.
E<> is used for the compatibility with perl 5.6.1 and possibly EBCDIC.
- 1.01 forgot to increase the version number of CJK/Korean.pm.
- modified tests: cjkrange.t, compatui.t, hangtype.t, illegal.t,
loc_ja.t, loc_ta.t, overcjk0.t, overcjk1.t, view.t in t.
1.01 Sat Nov 2 19:00:38 2013
- DUCET is updated (for Unicode 6.3.0) as Collate/allkeys.txt.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- The default UCA_Version is 28.
- Locale/*.pl (except fr.pl) and CJK/Korean.pm are updated.
- modified tests: loc_es.t, loc_estr.t, rewrite.t, version.t in t.
1.00 Sun Oct 27 13:22:17 2013
- When a subroutine by 'overrideOut' taking a out-of-range value and
returning undef, now the value is treated as if it were U+FFFD.
* 0.99 wrongly calculates implicit weights based on out-of-range values.
- Assertion using unpack 'U' is added. If not only pack('U') but also
unpack('U') of CORE:: don't work as expected, this module will die.
0.99 Sun Sep 1 12:46:14 2013
- by default out-of-range values are treated as if it were U+FFFD
when UCA_Version >= 22.
- supported overriding out-of-range values (see 'overrideOut' in POD).
- modified tests: override.t, illegal.t in t.
0.98 Sat Jun 15 19:44:06 2013
- typo (see [rt.cpan.org #85655] typo fixes)
a) refer 'perl' in their Makefile, or
b) have a directory name of p5-*, or
c) have any dependency on any p5-* package
Like last time, where this caused no complaints.
Changes from previous:
----------------------
0.97 Sat Dec 22 14:25:50 2012
- bug fix: XS of 0.96 (if UCA_Version is 9 to 11) wrongly referred to
DUCET for completely ignorable characters, even though the collator
don't use DUCET.
- separated t/notable.t from t/test.t.
0.96 Sat Dec 15 19:43:10 2012
- special noncharancter tailorings ('highestFFFF' and 'minimalFFFE')
* some locales are modified for 'highestFFFF': as, bn, fa, gu, hi, hy,
kn, kok, mr, or, sa, si, si_dict, ta, te, th, ur.
- U::C::Locale now allows 'entry' to add or override mappings.
- bug fix: using DUCET through XS wrongly prevented completely ignorable
characters from tailoring.
- modified tests: default.t, loc_as.t, loc_bn.t, loc_fa.t, loc_gu.t,
loc_hi.t, loc_hy.t, loc_kn.t, loc_kok.t, loc_mr.t, loc_or.t, loc_sa.t,
loc_si.t, loc_sidt.t, loc_ta.t, loc_te.t, loc_test.t, loc_th.t,
loc_ur.t, nonchar.t in t.
Upstream changes:
0.95 Sat Dec 8 15:11:09 2012
- U::C::Locale newly supports locales: bs_Cyrl, ee.
- updated to CLDR 21: uk.
- updated to CLDR 22: th, to.
- added loc_bscy.t, loc_ee.t in t.
- modified tests: loc_th.t, loc_to.t, loc_uk.t in t.
0.94 Fri Nov 23 18:45:53 2012
- U::C::Locale newly supports locale: zh__zhuyin.
- added Unicode::Collate::CJK::Zhuyin for zh__zhuyin.
- doc: added CAVEAT to CJK/Stroke.pm
- modified tests: loc_cjk.t, loc_cjkc.t in t.
- added cjk_zy.t, loc_zhzy.t in t.
0.93 Sun Nov 18 18:13:42 2012
- DUCET is updated (for Unicode 6.2.0) as Collate/allkeys.txt.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- The default UCA_Version is 26.
- Locale/*.pl (except fr.pl) and CJK/Korean.pm are updated.
- modified tests: loc_es.t, loc_estr.t, version.t in t.
0.92 Wed Nov 14 20:58:19 2012
- fix: index() etc. with preprocess/normalization should be always croaked.
- doc: referred to the latest UTS #10 and updated its section numbers.
- supported the identical level (see 'identical' in POD).
- Now UCA_Version 26 (for Unicode 6.2.0) is supported.
* But the default UCA_Version is still 24.
- added ident.t in t.
- modified tests: cjkrange.t, compatui.t, hangtype.t, index.t,
overcjk0.t, overcjk1.t, test.t, view.t in t.
0.91 Sun Nov 4 17:00:20 2012
- XSUB: use PERL_NO_GET_CONTEXT (see perlguts)
(see [rt.cpan.org #80313])
0.90 Sun Sep 23 10:42:26 2012
- perl 5.11.0 or later: Install to 'site' instead of 'perl'
(see [rt.cpan.org #79800])
Changes:
0.89 Sat Mar 10 20:19:11 2012
- avoid "use Test".
0.88 Mon Mar 5 21:56:13 2012
- DUCET is updated (for Unicode 6.1.0) as Collate/allkeys.txt.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- U+9FCC is a new CJK unified ideograph.
- The default UCA_Version is 24.
- Locale/*.pl (except fr.pl) and CJK/Korean.pm are updated.
- modified tests: cjkrange.t, compatui.t, hangtype.t, loc_cjkc.t,
loc_es.t, loc_estr.t, overcjk0.t, overcjk1.t, version.t in t.
Unicode::Collate implements Unicode Collation Algorithm (UCA). The UCA
provides a specification for how to compare two Unicode strings while
remaining conformant to the requirements of The Unicode Standard. The UCA
also supplies the Default Unicode Collation Element Table (DUCET), which
is data specifying the default collation order for all Unicode characters.
(imported and updated from wip)