Commit graph

169 commits

Author SHA1 Message Date
jperkin
0781d62eaf icu: Revert previous clang patch, clang was changed instead. 2018-01-16 13:28:12 +00:00
jperkin
38339513e8 icu: Don't perform SunOS _STDC_C99 workaround with clang. 2018-01-12 12:43:07 +00:00
adam
d25844a725 icu: updated to 60.2
60.2:
New API for direct-UTF-8 normalization.
- It also optionally records changes, for source-to-result index mapping and tracking of text metadata.
More convenient case mapping API (StringPiece→ByteSink).
ICU now handles ill-formed UTF-8 byte sequences as specified in the W3C Encoding Standard.

Bug fixes:
CLDR 32.0.1
- Change of some German AM/PM to English strings reverted; will be revisited.
- BGN transliterations restored.
The Script_Extensions property value for 5 CJK characters is wrong.
ICU4J DecimalFormat
- getGroupingSize() returns -1 instead of 0 in ICU60 if grouping is disabled
- setPositivePrefix also changes negative prefix
- unsets maxFrac when minFrac is set on a currency instance
DateFormat
- Urdu Islamic calendar eras
- Narrow format of noon time is used for abbreviated day period pattern letter 'b' and 'bb'
Conversion buffer overflow
Calendar buffer overrun
Windows C++: The header file "stringoptions.h" is not included in the pre-built binary .zip file download.
Fix various typos and spelling mistakes.
2017-12-14 21:28:32 +00:00
maya
ebf3ec8121 icu: bump minimum requirement for library major bump 2017-11-30 17:54:18 +00:00
adam
ec42a41308 icu: updated to 60.1
Changes 60.1:
* Unicode 10.0: 8,518 new characters, including four new scripts, 7,494 new Han characters, and 56 new emoji characters.
  - Properties newly supported in ICU: Emoji_Component, Regional_Indicator, Prepended_Concatenation_Mark
* CLDR 32:
  - Data for several (mostly Asian) new languages, date formatting patterns using colloquial day period formats ("h:mm B" → “1:30 in the afternoon”), and many other data improvements.
  - See the CLDR download page for other CLDR features and migration issues in CLDR 32.
* NumberFormatter, a new number formatting API: A long-overdue refresh of number formatting in ICU with a focus on usability, robustness, and performance.  The 30+ settings in DecimalFormat are reduced to 8 in NumberFormatter; all NumberFormatter objects are thread-safe and immutable; and the code is efficient in both the client-side (constant locale) and server-side (variable locale) use cases.
  - New users are encouraged to use the new API for number formatting.  However, preexisting code can continue using the old API, which has been partially made into a wrapper over the new API.
  - Documentation: in Java, see com.ibm.icu.number.NumberFormatter, and in C++, see i18n/unicode/numberformatter.h.
* New options for titlecasing:
  - Sentence titlecasing and whole-string titlecasing without custom BreakIterator instances.
  - The default index adjustment has been changed from "find first cased character" to "find first letter, number, or symbol"; a new option is available for selecting the previous adjustment behavior.
* Smaller data files for BreakIterator.
  - Reverse rules no longer used: Easier updates, easier to conform to Unicode Standard.
  - Old source rule files continue to work, reverse rules are ignored.
  - Rule-based data files: 1.2MB→0.8MB.

ICU4C Specific Changes
* New API for direct-UTF-8 normalization.
  - It also optionally records changes, for source-to-result index mapping and tracking of text metadata.
* More convenient case mapping API (StringPiece→ByteSink).
* ICU now handles ill-formed UTF-8 byte sequences as specified in the W3C Encoding Standard.
2017-11-30 16:03:18 +00:00
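The titlecasing index adjustment change described in the 60.1 notes above can be illustrated with a small ASCII-only sketch. This is not ICU's API: the names `Adjust`, `is_ascii_symbol`, and `titlecase_segment` are hypothetical, and the character classification is a rough stand-in for the Unicode properties ICU actually consults.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical modes mirroring the two adjustments named in the release notes.
enum class Adjust { ToCased, ToLetterNumberSymbol };

// ASCII characters whose Unicode general category is a symbol (Sc, Sk, Sm).
inline bool is_ascii_symbol(unsigned char c) {
    return c == '$' || c == '+' || c == '<' || c == '=' || c == '>' ||
           c == '^' || c == '`' || c == '|' || c == '~';
}

// Titlecase one segment: skip ahead to the first character selected by the
// adjustment mode, uppercase it, and lowercase everything after it.
// Characters before the adjusted index are left unchanged.
inline std::string titlecase_segment(std::string s, Adjust mode) {
    std::size_t i = 0;
    for (; i < s.size(); ++i) {
        unsigned char c = static_cast<unsigned char>(s[i]);
        bool stop = (mode == Adjust::ToCased)
                        ? std::isalpha(c) != 0  // ASCII approximation of "cased"
                        : (std::isalnum(c) != 0 || is_ascii_symbol(c));
        if (stop) break;
    }
    if (i < s.size()) {
        s[i] = static_cast<char>(std::toupper(static_cast<unsigned char>(s[i])));
        for (std::size_t j = i + 1; j < s.size(); ++j) {
            s[j] = static_cast<char>(std::tolower(static_cast<unsigned char>(s[j])));
        }
    }
    return s;
}
```

Under these assumptions, `titlecase_segment("49ers", Adjust::ToCased)` gives "49Ers", while `Adjust::ToLetterNumberSymbol` stops at the digit and gives "49ers". A segment such as "'twas" becomes "'Twas" in both modes, because the apostrophe is punctuation and is skipped either way.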
he
88729a741e Apply a fix for CVE-2017-14952 from
http://bugs.icu-project.org/trac/changeset/40324/trunk/icu4c/source/i18n/zonemeta.cpp
Bump PKGREVISION.
2017-11-16 09:58:26 +00:00
maya
04a5c878e9 icu: include xlocale on all non-netbsd non-linux systems.
netbsd & linux do not have it (glibc had it, but removed it in 2.26, and
locale.h always sufficed, if their release notes are to be believed)

this should cover BSDs other than netbsd, etc.
2017-10-06 15:20:42 +00:00
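The include policy described in the commit above might look roughly like the following. This is a sketch only: the actual patch is not shown in this log, so the macros tested here and the helper `c_locale_available` are assumptions for illustration.

```cpp
#include <locale.h>  // POSIX header, used on every platform

// Assumption: NetBSD and Linux are detected via their usual predefined macros,
// and every other target system ships an <xlocale.h>.
#if !defined(__NetBSD__) && !defined(__linux__)
#include <xlocale.h>
#endif

// Hypothetical helper: confirms the locale API is usable after the includes.
inline bool c_locale_available() {
    return setlocale(LC_ALL, "C") != nullptr;
}
```

Keying the conditional on the two systems known to lack the header, rather than listing every system that has it, matches the commit's stated intent of covering "BSDs other than netbsd, etc."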
adam
4d02a31a89 Fix building on Darwin 2017-10-06 05:42:01 +00:00
maya
f03707add3 icu: never include xlocale.h, always use locale.h
This was a glibc header, whereas locale.h is a POSIX one.
glibc went ahead and removed it in the new version.

change suggested by Thomas Orgis on tech-pkg but probably not applied
exactly.
2017-10-05 22:50:30 +00:00
jperkin
5feb8764da icu: Remove -nodefaultlibs -nostdlib from SunOS linker args.
This prevented GCC libraries from being used and thus disabled SSP and
other features.  Bump PKGREVISION.
2017-10-04 10:52:40 +00:00
maya
33ebf687dc revbump for requiring ICU 59.x 2017-09-18 09:52:56 +00:00
maya
d157e4f757 icu: belatedly require at least ICU 59.x
59 was a major bump, ICU 58 will not have the right library name.

pointed out by kapper, thanks.
ok leot
2017-09-18 07:52:08 +00:00
maya
a5d826d357 icu: remove part of configure script stripping -std=c++11 on Solaris
Blind build fix attempt for SmartOS.
2017-04-25 08:58:44 +00:00
maya
ada8fe73ed icu: require at least GCC 4.8
we need some c++11 (char16_t)
It's possible that a lower version will suffice

fixes build under netbsd 6.0 (gcc 4.5.3)
2017-04-24 21:20:10 +00:00
wiz
e85daf603e Remove -std=c++11 from here, it causes breakage in code that uses
C++11 extensions or wants C++14.
2017-04-24 12:25:09 +00:00
wiz
d736385972 Add -std=c++11 to CXXFLAGS.
icu uses char16_t in a central header file; this type is added by c++11.
2017-04-23 09:58:31 +00:00
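Why `char16_t` needs C++11 can be shown with a short standalone sketch. The helper `utf16_length` is hypothetical and is not taken from ICU's headers; it only demonstrates that the type and its `u"..."` literals are language features rather than library typedefs.

```cpp
#include <cstddef>
#include <cstdint>
#include <type_traits>

// In C++11, char16_t is a distinct built-in type, not an alias for an
// integer type, so a pre-C++11 compiler mode rejects headers that use it.
static_assert(!std::is_same<char16_t, std::uint_least16_t>::value,
              "char16_t is its own type");
static_assert(sizeof(char16_t) >= 2, "char16_t holds a UTF-16 code unit");

// Hypothetical helper: counts UTF-16 code units up to the terminating NUL.
inline std::size_t utf16_length(const char16_t* s) {
    std::size_t n = 0;
    while (s[n] != u'\0') ++n;
    return n;
}
```

With a C++11 compiler, `utf16_length(u"ICU")` is 3, and a string holding a single supplementary character such as `u"\U0001F600"` has length 2 because it is stored as a surrogate pair.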
adam
ba75e9ac7f Changes 59.1:
* Emoji 5.0 data
* Includes bidi data files from Unicode 10 beta.
* Includes segmentation data files and rules from Unicode 10 beta and CLDR 31.0.1.
* Does not yet include the Emoji_Component property.
* Otherwise ICU 59 continues to use Unicode 9 data.
CLDR 31.0.1
* Including updates for emoji 5.0, for example local names for England, Scotland, and Wales.
* GMT and UTC are no longer unified, and CLDR provides distinct UTC display names, avoiding confusion with standard (winter) time in Britain.
* See the CLDR download page for other CLDR features and migration issues in CLDR v31.
New case mapping API (C++ & Java classes CaseMap) supports styled text.
2017-04-22 20:11:45 +00:00
adam
c792894e30 Updates in ICU 58.2
Common Changes
* CLDR 30.0.3
* Time zone database version 2016j
* ICU SVN repository structure change. See the note on the Source Code Access page for more information.

ICU4C Fixes
* 12815 uspoof_getSkeleton sets backwards-incompatible illegal argument exception
* 12822 digitlist.cpp won't compile on msvc under Node.js
* 12825 uspoof_check goes into an "infinite loop" when U+30FB is in an input string
* 12832 GreekUpper::toUpper skips the final character on a non-terminated UTF-8 string
* 12849 u_strToTitle returns incorrect length if destination is NULL
* 12868 uprv_convertToPosix() Windows bug
2016-12-12 17:46:39 +00:00
wiedi
a72aa338a0 needs c99, fixes SunOS 2016-12-08 12:15:43 +00:00
adam
f1aa6695a4 PLIST fix for Darwin 2016-12-04 09:01:20 +00:00
ryoon
6aaec02c32 Update to 58.1
* Fix regression with upstream patch,
  https://ssl.icu-project.org/trac/ticket/12827

Changelog:
Common Changes

    CLDR 30.0.2: For details of the many changes in CLDR, see CLDR 30. Some things to note:
        For some combinations of numbering system (arab, arabext, latn) and/or locale (ar, fa, he), there were changes to the bidirectional control characters used with certain symbols (percent, minus, plus), and changes to number patterns (currency and/or percent, including addition of bidirectional control characters in some cases).
        New in this release, the bidirectional controls used for such purposes include U+061C ARABIC LETTER MARK (ALM), which requires use of the bidirectional algorithm from Unicode 6.3 or later.
        The time separator for Norwegian locales (nb, nn) was changed to be ':' throughout.
    Unicode 9.0: Version 9.0 adds exactly 7,500 characters, for a total of 128,172 characters. These additions include six new scripts, 19 symbols for the new 4K TV standard, and 72 new emoji characters.
    Draft Emoji 4.0 data
        Emoji updates for word & line breaking. (#12664 & Unicode 9 update #12526)
    UBiDiTransform/BidiTransform API for convenient transformation of text between different Bidi layouts. (#11679)
    MeasureFormat API for measurement unit display names. (#12029)
    Most COUNT and LIMIT enum constants have been deprecated. (#12420)
    SpoofChecker: Handling of "whole script confusables" has been removed from ICU, in accordance with its removal from UTS #39 Version 9.0.0 and the removal of the corresponding Unicode data file. (#12549)
    Greek uppercasing ("el" locale ID) removes most diacritics. (#5456)
    More robust locale data loading across ICU implementation code.
    Reduced heap memory usage in DateTimePatternGenerator. (#11782)

ICU4C Specific Changes

    The layout engine code has been removed; the ParagraphLayout is not deprecated and remains (and must now be built on top of HarfBuzz). See http://userguide.icu-project.org/layoutengine (#12708)
    Windows: Supports & requires Visual Studio 2015.
2016-12-04 04:11:32 +00:00
tron
c6344ef132 Revert incorrect change of the package list which was part of yesterday's
update of this package. It broke packaging under e.g. Mac OS X.
2016-04-12 19:51:54 +00:00
ryoon
55ebdcd8b5 Update to 57.1
Changelog:
Common Changes
    CLDR 29: For details of the many changes in CLDR, see CLDR 29.
    Grapheme/word/line breaking for emoji sequences, based on Unicode 9 proposed rules. See the Unicode emoji break proposal and the Unicode Emoji Technical Report Proposed Update describing the new emoji sequences.  (#12081).
    Four new Unicode emoji properties (#11802).
    DateFormat day period formatting of "noon", "at night", etc. via new pattern characters b & B, and DateTimePatternGenerator support of C for selecting the customary form (#11872).
        Except: Formatting of "0:00 midnight" has been disabled because it is confusing except for at the end of an interval.
    RelativeDateTimeFormatter: Simpler formatting API (#12072).
    More robust CLDR data loading for MeasureFormat (#11986, #12030), RelativeDateTimeFormatter (#12018), and DateIntervalFormat/DateIntervalInfo (#12013).
    New simple & fast SimpleFormatter class for a trivial subset of MessageFormat as used in CLDR data, e.g., "{0} {1}" (#10896).

ICU4C Specific Changes
    C API support for RelativeDateTimeFormatter (#12072).
    Clang annotations for intended switch case fallthroughs, can now compile with -Wimplicit-fallthrough (#12166).
    Internal header files can be compiled by themselves, for simpler alternative build scripts (#12141).
2016-04-11 16:15:22 +00:00
bsiegert
f0b84f1d29 Apply patch from upstream to fix compilation on CentOS 7. From Thomas
Orgis via mail.
2016-02-13 20:41:58 +00:00
agc
2eddae48e5 Add SHA512 digests for distfiles for textproc category
Problems found locating distfiles:
	Package cabocha: missing distfile cabocha-0.68.tar.bz2
	Package convertlit: missing distfile clit18src.zip
	Package php-enchant: missing distfile php-enchant/enchant-1.1.0.tgz

Otherwise, existing SHA1 digests verified and found to be the same on
the machine holding the existing distfiles (morden).  All existing
SHA1 digests retained for now as an audit trail.
2015-11-04 01:59:17 +00:00
jperkin
da7bc3ac4d Fix preprocessor logic bug causing __STRICT_ANSI__ to be undefined on all
platforms, breaking SunOS/clang's use of GCC headers around __float128.
2015-10-26 09:48:14 +00:00
jperkin
239e6a6e68 Use the GCC build file when using SunOS/clang, and patch it to pass the -h
linker argument correctly.
2015-10-26 09:28:20 +00:00
adam
21bdf9374f PLIST fix 2015-10-10 10:02:55 +00:00
ryoon
3e788a0bf9 Bump ABI. 2015-10-10 01:55:28 +00:00
ryoon
9c0d06ea3f Update to 56.1
Changelog:
Release Overview
The features for this release include support of CLDR 28 and Unicode 8.0.

For more details, including migration issues, see below.
Common Changes

    CLDR 28: For details of the many changes in CLDR, see CLDR 28.
    Unicode data updated to Unicode 8.0: 41 new emoji characters, 5,771 new ideographs for Chinese/Japanese/Korean, 6 new scripts, improved character properties data, etc.
    ICU data size reduced by about 7.2% (1.8MB) via sharing string values across resource bundles. [#11537]
    DateIntervalFormat now handles intervals with seconds, and sets FieldPosition more consistently. [#11706, #11726]
    DateFormat::createInstanceForSkeleton() caches DateFormat patterns rather than DateTimePatternGenerator instances, for better performance (for cache hits) and lower heap memory consumption. [#11780]
    StringSearch (based on collation) defaults to matches on normalization boundaries rather than grapheme cluster boundaries, which yields more matches on Indic text. [#11750]
    RuleBasedNumberFormat (spelled-out numbers) now handles rounding (Java only), infinity, NaN. [#11653, #11760, #8223]
    Most of the old Normalizer/unorm.h has been replaced by (and reimplemented via) Normalizer2, and is now deprecated. [#7303]
    COLON has been withdrawn as a date pattern character corresponding to the date field [UDAT_]TIME_SEPARATOR_FIELD; there is currently no pattern character corresponding to that field. [#11773]
    Support for locale key "cf" to specify currency format style, and interaction with NumberFormat values for UNumberFormatStyle: [#11787]
        For NumberFormat style UNUM_CURRENCY / CURRENCYSTYLE, the default is "standard" currency style (typically using minus sign for negative numbers), but the new locale key "cf" may be used with values "standard" or "account" to specify currency format style ("account" indicates accounting style, often using parentheses for negative numbers).
        For other NumberFormat styles, the locale key "cf" is ignored (they override the locale preference):
            UNUM_CURRENCY_ISO / ISOCURRENCYSTYLE
            UNUM_CURRENCY_PLURAL / PLURALCURRENCYSTYLE
            UNUM_CURRENCY_ACCOUNTING / ACCOUNTINGCURRENCYSTYLE
            UNUM_CASH_CURRENCY / CASHCURRENCYSTYLE
        A new NumberFormat style is available to explicitly specify standard style, ignoring the locale key "cf"
            UNUM_CURRENCY_STANDARD / STANDARDCURRENCYSTYLE

ICU4C Specific Changes

    C API support for CompactDecimalFormat via UNumberFormatStyle additions: UNUM_DECIMAL_COMPACT_SHORT, UNUM_DECIMAL_COMPACT_LONG [#11693]
    Larger UnicodeString object stores more characters inside the object without heap allocation; the UnicodeString object size is now build-time-configurable. [#11551]
        On 64-bit machines, increase from object size 40 bytes with 15 internal UChars to a new default of 64 bytes with 27 UChars.
    Some C++ classes now have swap() and moveFrom() methods, and support C++11 move semantics on compilers that support them. [#10086]
        UnicodeString, LocalPointer, LocalArray
    DecimalFormat code refactored to fix bugs, improve maintainability, and improve performance. [#10458]
    New FilteredBreakIterator suppresses certain segment boundaries. For example, it can suppress the sentence boundary in the middle of "Mr. Smith". [#11248]
    The internal, shared cache has been changed from unbounded to bounded. [#11767]
    For [U]BreakIterator with type UBRK_SENTENCE, the locale key "ss" can now be used with value "standard" to specify that standard sentence break suppression data should be used, or with value "none" to indicate that no break suppression data should be used (the default). [#11770]
    Collator: first-time startup time improved 20% due to precalculated unsafe-backward table [#11886]
    A number of memory leaks and buffer overruns have been fixed based on static code analysis, mostly in data build tools
2015-10-09 23:15:34 +00:00
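The UnicodeString size figures quoted in the 56.1 notes above can be checked with simple arithmetic. The sketch assumes a UChar is one 2-byte UTF-16 code unit; `overhead_bytes` is a hypothetical helper, and what the remaining bytes hold inside the object is not stated in the notes.

```cpp
#include <cstddef>

// Assumption: one UChar is a 16-bit UTF-16 code unit, i.e. 2 bytes.
constexpr std::size_t kUCharBytes = 2;

// Bytes of the object that are not part of the inline character buffer.
constexpr std::size_t overhead_bytes(std::size_t object_bytes,
                                     std::size_t inline_uchars) {
    return object_bytes - inline_uchars * kUCharBytes;
}

// Old layout: 40-byte object with 15 inline UChars.
static_assert(overhead_bytes(40, 15) == 10, "old layout");
// New default: 64-byte object with 27 inline UChars.
static_assert(overhead_bytes(64, 27) == 10, "new default layout");
```

Both layouts leave the same 10 bytes outside the buffer, so under this assumption the 24 extra bytes of object size go entirely to the 12 additional inline UChars.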
tnn
96f5f18555 Patch CVE-2015-1270. Via Debian. 2015-09-29 02:15:54 +00:00
jperkin
8091aee373 Now that _STRIPFLAG_INSTALL is disabled by default on Darwin, remove manual
settings of INSTALL_UNSTRIPPED=yes for Darwin in individual packages.
2015-09-07 12:02:05 +00:00
sevan
444a1fe46c Unbreak on Bitrig by adding necessary parts to autoconf related files
Add Bitrig to runConfigure script

http://bugs.icu-project.org/trac/ticket/11881
http://bugs.icu-project.org/trac/ticket/11882
2015-08-31 15:26:18 +00:00
adam
841676da23 Changes 55.1:
The features for this release include support of CLDR 27 (with a major cleanup of region locales, among many other improvements), formatting for scientific notation ("1.2 × 10³"), an update to Unicode 7.0 data for spoof-checking, narrow AM/PM markers ("7:45p"), and various performance enhancements. For C/C++, there are new methods for flexible dates ("Nov 10", or "Sept 2015"), named capture groups for regular expressions, formatting of compound units ("3.5 meters per second"), new C wrappers, and independent timezone resource loading. ICU4J has been improved and tested for using ICU4C data and for running on Android.
2015-04-06 07:22:54 +00:00
spz
35cd6d62be add patch for CVE-2014-7923 and CVE-2014-7926 found at
6242e2fbb3
2015-03-06 14:43:15 +00:00
bsiegert
26c4043246 Fix compilation on Mac OS 10.4. From Sevan Janiyan in PR pkg/49077. 2014-10-26 19:46:48 +00:00
adam
243c29c4cc Revbump after updating libwebp and icu 2014-10-07 16:47:10 +00:00
adam
7d59627492 ICU 54 is a major release of ICU, with new features, new APIs and many bug fixes in data and code. It supports the latest versions of the Unicode locale data (CLDR 26, September 2014) and Unicode Standard (Unicode 7.0, June 2014). The improvements include 72 new measurement units, Unihan radical-stroke collation moved into root, new RBNF PluralFormat syntax, dictionary-based word and line break for Burmese, support for short locale display names, compatibility support for IANA time zone data abbreviations, a tech preview of FilteredBreakIterator using ULI break data, ICU4C thread safety fixes, and the ability to build ICU4C Paragraph Layout with HarfBuzz. 2014-10-07 16:09:53 +00:00
ryoon
17b19fd399 Fix SCO OpenServer 5.0.7/3.2 build.
Add configuration for SCOOSR5.
2014-07-18 10:28:11 +00:00
obache
3396437a9d whitespace cleanup. 2014-06-08 09:09:01 +00:00
obache
1419bc26e4 Remove hard-coded RPATH flags from patch and use PKGCONFIG_OVERRIDE instead.
Fixes unwanted linker flags for platforms missing rpath support.
2014-06-08 09:08:19 +00:00
obache
2e70dcce49 Use the Cygwin package approach instead of tons of patches that affect other platforms. 2014-06-08 08:32:10 +00:00
obache
854bc63edd Update MH_NAME list from ICU_CHECK_MH_FRAG in source/acinclude.m4
Fixes PLIST for various platforms.
2014-06-03 09:08:05 +00:00
ryoon
847ac8ceb7 Fix OpenBSD 5.5 build
* OpenBSD 5.5 has /usr/include/sys/atomic.h, but it is different from NetBSD's
  one
2014-05-05 21:25:11 +00:00
obache
d8fc20e0b0 recursive bump from icu shlib major bump. 2014-04-09 07:26:56 +00:00
adam
31f860f4a7 Changes 53.1:
Data from the CLDR 25 release: Many bug fixes
Time zone data: 2014b, including post-CLDR 25 time zone data update to CLDR.
U+20BD Ruble Sign added (from Unicode 7.0, otherwise ICU 53 still uses Unicode 6.3)
MeasureFormat API for new units in CLDR 24
Hoisted setContext/getContext from SimpleDateFormat to DateFormat, implement context-sensitive capitalization of relative dates
Added setContext/getContext methods to NumberFormat (and unum_setContext/unum_getContext for UNumberFormat), implement context-sensitive number formatting (for RBNF spellout)
Improved lenient date parsing consistency between ICU4C and ICU4J, add finer-grained control of date parsing leniency
Fixed numeric rounding in TimeUnitFormat
Fixes to Unicode 6.3 bidirectional algorithm implementations to behave exactly like reference implementations
Improved UTF-16 charset detection
Collation code re-implemented
- Many bugs fixed, some enhancements implemented (link for ticket query)
- Passes full UCA conformance tests now
- Updated to UCA 6.3/CLDR 24 root collation
- Performance: C++ UTF-8 and Java string comparisons significantly faster (very small reduction for C++ UTF-16)
- Collation data size (uncompressed) reduced from 4.48MB (ICU 52) to 2.62MB
  - New data format, removed empty files, fixed genrb bug
- More APIs function when collation rule strings have been omitted from the data files (e.g., getTailoredSet())
- Java Collator.compare(Object, Object) now works with CharSequence, not just String
- Java Collator base class (does not apply to RuleBasedCollator instances): getters for strength, decomposition mode, and locales return hardcoded default values; their setters do nothing
- Rule syntax and semantics tightened and improved, matching LDML 25 Collation Rule Syntax
  - In particular, rule chains now must start with a reset.
- Setting of variableTop deprecated, and not supported in rule syntax any more
  - Replaced by the new maxVariable setting; see LDML 25 Collation Settings
Accounting format supported in NumberFormat
RelativeDateTimeFormatter class for formatting relative times such as "3 weeks ago" or "next Tuesday."
Updated Spoof Checker for Unicode Security Standard version 6.3.
2014-04-07 11:49:57 +00:00
obache
b19d0b31b2 Cygwin DLLs are handled automatically now. 2014-03-08 09:10:52 +00:00
obache
c3170fa226 Add NetBSD MI atomic_ops support.
Based on PR pkg/48608 by Izumi Tsutsui.
2014-03-04 12:07:16 +00:00
richard
3f38bc5fa3 Fix Solaris build for icu: general problems with CFLAGS/CXXFLAGS,
ranging from typos in configure and acinclude.m4 to needing extra flags to
properly generate dependency files with gcc.
2013-12-04 21:02:23 +00:00
bsiegert
33c650ad7c Fix MirBSD build by adding a <sys/time.h> include. 2013-12-03 21:16:06 +00:00