2015-12-05: php-mecab 0.6.0
===========================
* Add support for PHP 7.
* Add `MeCab` namespace classes and constants.
* Non-namespace classes are not supported on PHP 7.
* Non-namespace constants are not supported on PHP 7.
* Procedural APIs (`mecab_*` functions) are not supported on PHP 7.
* Non-namespace classes are deprecated on PHP 5.
* PHP versions earlier than 5.3 are no longer supported.
* MeCab versions earlier than 0.99 are no longer supported.
* Removed features:
* `$filter` callback argument of `mecab_split()` function.
* The persistent resource.
v2.0.4
Major Enhancements
* Classification thresholds can be enabled or disabled. The default is
disabled. The threshold value can be set at initialization time or
dynamically during processing (#47)
* Made auto-categorization optional, defaulting to false (#45)
* Added the ability to handle an array of classifications to the constructor
(#44)
* Classification with a threshold has been added to the api (#39)
Minor Enhancements
* Documentation around threshold usage (#54)
* Fixed UTF-8 encoding for hasher.rb (#50)
* Removed some unnecessary methods (#43)
* Add optional CachedContentNode (GSL only) (#43)
* Caches the transposed search_vector (#43)
* Added custom marshal_ methods to not save the cache when dumping/loading
(#43)
* Optimized some numeric comparisons and iterators (#43)
* Added cached calculation table when computing raw_vectors (#43)
* If a category name is already a symbol, just return it (#45)
* Various Hash improvements (#45)
* Eliminated several Ruby ⚠️s when run with RUBYOPT="-w" (#38)
* Simple performance improvements for the Hasher process (#41)
* Fixes for broken regex splitting for non-ascii characters and removal of the
unused punctuation filter (#41)
* Add multiple language stopwords with customizable stop word paths (#40)
Bug Fixes
* Fixed the bug where adding the same category a second time would clobber the
category that was already there (#45)
* Fixed deprecation warning for <=> in ls.rb (#33)
* Remove references to Madeline in the README and replace it with Marshal or
Redis (#32)
Development Fixes
* Added development dependency on mini_test and added 2.2 to travis.yml (#36)
=== 1.6.7 / 2015-11-29
==== Notes
This version supports native builds on Windows using the RubyInstaller
DevKit. It also supports Ruby 2.2.x on Windows, as well as making
several other improvements to the installation process on various
platforms.
This version also includes the security patches already applied in
v1.6.6.3 and v1.6.6.4 to the vendored libxml2 and libxslt source.
See #1374 and #1376 for details.
==== Features
* Cross-built gems now have a proper ruby version requirement. (#1266)
* Ruby 2.2.x is supported on Windows.
* Native build is supported on Windows.
* [MRI] libxml2 and libxslt `config.guess` files brought up to date. (#1326) (Thanks, @hernan-erasmo!)
* [JRuby] fix error in validating files with jruby (#1355, #1361) (Thanks, @twalpole!)
* [MRI, OSX] Patch to handle nonstandard location of `iconv.h`. (#1206, #1210, #1218, #1345) (Thanks, @neonichu!)
==== Bug Fixes
* [JRuby] reset the namespace cache when replacing the document's innerHtml (#1265) (Thanks, @mkristian!)
* [JRuby] Document#parse should support IO objects that respond to #read. (#1124) (Thanks, Jake Byman!)
* [MRI] Duplicate-id errors when setting the `id` attribute on HTML documents are now silenced. (#1262)
* [JRuby] SAX parser cuts texts in peices when quare brackets exist. (#1261)
* [JRuby] Namespaced attributes aren't removed by remove_attribute. (#1299)
The markdown2social utility converts simple Markdown documents to Google+
posts ready to be shared on the social network. There is room for writing
output plugins for other target sites, but at the moment only Google+ is
supported.
Provided by NONAKA Kimihiro in PR 50504.
Highway
-------
A fast pattern matching tool like pt and ag.
Feature:
- It can search the PATTERN from your source code very fast.
- It can search also the PATTERN as a regular expression.
- It supports UTF-8 and some Japanese encodings (Because I'm
Japanese. EUC-JP and Shift_JIS is supported).
- It ignores file patterns form your .gitignore by default.
pkgsrc changes:
o Switch MASTER_SITES to the author's site (in order to avoid the various
auto* tools)
Changes:
o Add support for Crystal file types
o Add support for Racket file types
o Add support for Standard ML file types
o Add support for Elixir file type
o Misc bugfixes
Upstream changes:
0.50 Sat May 9 09:06:41 EDT 2015
- Improve performance of is_LsubsetR via patches submitted by Mich Rawson
at New York Perl Hackathon (May 2 2015).
0.51 Thu May 14 21:49:00 EDT 2015
- Changes to List::Compare::Functional::is_LsubsetR introduced in v0.50
found to be buggy. See: https://rt.cpan.org/Ticket/Display.html?id=104452.
Pending further investigation, reverted to v0.49 of
List::Compare::Base::_Auxiliary; added two test files.
0.52 Thu May 21 21:57:18 EDT 2015
- v0.51 changes were not applied to master branch; apply them.
Remove two hash-to-hash assignments in List::Compare::Base::_Auxiliary::_subset_subengine.
0.53 Sun Jun 7 10:49:24 MDT 2015
- Accept patch submitted by Paulo Custodio reworking, and correcting,
changes originally published in 0.50.
(.xls) spreadsheet. FreeXL design goals:
* to be simple and lightweight
* to be stable, robust and efficient
* to be easily and universally portable
* completely ignoring any GUI-related oddity
Note that the final goal means that FreeXL ignores at all fonts, sizes and
alignments, and most formats. It ignores Pivot Table, Charts, Formulas, Visual
Basic macros and so on. FreeXL is structurally simple and quite light-weight
(typically 40-80K of object code, stripped).
The find-prefix infrastructure was required in a pkgviews world where
packages installed from pkgsrc could have different installation
prefixes, and this was a way for a dependency prefix to be determined.
Now that pkgviews has been removed there is no longer any need for the
overhead of this infrastructure. Instead we use BUILDLINK_PREFIX.pkg
for dependencies pulled in via buildlink, or LOCALBASE/PREFIX where the
dependency is coming from pkgsrc.
Provides a reasonable performance win due to the reduction of `pkg_info
-qp` calls, some of which were redundant anyway as they were duplicating
the same information provided by BUILDLINK_PREFIX.pkg.
add missing lzmalib ( fix librsvg build )
-llzma, from libcroco-0.6.8/src/libcroco-0.6.la#20
pkgsrc/graphics/librsvg:
ld: warning: liblzma.so.1, needed by /usr/pkgsrc/graphics/librsvg/work/.buildlink/lib/libcroco-0.6.so, not found (try using -rpath or -rpath-link)
Shared object "liblzma.so.1" not found
=== 1.6.6.4 / 2015-11-19
This version pulls in an upstream patche to the vendored libxml2 to address:
* unclosed comment uninitialized access issue (#1376)
This issue does not have a CVE assigned to it as this time.
v2.9.3: Nov 20 2015
Security:
CVE-2015-8242 Buffer overead with HTML parser in push mode (Hugh Davenport),
CVE-2015-7500 Fix memory access error due to incorrect entities boundaries (Daniel Veillard),
CVE-2015-7499-2 Detect incoherency on GROW (Daniel Veillard),
CVE-2015-7499-1 Add xmlHaltParser() to stop the parser (Daniel Veillard),
CVE-2015-5312 Another entity expansion issue (David Drysdale),
CVE-2015-7497 Avoid an heap buffer overflow in xmlDictComputeFastQKey (David Drysdale),
CVE-2015-7498 Avoid processing entities after encoding conversion failures (Daniel Veillard),
CVE-2015-8035 Fix XZ compression support loop (Daniel Veillard),
CVE-2015-7942-2 Fix an error in previous Conditional section patch (Daniel Veillard),
CVE-2015-7942 Another variation of overflow in Conditional sections (Daniel Veillard),
CVE-2015-1819 Enforce the reader to run in constant memory (Daniel Veillard)
CVE-2015-7941_2 Cleanup conditional section error handling (Daniel Veillard),
CVE-2015-7941_1 Stop parsing on entities boundaries errors (Daniel Veillard),
Documentation:
Correct spelling of "calling" (Alex Henrie),
Fix a small error in xmllint --format description (Fabien Degomme),
Avoid XSS on the search of xmlsoft.org (Daniel Veillard)
Portability:
threads: use forward declarations only for glibc (Michael Heimpold),
Update Win32 configure.js to search for configure.ac (Daniel Veillard)
Bug Fixes:
Bug on creating new stream from entity (Daniel Veillard),
Fix some loop issues embedding NEXT (Daniel Veillard),
Do not print error context when there is none (Daniel Veillard),
Avoid extra processing of MarkupDecl when EOF (Hugh Davenport),
Fix parsing short unclosed comment uninitialized access (Daniel Veillard),
Add missing Null check in xmlParseExternalEntityPrivate (Gaurav Gupta),
Fix a bug in CData error handling in the push parser (Daniel Veillard),
Fix a bug on name parsing at the end of current input buffer (Daniel Veillard),
Fix the spurious ID already defined error (Daniel Veillard),
Fix previous change to node sort order (Nick Wellnhofer),
Fix a self assignment issue raised by clang (Scott Graham),
Fail parsing early on if encoding conversion failed (Daniel Veillard),
Do not process encoding values if the declaration if broken (Daniel Veillard),
Silence clang's -Wunknown-attribute (Michael Catanzaro),
xmlMemUsed is not thread-safe (Martin von Gagern),
Fix support for except in nameclasses (Daniel Veillard),
Fix order of root nodes (Nick Wellnhofer),
Allow attributes on descendant-or-self axis (Nick Wellnhofer),
Fix the fix to Windows locking (Steve Nairn),
Fix timsort invariant loop re: Envisage article (Christopher Swenson),
Don't add IDs in xmlSetTreeDoc (Nick Wellnhofer),
Account for ID attributes in xmlSetTreeDoc (Nick Wellnhofer),
Remove various unused value assignments (Philip Withnall),
Fix missing entities after CVE-2014-3660 fix (Daniel Veillard),
Revert "Missing initialization for the catalog module" (Daniel Veillard)
Improvements:
Reuse xmlHaltParser() where it makes sense (Daniel Veillard),
xmlStopParser reset errNo (Daniel Veillard),
Reenable xz support by default (Daniel Veillard),
Recover unescaped less-than character in HTML recovery parsing (Daniel Veillard),
Allow HTML serializer to output HTML5 DOCTYPE (Shaun McCance),
Regression test for bug #695699 (Nick Wellnhofer),
Add a couple of XPath tests (Nick Wellnhofer),
Add Python 3 rpm subpackage (Tomas Radej),
libxml2-config.cmake.in: update include directories (Samuel Martin),
Adding example from bugs 738805 to regression tests (Daniel Veillard)
Changelog:
* Noteworthy changes in release 2.22 (2015-11-01) [stable]
** Improvements
Performance has improved for patterns containing very long strings,
reducing preprocessing time for an N-byte regexp from O(N^2) to
only slightly superlinear for most patterns. Before, a command like
the following would take over a minute, but now, it takes less than
a second:
: | grep -f <(seq -s '' 99999)
When building grep, 'configure' now uses PCRE's pkg-config module for
configuration information, rather than attempting to guess it by hand.
** Bug fixes
A DFA matcher bug made this command mistakenly print its input line:
echo axb | grep -E '^x|x$'
Likewise for this equivalent command:
echo axb | grep -e '^x' -e 'x$'
[bug introduced in grep-2.19 ]
grep no longer reads from uninitialized memory or from beyond the end
of the heap-allocated input buffer. This fix addressed CVE-2015-1345.
[bug introduced in grep-2.19 ]
With -z, '.' and '[^x]' in a pattern now consistently match newline.
Previously, they sometimes matched newline, and sometimes did not.
[bug introduced in grep-2.4]
When the JIT stack is exhausted, grep -P now grows the stack rather
than reporting an internal PCRE error.
'grep -D skip PATTERN FILE' no longer hangs if FILE is a fifo.
[bug introduced in grep-2.12]
--exclude and related options are now matched against entire
command-line arguments, not against command-line components.
[bug introduced in grep-2.6]
Fix performance degradation of grep -Fw in unibyte locales.
[bug introduced in grep-2.19 ]
Transform DocBook XSLT URLs to point to local copies coming from textproc/docbook-xsl.
This package should use XML Catalog, but doesn't for some unclear reason.
pkgsrc change: Add pkg_alternatives support.
=== 1.6.6.3 / 2015-11-16
This version pulls in several upstream patches to the vendored libxml2 and libxslt to address:
* CVE-2015-1819
* CVE-2015-7941_1
* CVE-2015-7941_2
* CVE-2015-7942
* CVE-2015-7942-2
* CVE-2015-8035
* CVE-2015-7995
See #1374 for details.
3.5.0 (2015-11-13)
==================
Bugs fixed
----------
* Unicode string results failed XPath queries in PyPy.
* LP#1497051: HTML target parser failed to terminate on exceptions
and continued parsing instead.
* Deprecated API usage in doctestcompare.
3.5.0b1 (2015-09-18)
====================
Features added
--------------
* ``cleanup_namespaces()`` accepts a new argument ``keep_ns_prefixes``
that does not remove definitions of the provided prefix-namespace
mapping from the tree.
* ``cleanup_namespaces()`` accepts a new argument ``top_nsmap`` that
moves definitions of the provided prefix-namespace mapping to the
top of the tree.
* LP#1490451: ``Element`` objects gained a ``cssselect()`` method as
known from ``lxml.html``. Patch by Simon Sapin.
* API functions and methods behave and look more like Python functions,
which allows introspection on them etc. One side effect to be aware of
is that the functions now bind as methods when assigned to a class
variable. A quick fix is to wrap them in ``staticmethod()`` (as for
normal Python functions).
* ISO-Schematron support gained an option ``error_finder`` that allows
passing a filter function for picking validation errors from reports.
* LP#1243600: Elements in ``lxml.html`` gained a ``classes`` property
that provides a set-like interface to the ``class`` attribute.
Original patch by masklinn.
* LP#1341964: The soupparser now handles DOCTYPE declarations, comments
and processing instructions outside of the root element.
Patch by Olli Pottonen.
* LP#1421512: The ``docinfo`` of a tree was made editable to allow
setting and removing the public ID and system ID of the DOCTYPE.
Patch by Olli Pottonen.
* LP#1442427: More work-arounds for quirks and bugs in pypy and pypy3.
* ``lxml.html.soupparser`` now uses BeautifulSoup version 4 instead
of version 3 if available.
Bugs fixed
----------
* Memory errors that occur during tree adaptations (e.g. moving subtrees
to foreign documents) could leave the tree in a crash prone state.
* Calling ``process_children()`` in an XSLT extension element without
an ``output_parent`` argument failed with a ``TypeError``.
Fix by Jens Tröger.
* GH#162: Image data in HTML ``data`` URLs is considered safe and
no longer removed by ``lxml.html.clean`` JavaScript cleaner.
* GH#166: Static build could link libraries in wrong order.
* GH#172: Rely a bit more on libxml2 for encoding detection rather than
rolling our own in some cases. Patch by Olli Pottonen.
* GH#159: Validity checks for names and string content were tightened
to detect the use of illegal characters early. Patch by Olli Pottonen.
* LP#1421921: Comments/PIs before the DOCTYPE declaration were not
serialised. Patch by Olli Pottonen.
* LP#659367: Some HTML DOCTYPE declarations were not serialised.
Patch by Olli Pottonen.
* LP#1238503: lxml.doctestcompare is now consistent with stdlib's doctest
in how it uses ``+`` and ``-`` to refer to unexpected and missing output.
* Empty prefixes are explicitly rejected when a namespace mapping is used
with ElementPath to avoid hiding bugs in user code.
* Several problems with PyPy were fixed by switching to Cython 0.23.
Upstream changes:
1.19 2015-10-28 MANWAR
- Added key 'provides' to the Makefile.PL script.
- Added standard unit test script (t/00-load.t).
1.18 2015-10-26 MANWAR
- Added MIN_PERL_VERSION to the Makefile.PL script.
- Tidied up and documented package PDF::Create.
- Tidied up and documented package PDF::Create::Page.
- Tidied up README file.
1.17 2015-10-25 MANWAR
- Renamed file CHANGES to Changes.
- Tidied up Changes file.
- Added standard unit test for Changes file (t/changes.t).
1.16 2015-10-24 MANWAR
- Fixed the warning thrown by method PDF::Image::GIF::UnLZW().
- Tidied up example scripts (eg/sample.pl and eg/sample-cgi.pl).
- Added sample images used by example scripts.
- Added standard unit test for MANIFEST file.
1.15 2015-10-24 MANWAR
- Added key 'CreationDate' to the list of valid keys to constructor.
- Tidied up pod document of PDF::Create.
- Tidied up Makefile.PL script.
- Removed MANIFEST from .gitignore file.
1.14 2015-10-23 MANWAR
- Added section REPOSITORY, LICENSE to the pod document.
- Added standard pod syntax testing script t/pod.t
- Added MANIFEST file.
1.13 2015-10-22 MANWAR
- Fixed the distribution as earlier v1.12 didn't have any code (MANWAR).
1.12 2015-10-22 MANWAR
- Updated repository link in the Makefile.PL script (MANWAR).
2.01 2015-11-07 NEILB
* Added abstracts to all of the submodules -- a PR from GARU.
* Changed README to README.md -- another PR from GARU.
* Fixed runtime warning for Pod::POM::ERROR -- another PR from GARU.