a) refer to 'perl' in their Makefile, or
b) have a directory name of p5-*, or
c) have any dependency on any p5-* package
Like last time, where this caused no complaints.
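The three criteria above can be read as a simple predicate. A minimal sketch, assuming each package is described by its directory name, the text of its Makefile, and a list of bare dependency names; the function name and this representation are hypothetical and not part of any real tooling:

```python
from fnmatch import fnmatchcase


def needs_revision_bump(dir_name, makefile_text, dependencies):
    """Return True if a package meets any of criteria a), b) or c)."""
    # a) the Makefile refers to 'perl' anywhere in its text
    refers_to_perl = "perl" in makefile_text
    # b) the package directory is named p5-*
    is_p5_dir = fnmatchcase(dir_name, "p5-*")
    # c) at least one dependency is a p5-* package
    depends_on_p5 = any(fnmatchcase(dep, "p5-*") for dep in dependencies)
    return refers_to_perl or is_p5_dir or depends_on_p5
```

The substring test for a) is deliberately broad, so it also matches comments and unrelated words that contain 'perl'.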
Features:
Support for Python3,
Add xmlXPathSetContextNode and xmlXPathNodeEval
Documentation:
Add documentation for xmllint --xpath
Fix the URL of the SAX documentation from James
Fix spelling of "length"
Portability:
Fix python bindings with versions older than 2.7
rebuild docs:Makefile.am
elfgcchack.h after rebuild in doc
elfgcchack for buf module
Fix an unneeded and wrong extra link parameter
Few cleanup patches for Windows
Fix rpmbuild --nocheck
Fix for win32/configure.js and WITH_THREAD_ALLOC
Fix Broken multi-arch support in xml2-config
Fix a portability issue for GCC < 3.4.0
Windows build fixes
Fix a thread portability problem
Downgrade autoconf requirement to 2.63
Bug Fixes:
Fix a linking error for python bindings
Fix a couple of return without value
Improve the hashing functions
Improve handling of xmlStopParser()
Remove risk of lockup in dictionary initialization
Activate detection of encoding in external subset
Fix an output buffer flushing conversion bug
Fix an old bug in xmlSchemaValidateOneElement
Fix configure cannot remove messages
fix schema validation in combination with xsi:nil
xmlCtxtReadFile doesn't work with literal IPv6 URLs
Fix a few problems with setEntityLoader
Detect excessive entities expansion upon replacement
Fix the flushing out of raw buffers on encoding conversions
Fix some buffer conversion issues
When calling xmlNodeDump make sure we grow the buffer quickly
Fix an error in the progressive DTD parsing code
xmllint should not load DTD by default when using the reader
Try IBM-037 when looking for EBCDIC handlers
Fix potential out of bound access
Fix large parse of file from memory
Fix a bug in the nsclean option of the parser
Fix a regression in 2.9.0 breaking validation while streaming
Remove potential calls to exit()
Improvements:
Regenerated API, and testapi, rebuild documentation
Fix tree iterators broken by 2to3 script
update all tests for Python3 and Python2
A few more fixes for python 3 affecting libxml2.py
Fix compilation on Python3
Converting apibuild.py to python3
First pass at starting porting to python3
updated configure.in for python3
Add support for xpathRegisterVariable in Python
Added a regression tests from bug 694228 data
Cache presence of '<' in entities content
Avoid extra processing on entities
Python binding for xmlRegisterInputCallback
Python bindings: DOM casts everything to xmlNode
Define LIBXML_THREAD_ALLOC_ENABLED via xmlversion.h
Adding streaming validation to runtest checks
Add a --pushsmall option to xmllint
Cleanups:
Switched comment in file to UTF-8 encoding
Extend gitignore
Silence the new python test on input
Cleanup of a duplicate test
Cleanup on duplicate test expressions
Fix compiler warning after 153cf15905cf4ec080612ada6703757d10caba1e
Spec cleanups and a fix for multiarch support
Silence a clang warning
Cleanup the Copyright to be pure MIT Licence wording
rand_seed should be static in dict.c
Fix typos in parser comments
Upstream changes:
1.52 2013-05-21
- Add t/style-trailing-space.t.
- Got rid of trailing space.
- Convert to t/cpan-changes.t .
1.51 2013-05-11
- Sort the XML namespaces before outputting.
- became broken in perl-5.18.0-RC1.
CRF++ is a simple, customizable, and open source implementation of Conditional
Random Fields (CRFs) for segmenting/labeling sequential data. CRF++ is designed
to be general-purpose and can be applied to a variety of NLP tasks, such as Named
Entity Recognition, Information Extraction and Text Chunking.
Version 2.7
-----------
(codename Translation, released on May 20th 2013)
- Choice and prefix loaders now dispatch source and template lookup
separately in order to work in combination with module loaders as
advertised.
- Fixed filesizeformat.
- Added a non-silent option for babel extraction.
- Added `urlencode` filter that automatically quotes values for
URL safe usage with utf-8 as only supported encoding. If applications
want to change this encoding they can override the filter.
- Added `keep-trailing-newline` configuration to environments and
templates to optionally preserve the final trailing newline.
- Accessing `last` on the loop context no longer causes the iterator
to be consumed into a list.
- Python requirement changed: 2.6, 2.7 or >= 3.3 are required now,
supported by same source code, using the "six" compatibility library.
- Allow `contextfunction` and other decorators to be applied to `__call__`.
- Added support for changing from newline to different signs in the `wordwrap`
filter.
- Added support for ignoring memcache errors silently.
- Added support for keeping the trailing newline in templates.
- Added finer grained support for stripping whitespace on the left side
of blocks.
- Added `map`, `select`, `reject`, `selectattr` and `rejectattr`
filters.
- Added support for `loop.depth` to figure out how deep inside a recursive
loop the code is.
- Disabled py_compile for pypy and python 3.
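To illustrate the `urlencode` entry above: quoting a value for URL-safe usage with utf-8 can be sketched with the Python 3 standard library alone. This is not Jinja2's implementation; the function name `urlencode_sketch` and the choice to leave "/" unquoted are assumptions made for the example.

```python
from urllib.parse import quote


def urlencode_sketch(value, encoding="utf-8"):
    """Percent-encode a single value for use in a URL.

    Assumption: "/" is left as-is so that paths stay readable;
    every other reserved or non-ASCII character is quoted.
    """
    return quote(str(value), safe="/", encoding=encoding)
```

For example, `urlencode_sketch("a b&c")` gives `a%20b%26c`. An application that needs a different encoding would pass it explicitly, which mirrors the idea of overriding the filter.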
* Added "center" style hint for block title and desc
* Added style hints for new Getting Started designs
* Fixed incorrect automatic link role for guide links
* Support for Mallard conditional language test tokens
* Added style="continues" for Mallard lists and steps
* Better handling of broken internal Mallard links
* MathML support for Mallard and DocBook
* Audio and video support for DocBook
* Ability to set poster image for video
* TTML subtitles for audio and inline media
* Updated media controls
* Fixed #hash auto-expanding and colorizing
* Translator and publisher credits
* Use quote char on :before for blockquote
* ARIA landmark roles
* Changed marker for expanders
* Switched ui:expanded to non-experimental
* Experimental Mallard UI overlays
* Dropped unused DocBook utility XSLT
* Use itstool "join mode" for translations
* More experimental thumbnail link styles
* Many translation updates
3.2.1 (2013-05-11)
==================
Features added
--------------
* The methods ``apply_templates()`` and ``process_children()`` of XSLT
extension elements have gained two new boolean options ``elements_only``
and ``remove_blank_text`` that discard either all strings or whitespace-only
strings from the result list.
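The effect of the two options on a result list can be sketched in plain Python. This is only an illustration of the described behaviour, not lxml code: `filter_results` is a hypothetical helper, and the standard library's ElementTree stands in for lxml elements.

```python
import xml.etree.ElementTree as ET


def filter_results(results, elements_only=False, remove_blank_text=False):
    """Drop strings from a mixed list of elements and strings."""
    filtered = []
    for item in results:
        if isinstance(item, str):
            if elements_only:
                continue  # discard every string
            if remove_blank_text and not item.strip():
                continue  # discard whitespace-only strings
        filtered.append(item)
    return filtered


# A result list mixing one element with blank and non-blank text.
paragraph = ET.Element("p")
mixed = [paragraph, "  \n", "text"]
only_elements = filter_results(mixed, elements_only=True)
without_blanks = filter_results(mixed, remove_blank_text=True)
```

Here `only_elements` keeps just the element, while `without_blanks` keeps the element and the non-blank string.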
Bugs fixed
----------
* When moving Elements to another tree, the namespace cleanup mechanism
no longer drops namespace prefixes from attributes for which it finds
a default namespace declaration, to prevent them from appearing as
unnamespaced attributes after serialisation.
* Returning non-type objects from a custom class lookup method could lead
to a crash.
* Instantiating and using subtypes of Comments and ProcessingInstructions
crashed.
Other changes
-------------
3.2.0 (2013-04-28)
==================
Features added
--------------
Bugs fixed
----------
* LP#690319: Leading whitespace could change the behaviour of the string
parsing functions in ``lxml.html``.
* LP#599318: The string parsing functions in ``lxml.html`` are more robust
in the face of uncommon HTML content like framesets or missing body tags.
Patch by Stefan Seelmann.
* LP#712941: I/O errors while trying to access files with paths that contain
non-ASCII characters could raise ``UnicodeDecodeError`` instead of properly
reporting the ``IOError``.
* LP#673205: Parsing from in-memory strings disabled network access in the
default parser and made subsequent attempts to parse from a URL fail.
* LP#971754: lxml.html.clean appends 'nofollow' to 'rel' attributes instead
of overwriting the current value.
* LP#715687: lxml.html.clean no longer discards scripts that are explicitly
allowed by the user provided whitelist. Patch by Christine Koppelt.
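The LP#971754 change (appending rather than overwriting) can be sketched as follows. This is not the lxml.html.clean source; `add_nofollow` is a hypothetical helper, and skipping an already present 'nofollow' token is an assumption of the sketch.

```python
def add_nofollow(rel):
    """Return a 'rel' value that keeps existing tokens and adds 'nofollow'."""
    # Split the current value into tokens; None or "" means no tokens.
    # Note: this also normalises whitespace between tokens.
    tokens = rel.split() if rel else []
    if "nofollow" not in tokens:
        tokens.append("nofollow")
    return " ".join(tokens)
```

With this approach a value such as `external` becomes `external nofollow` instead of being replaced by `nofollow`.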
Other changes
-------------
3.1.2 (2013-04-12)
==================
Features added
--------------
Bugs fixed
----------
* LP#1136509: Passing attributes through the namespace-unaware API of
the sax bridge (i.e. the ``handler.startElement()`` method) failed
with a ``TypeError``. Patch by Mike Bayer.
* LP#1123074: Fix serialisation error in XSLT output when converting
the result tree to a Unicode string.
* GH#105: Replace illegal usage of ``xmlBufLength()`` in libxml2 2.9.0
by properly exported API function ``xmlBufUse()``.
Other changes
-------------
3.1.1 (2013-03-29)
==================
Features added
--------------
Bugs fixed
----------
* LP#1160386: Write access to ``lxml.html.FormElement.fields`` raised
an AttributeError in Py3.
* Illegal memory access during cleanup in incremental xmlfile writer.
Other changes
-------------
* The externally useless class ``lxml.etree._BaseParser`` was removed
from the module dict.
Java 5. Changes from 9.4 to 9.5:
The bulk of the open source parts of Saxon (the parts maintained by Saxonica) are now licensed under Mozilla Public License version 2.0, replacing MPL 1.0.
There have been some other changes to third-party open source components, notably the introduction of a new regular expression engine derived from Jakarta (Apache license), and the dropping of the old Base64 conversion code (Netscape license).
Users interested in building the product from source code need to be aware that the build process now includes a preprocessing phase that generates separate Java code for the EE, PE, and HE editions. The raw (before preprocessing) source code is published in the Subversion repository, but for building Saxon-HE, a more convenient place to start is the post-preprocessing Java code issued on SourceForge as a source.zip download.
A consequence of this change is that the JAR files for Saxon-HE, Saxon-PE, and Saxon-EE contain different class files having the same names. Therefore, the JAR files for different editions should not co-exist on the classpath. If you use internal Saxon APIs in an application, you may need to check that the methods you call are available in all three editions. This won't be a problem for interfaces that are clearly user-facing, but it could be an issue for applications that penetrate deeper into the internals.
More changes here:
http://www.saxonica.com/documentation/index.html#!changes
Common Changes
==============
CLDR 23: Collation tailorings put native script first; non-Gregorian calendar formats are more consistent; much improved data for Armenian (hy), Georgian (ka), Mongolian (mn), and Welsh (cy); …
Time zone data: 2013b
Date format/parse now supports CLDR short weekday names ("EEEEEE", "cccccc").
Support DisplayContext for date formatting, locale display names.
DateTimePatternGenerator behavior is now much more consistent between C and J.
Support new timezone pattern characters in LDML spec: X+, x+, O, OOOO, V, VV, VVV.
Updated SpoofChecker for v5 of UTS39.
AlphabeticIndex enhancements:
New thread-safe ImmutableIndex sub-API
Build an index for a custom Collator.
Make data-driven for Chinese collations.
New API for CLDR script metadata.
ICU4C Specific Changes
======================
Support for “dangi” Korean luni-solar calendar (already in ICU4J).
Add CompactDecimalFormat (already in ICU4J).
Add TerritoryContainment APIs (already in ICU4J).
UnicodeString default constructor and destructor now inline.
Layout engine now supports 'morx' tables.
Fixed some ICU 50 regressions:
Affixes set with e.g. DecimalFormat::setPositivePrefix were ignored for parse.
UNUM_PARSE_INT_ONLY no longer handled grouping separator.
Add ucal_getTimeZoneID.
The C++ AlphabeticIndex implementation is now on par with Java, including full support for all Chinese collation tailorings.
U8_NEXT() and similar low-level macros now support NUL-terminated UTF-8 strings.
New macros like U8_NEXT_OR_FFFD() return U+FFFD for an ill-formed sequence.
Conversion: New "good one-way" mapping type, for example for Variation Selector sequences.
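The idea behind the U8_NEXT_OR_FFFD()-style macros, returning U+FFFD for an ill-formed sequence instead of an error value, has a rough analogue in Python's standard decoder. The sketch below is not ICU code and does not call ICU; `decode_or_fffd` is a hypothetical name.

```python
def decode_or_fffd(data):
    """Decode UTF-8 bytes, substituting U+FFFD for ill-formed sequences."""
    # errors="replace" makes the decoder emit U+FFFD rather than raise.
    return data.decode("utf-8", errors="replace")
```

For instance, the byte 0xFF can never occur in well-formed UTF-8, so `decode_or_fffd(b"a\xffb")` yields "a", U+FFFD, "b".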
Upstream changes:
2013-05-04 David E. Wheeler <david@justatheory.org>
* Release 3.28
Removed use of 'done_testing' in t/white.t, which was added in
v3.27. Thanks to Paul Howarth for the catch.
Fixed improperly calculated number of skipped tests in t/xhtml01.t
when HTML::Entities is not installed.
2013-05-03 David E. Wheeler <david@justatheory.org>
* Release 3.27
Added new warnings as defined by Pod::Checker, thanks to RJBS:
+ "=over" without "=back"
+ "L<>" starts or ends with whitespace
+ raw "|" or "/" in "L<text|link>" text
Added ->keep_encoding_directive, which is false by default, to
prevent formatters from emitting a document's '=encoding'
directive in most cases. That they emitted the encoding was a bug
introduced in v3.26 (RT #84093). Now, only the DumpAsText and
DumpAsXML formats set it to true. Patch from Patrice Dumas.
2013-02-27 David E. Wheeler <david@justatheory.org>
* Release 3.26
Fixed another test script to skip all tests when Encode is not
installed.
Added 'encoding()' and 'detected_encoding()' to return the current
encoding and the encoding actually used to decode the input file. The
value from "=encoding" is also now kept in the output tree. Patch
from Patrice Dumas (RT #74390).
Upstream changes:
2.0016 Sat 13 Apr 22:36:51 IDT 2013
- Don't enable XML_PARSE_HUGE by default.
- Fix the previous version due to a mercurial SNAFU.
2.0015 Sat 13 Apr 02:30:33 IDT 2013
- Don't enable XML_PARSE_HUGE by default.
- https://bitbucket.org/shlomif/perl-xml-libxml/pull-request/19
- Thanks to Grant McLean ( https://metacpan.org/author/GRANTM ) for
the bug report and patch.
2.0014 Wed 5 Dec 11:06:26 IST 2012
- Got 40reader_mem_error.t to not fetch the external DTDs.
- https://rt.cpan.org/Public/Bug/Display.html?id=81703
- Thanks to Alexandr Ciornii (CHORNY) for the report and Slaven
Rezic (SREZIC) for the analysis and a proposed fix.
2.0013 Tue 4 Dec 17:40:27 IST 2012
- Fix a memory error (double-free) in XML::LibXML::Reader if we reached
EOF and then called destroy.
- discovered by Shlomi Fish.
- Fixed by Shlomi Fish.
- see t/40reader_mem_error.t
2.0012 Fri 9 Nov 06:39:32 IST 2012
- Fix support for references to scalars with overloaded stringification
magic.
- https://rt.cpan.org/Public/Bug/Display.html?id=77864
- Thanks to Christian Hansen (CHANSEN) for a report, a testcase, and
a patch.
2.0011 Thu 8 Nov 00:25:15 IST 2012
- Fix crash in removeChild() when not expanding entities
- https://rt.cpan.org/Ticket/Display.html?id=80395
- "removeChild() segfaults when not expanding entities"
- Thanks to GUIDO@cpan.org for the report, for a test case (that
was adapted into t/48_removeChild_crashes_rt_80395.t ) and for
a patch to fix it.
2.0010 Thu 1 Nov 20:13:08 IST 2012
- Passing debug (an undocumented option) to check_lib in Makefile.PL.
- This way we get more meaningful traces on perl Makefile.PL DEBUG=1.
- Thanks to MSTROUT for the report and a proposed fix.
2.0009 Thu 1 Nov 16:26:45 IST 2012
- Fix libxml2 detection in Strawberry Perl.
- Another Devel::CheckOS fallout.
- Thanks to KMX for the report and for a proposed fix. The actual fix
was made to be more generic considering the use-cases.
- https://rt.cpan.org/Ticket/Display.html?id=80540
2.0008 Mon 22 Oct 12:09:11 IST 2012
- Fix build error when using non-standard libxml2 installation
- https://rt.cpan.org/Ticket/Display.html?id=80332
- Thanks to L RW for the report.
Upstream changes:
0.52 ( 4-29-13 )
- Added proper binmode setting when outputting XML to address deficiencies on Win32 ( such as failing Strawberry perl due to line endings )
- Added evil OS check in file writing code to address differences in how files must be opened to write utf8 on win32 versus linux/cygwin.
- Added note regarding using utf8 flag instead of using encoding(utf8)
- Altered UTF8 test to also write XML containing UTF8 to a file.
0.51 ( 4-28-13 )
- New 'unsafe' parsing mode, giving a 30% speed increase with some downfalls
- New 'read_more' function, allowing more data to be read in before parsing into a tree
- Fixed some memory leaks
- Added 'Controversy' section to documentation
- Updated speed comparisons for feed2.xml, adding a comparison against XML::Fast as well
as a comparison of running in 'unsafe' mode.
- Spelling correction
- "Modernized" file handling using my variables
0.50 ( 4-24-13 )
- Remove debugging print statements that were left in
- Include the new test cases in the distribution
0.49 ( 4-23-13 )
- Fixes to find_by_perl
Can now be accessed in an object or namespace flexibly.
Now ignores case of keys.
Keys with underscores can be used.
Will work properly when passed a single node instead of an array reference of nodes.
- Fixes to xget
Method of dereferencing has been changed to be compatible with newer versions of perl ( 5.10.2+ )
- Fixed handling of nodes named 'value'; they will no longer crash the XS code
- Added a new function similar to new called 'simple', that works in the same way but automatically uses the 'simple' parser
- Altered handling of node values in simple mode, so that it stores values when mixed with nodes or attributes under 'content' ( like XML::Simple )
This feature was requested by users so it has been added. Note that it only does this when the value is not composed of just spaces and carriage returns.
0.48 ( 4-22-13 )
- Revert code back to 0.45 version
Changes lost in the process:
Content is not stored in the 'content' key any more; it is stored as the node itself when the node contains nothing else. As God intended.
Memory leak fixes
Quote handling
- Code copyright returned fully to David Helkowski; any changes by other parties have been discarded. Any code ported in from other open source projects removed.
This has been done intentionally so as to make it possible to relicense the software commercially.
- XS code modified to work in a multi-threaded environment
- Reverting Makefile.PL back to original weird hackish fixes. They are there for specific reasons.
This is the br.ispell dictionary for Brazilian Portuguese. It's freely available
under the terms of the GNU GPL. You can freely redistribute and/or use it for
spell checking through ispell. The main page (in Portuguese) for this dictionary
is http://www.ime.usp.br/~ueda/br.ispell.