This is a maintenance release of cbmconvert after a 15-year break. Some old
source code releases were imported as snapshots, because no version control
system had been used in the past.
What's Changed
- Empty files are now supported.
- The build system was converted to CMake.
- Some test cases with CTest were introduced.
- Support for CPack packaging as well as native Debian packages was
implemented.
Full Changelog: https://github.com/dr-m/cbmconvert/commits/cbmconvert-2.1.4
2.0.8
Changed
- Improvement over Vietnamese detection
- MD improvement on trailing data and long foreign (non-pure latin) data
- Efficiency improvements in cd/alphabet_languages from [@adbar](https://github.com/adbar)
- call sum() without an intermediary list following PEP 289 recommendations from [@adbar](https://github.com/adbar)
- Code style as refactored by Sourcery-AI
- Minor adjustment on the MD around european words
- Remove and replace SRTs from assets / tests
- Initialize the library logger with a `NullHandler` by default from [@nmaynes](https://github.com/nmaynes)
- Setting kwarg `explain` to True will add provisionally (bounded to function lifespan) a specific stream handler
bdf2sfd 1.1.7 (2021-11-16)
- Rename the "glyphes" variable to "glyphs"
- Bump CMake minimum required version to 2.8.12
- Add SPDX short license identifier in source file
- Use CMAKE_C_STANDARD instead of hardcoding compiler flags
- Switch to using getline(3) instead of fgets(3)
Overview of changes between 1.0.10 and 1.0.11
=============================================
* Updated Unicode tables to version 14.
* Skip isolates in fribidi_get_par_direction().
* Various fuzzing fixes.
* Various build fixes.
Version 2.0.7
Changes:
Addition: 🍱 Add support for Kazakh (Cyrillic) language detection
Improvement: ❇️ Further improve inferring the language from a given code page (single-byte)
Removed: 🔥 Remove redundant logging entry about detected language(s)
Miscellaneous: 🔧 Trying to leverage PEP263 when PEP3120 is not supported
While I do not think that this (116) will actually fix something, it will rather raise a SyntaxError (Not about ASCII decoding error) for those trying to install this package using a non-supported Python version
Improvement: ⚡ Refactoring for potential performance improvements in loops
Improvement: ✨ Various detection improvement (MD+CD)
Bugfix: 🐛 Fix a minor inconsistency between Python 3.5 and other versions regarding language detection
Version 2.0.6
Changes:
Bugfix: 🐛 Unforeseen regression with the loss of the backward-compatibility with some older minor of Python 3.5.x
Bugfix: 🐛 Fix CLI crash when using --minimal output in certain cases
Improvement: ✨ Minor improvement to the detection efficiency (less than 1%)
Version 2.0.5
Changes:
Internal: 🎨 The project now comply with: flake8, mypy, isort and black to ensure a better overall quality
Internal: 🎨 The MANIFEST.in was not exhaustive
Improvement: ✨ The BC-support with v1.x was improved, the old staticmethods are restored
Remove: 🔥 The project no longer raise warning on tiny content given for detection, will be simply logged as warning instead
Improvement: ✨ The Unicode detection is slightly improved
Bugfix: 🐛 In some rare case, the chunks extractor could cut in the middle of a multi-byte character and could mislead the mess detection
Bugfix: 🐛 Some rare 'space' characters could trip up the UnprintablePlugin/Mess detection
Improvement: 🎨 Add syntax sugar __bool__ for results CharsetMatches list-container
This release push further the detection coverage to 97 % !
Version 2.0.4
Changes:
Improvement: ❇️ Adjust the MD to lower the sensitivity, thus improving the global detection reliability
Improvement: ❇️ Allow fallback on specified encoding if any
Bugfix: 🐛 The CLI no longer raise an unexpected exception when no encoding has been found
Bugfix: 🐛 Fix accessing the 'alphabets' property when the payload contains surrogate characters
Bugfix: 🐛✏️ The logger could mislead (explain=True) on detected languages and the impact of one MBCS match
Bugfix: 🐛 Submatch factoring could be wrong in rare edge cases
Bugfix: 🐛 Multiple files given to the CLI were ignored when publishing results to STDOUT. (After the first path)
Internal: 🎨 Fix line endings from CRLF to LF for certain files
0.0.8 (2021-09-14)
* No functional change in the library code.
* Include Windows binaries for Ruby 3.0.
* Drop support for Ruby 2.1 and earlier.
* Replace Travis CI with Github Actions.
* Fix cross-build after upgrading rake-compiler/rake-compiler-dock to
1.1.1/1.1.0.
Version 3.17.5 released 2021-08-23
* Fix the C extension module to harden is_namedtuple against looks-a-likes such
as Mocks. Also prevent dict encoding from causing an unraised SystemError when
encountering a non-Dict. Noticed by running user tests against a CPython
interpreter with C asserts enabled (COPTS += -UNDEBUG).
help2man (1.48.5) unstable; urgency=medium
* Use @samp{} around the option in the menu description for "--help
recommendations" to avoid rendering the -- as endash. (thanks to
Anders Jonsson).
* Update French translations (thanks to Stéphane Aulery).
* Update standards version to 4.6.0 (no changes).
help2man (1.48.4) unstable; urgency=medium
* Drop dependency on dpkg/install-info (not required since policy
3.9.5.0).
* Fix typo in comment and update some URLs to use https.
* Update Swedish translations (thanks to Anders Jonsson).
General:
* Donald Knuth’s changes for his 2021 tuneup of TeX and Metafont are
incorporated (https://tug.org/TUGboat/tb42-1/tb130knuth-tuneup21.pdf). They
are also available on CTAN as the knuth-dist and knuth-local packages. As
expected, the fixes are for obscure cases and do not affect any behavior in
practice.
* Except in original TeX: if \tracinglostchars is set to 3 or more, missing
characters will result in an error, not just a message in the log file, and
the missing character code will be shown in hex.
* Except in original TeX: a new integer parameter \tracingstacklevels, if
positive, and \tracingmacros is also positive, causes a prefix indicating the
macro expansion depth to be output on each relevant log line (e.g., ~.. at
depth 2). Also, macro logging is truncated at a depth ≥ the parameter value.
Aleph: The Aleph-based LaTeX format, named lamed, has been removed. The aleph
binary itself is still included and supported.
LuaTeX:
* Lua 5.3.6.
* Callback for nesting level used in \tracingmacros, as generalized variant
of the new \tracingstacklevels.
* Mark math glyphs as protected to prevent processing as text.
* Removed width/ic compensation for traditional math code path.
MetaPost:
* SOURCE_DATE_EPOCH environment variable support for reproducible output.
* Avoid wrong final % in mpto.
* Document -T option, other fixes to manual.
* Value of epsilon changed in binary and decimal modes, so mp_solve_rising_cubic
works as expected.
pdfTeX:
* New primitives \pdfrunninglinkoff and \pdfrunninglinkon; e.g., for
disabling generation of links in headers and footers.
* Warn instead of aborting when “\pdfendlink ended up in different nesting
level than \pdfstartlink”.
* Dump \pdfglyphtounicode assignments in fmt file.
* Source: poppler support removed, as it was too hard to keep in sync with
upstream. In native TL, pdfTeX has always used libs/xpdf, which is cut-down
and adapted code from xpdf.
XeTeX: Fixes for math kerning.
Dvipdfmx:
* Ghostscript is now invoked safely by default; to override (assuming all
input files are trusted), use -i dvipdfmx-unsafe.cfg. To use PSTricks with
XeTeX, this is required, as in:
xetex -output-driver="xdvipdfmx -i dvipdfmx-unsafe.cfg -q -E" ...
* If an image file is not found, exit with bad status.
* Extended special syntax for color support.
* Specials for manipulating ExtGState.
* Compatibility specials pdfcolorstack and pdffontattr.
* Experimental support for dviluatex’s extended fnt_def.
* Support new feature of virtual font to fallback Japanese font definition.
Dvips:
* Default PostScript document title is now the basename of the input file,
and can be overridden with the new option -title.
* If an .eps or other image file is not found, exit with bad status.
* Support new feature of virtual font to fallback Japanese font definition.
04/06/2021:
- remove many coverity warnings
03/23/2021:
- add a parser for Canvas 5 ( Mac and Windows )
03/12/2021:
- add basic support for hatch
02/25/2021
- add basic support for drop cap
4.03 Tue Oct 27 19:05:01 CET 2020
- when parsing comments in relaxed mode, JSON::XS would detect garbage
after the JSON text if the comment is after the end and does not end in
a newline (reported by Felipe Gasper).
0.61 2021-02-11
* Add upgrade_utf8 method
0.60 2021-01-26
* Add "read" method to read a file from an object
* Rename "run" to "parse" (old name is still OK)
* "tokenize_child" no longer returns parent on failure
* Add JSON::Whitespace (manipulate JSON whitespace) to the module
0.59 2021-01-26
* "json_file_to_perl" renamed "read_json"
* Documentation work
-- Benchmarks section updated with latest versions, add Cpanel::JSON::XS
-- Add Metacpan vote ratings to other CPAN modules section
-- Sections reordered to put methods above parsing details
0.58 2021-01-01
* Protect against stack overflows by having maximum parsing depth
-- set_max_depth, get_max_depth methods added
* Documentation updated
-- JSON RFC changed to 8259
-- Discussion of Unicode tests in JSON Test Suite
1.0.1.0
* Backwards-compatible support for bytestring ^>= 0.11 (#15)
1.0.0.0
* Merged omnibus PR doing a variety of things in (#10):
- Improves performance by 3-4x for encode, 4-5x for decode.
- The decode signature returning the tuple and actually returns an
error message with offset. The signature will now be ByteString ->
Either String ByteString.
- Actually tests using the test vectors defined in the RFC, and uses
property tests to ensure invariants hold.
- Adds lenient decoders to the API
- Adds -XTrustworthy annotations to the relevant exposed modules
- Rewrites the haddocks to be more up to date and fancy-styled.
- Adds benchmarks to the .cabal file so they can be run at toplevel,
and make them better.
- Bumps the Cabal version to 1.12
Because of the breadth of this change, we are calling this a new epoch
for the base16-bytestring library. Hence, the version 1.0.0.0.
0.1.1.7
- Fix some bugs in lazy decoding (#8).
1.2.0.1
* Package update: support for bytestring >=0.11
1.2.0.0
* Security fix: reject non-canonical base64 encoded values - (#38)
fixing issue #24.
* Security fix: reject bytestrings with improper padding that can be
"completed" by the unpadded-Base64url workflow, and homogenize error
messages (#33)
* Test coverage expanded to 98% of the library. All critical paths
covered.
1.1.0.0
* joinWith has been removed (#32)
* Bugfix: decode formerly allowed for padding chars to be interspersed
in a valid base64-encoded string. This is now not the case, and it
is fully spec-compliant as of #31
* The default behavior for Base64url decode is now to support
arbitrary padding. If you need strict padded or unpadded decode
semantics, use decodePadded or decodeUnpadded.
* Added strict unpadded and padded decode functions for Base64url
(#30)
* Added unpadded encode for Base64url (#26).
1.0.0.3
* Made performance more robust (#27).
* Improved documentation (#23).
* Improved the performance of decodeLenient a bit (#21).
1.5.6.0
* Make Show Value instance print object keys in lexicographic order.
1.5.5.1
* Fix a bug in FromJSON QuarterOfYear instance.
1.5.5.0
* Add instances for Month, Quarter and QuarterOfYear (from time-1.11),
thanks to Oleg Grenrus.
* The aeson repository has been moved to the haskell github
organization!
1.5.4.1
* Use Text.Encoding.decodeLatin1 to speed up ASCII string decoding,
thanks to Dmitry Ivanov.
* Support bytestring 0.11.* and th-abstraction 0.4.*, thanks to Oleg
Grenrus.
1.5.4.0
* Add instances for ToJSONKey and FromJSONKey to Const, thanks to Dan
Fithian.
* Add support for template-haskell 2.17, thanks to Galen Huntington.
* Documentation typo fix, thanks to Jean-Baptiste Mazon.
1.5.3.0
* Add instances for types in strict and data-fix packages, thanks to
Oleg Grenrus.
* CPP cleanup, thanks to Oleg Grenrus.
* Instances for dlist's Data.DList.DNonEmpty.DNonEmpty, thanks to Oleg
Grenrus.
1.5.2.0
* Add Ord Value instance, thanks to Oleg Grenrus.
* Export rejectUnknownFields from Data.Aeson
1.5.1.0
* Add instances for these, thanks to Oleg Grenrus.
1.5.0.0
* Fix bug in rejectUnknownFields not respecting fieldLabelModifier,
thanks to Markus Schirp.
* GFromJSON members are no longer exported from Data.Aeson(.Types), if
you are using gParseJSON consider switching to gParseJSON', thanks
to Oleg Grenrus.
* Aeson no longer accepts unescaped control characters, thanks to Oleg
Grenrus.
* Remove CoerceText since GHC >=7.8 has Coercible, thanks to Oleg
Grenrus.
* Rename the GToJSON class to GToJSON' and expose it, thanks to Oleg
Grenrus.
1.4.7.1
* GHC 8.10 compatibility, thanks to Ryan Scott.
1.4.7.0
Long overdue release (once again), so there's quite a bit of stuff
included even though it's a "minor" release. Big thanks to all the
contributors, the project would not exist without you!
Special thanks to Oleg Grenrus and Xia Li-Yao for reviewing tons of
stuff.
New stuff:
* Add rejectUnknownFields to Options which rejects unknown fields on
deserialization. Useful to find errors during development, but
enabling this should be considered a breaking change as previously
accepted inputs may now be rejected. Thanks to rmanne.
* FromJSON instance of Ratio a now parses numbers in addtion to
standard {numerator=..., denumerator=...} encoding. Thanks to
Aleksey Khudyakov.
* Add more information to parse errors, including a sample of the
surrounding text. Hopefully this will lead to less "Failed to read:
satisfy" confusion! Thanks to Sasha Bogicevic. We expect some
downstream test suites to break because of this, apologies in
advance. Hopefully you will like the improvement anyway :-)
* Add parseFail to Data.Aeson.Types. parseFail = fail but doesn't
require users to know about MonadFail. Thanks to Colin Woodbury.
* Make Template Haskell type family detection smarter when deriving
ToJSON1 instances, thanks to Ryan Scott.
* Optimize string parsing for the common case of strings without
escapes, thanks to Yuras.
Misc:
* Clean up compiler warnings and switch from base-compat to
base-compat-batteries. Thanks to Colin Woodbury & Oleg Grenrus.
* Clarification & fixes to documentation regarding treatment of Maybe
fields, thanks to Roman Cheplyaka.
* Add documentation for internal development workflows. Thanks to Guru
Devanla.
* Drop support for GHC < 7.8. We've chosen to support older GHCs as
long as it doesn't prevent us from adding new features, but now it
does! Thanks to Oleg Grenrus for the patch.
* Allow generic-deriving 1.13 in test suite.
* Some DRY fixes thanks to Mark Fajkus.