Commit graph

9 commits

Author SHA1 Message Date
ryoon
a8753d8a41 orcus: Update to 0.16.1
Changelog:
0.16.1
    fixed a build issue on 32-bit linux platforms, which was
    indirectly caused by ixion.

    fixed json parsing bug caused by an uninitialized variable,
    which manifested itself on debian 32-bit platform.

    removed compiler warnings on unused variables from the base
    parser handlers.

0.16.0
    general
	full formula recalculations are now optional when loading
	documents. It makes more effective use of cached formula
	results.

	added the option of failing on the first faulty cell, or
	skipping them.

	fixed a bug that caused the threaded_sax_token_parser to
	deadlock.

	added base parser handler classes in the public headers so
	that they can be sub-classed to overwrite necessary handler
	methods.

    json-parser
	parsing of numeric values are now more strict for better
	conformance to the specs.

    ods
	added support for loading named expressions from ods
	documents.

	fixed an infinite loop when loading one of the attached
	ods documents from
	https://bugs.documentfoundation.org/show_bug.cgi?id=82414

    xlsx
	fixed a segfault when loading the xlsx document from
	https://bugs.documentfoundation.org/show_bug.cgi?id=83711.

    xls-xml
	fixed a bug that prevented formulas from referencing cells
	located in later sheets.

    xml-map
	adjusted the xml path expressions to be more like XPath.
	Previously, an attribute was expressed as '@' in the old
	expression, but XPath uses '/@'. The new expression uses
	'/@' for an attribute.

	added the ability to identify and import ranges from XML
	documents without map file.

	added the ability to generate map file from XML documents
	for user customization.

	added support to specify default namespace in the map file.

    python
	added orcus.Cell class to represent individual cell values
	and attributes.

	fixed several memory leaks in the python binding layer.

	modified orcus.csv.read() function to take string input,
	instead of bytes.

	added __version__ attribute to the orcus module.

	cleaned up orcus.detect_format function to only take the
	stream parameter.

	added named_expressions properties to Document and Sheet
	class objects.

	added Python API to bulk-process a number of spreadsheet
	documents (orcus.tools.file_processor).

	added Python API to download attachments from bugzilla
	services via REST API (orcus.tools.bugzilla).
2021-02-06 06:29:04 +00:00
ryoon
6dd0e8ac28 orcus: Update to 0.15.3
Changelog:
orcus 0.15.3

* xml-map

  * fixed another bug related to filling of cells down the column in a linked
    range with nested repeat elements.  The bug would occur when the field in
    a linked range is more than one level deeper than the nearest row group
    element.

* xls-xml

  * fixed a bug where TopCell and LeftCell attributes of the Table element
    were not properly honored.

orcus 0.15.2

* xml-map

  * fixed a bug that prevented filling of cells down the column in a linked
    range with nested repeat elements.  The bug would occur when the field in
    a linked range is associated with an element content rather than an
    attribute.

* xls-xml

  * added code to properly pick up and pass the number format codes, including
    named number format values such as 'General Date', 'Long Time, 'Currency'
    etc.

* fixed a build issue on older macOS environment, related to passing an rvalue
  to a tuple expecting a const reference.  The root cause was a bug in libc++
  of LLVM < 7.

* fixed a build issue with gcc5.

orcus 0.15.1

* switched xml_map_tree to using boost::object_pool to manage the life
  cycles of the objects within xml_map_tree, to avoid memory
  fragmentation.

* fixed incorrect handling of newly created elements in xml_map_tree.

* fixed segfault caused by double deletion of allocated memory for
  xml_map_tree::element, which seemed to happen only on 32-bit gcc builds.

* fixed weird test failures related to equality check of two double-precision
  values, caused probably by aggressive compiler optimization which only seems
  to get triggered in 32-bit gcc builds.

orcus 0.15.0

* spreadsheet interface

  * import_sheet::fill_down_cells() has been added as a required method, to
    allow the import filter code to duplicate cell value downward in one step.

* json parser

  * added test cases from JSONTestSuite.

  * fixed a bug on parsing an empty array containing one or more blank
    characters between the brackets.

* sax parser

  * fixed a bug on parsing an attribute value with encoded character
    immediately followed by a ';', such as '&amp;;'.

  * fixed a bug on parsing an assignment character '=' that either preceded or
    followed by whitespaces in attribute definition.

  * optionally use SSE4.2 intrinsics to speed up element name parsing.

* orcus-xml

  * revised its cli interface to make use of boost's program_options.

  * orcus-xml-dump's functionality has been combined into orcus-xml.

  * map mode now supports nested repeat elements to be mapped as range fields.

* orcus-json

  * map mode has been added to allow mapping of JSON documents to spreadsheet
    document model.  This mode either takes explicit mapping rule via map
    file, or performs automatic mapping by auto-identifying mappable ranges by
    analyzing the structure of the JSON document.

  * structure mode has been added to display the logical structures of JSON
    documents.

  * significantly improved performance of json document tree by utilizing
    object pool to manage the life cycles of json value instances.

* xls-xml

  * added support for importing named color values in the ss:Color attributes.

  * added support for handling UTF-16 streams that contains byte order marks.

* spreadsheet document

  * significantly improved performance of flat format output generation.

* internal

  * string_pool now uses boost's object_pool to manage the instances of stored
    strings.

  * file_content class has been added to memory-map file contents instead of
    loading them in-memory.

  * memory_content class has been added to map in-memory buffer with the
    optional ability to perform unicode conversion.

  * dom_tree has been renamed to dom::document_tree, and its interface has
    been cleaned up to hide its implementation details.
2020-02-05 13:54:29 +00:00
ryoon
6eef056983 Update to 0.14.1
Changelog:
orcus 0.14.1

* addressed a number of coverity issues.
* improved precision of points-to-twips measurement conversions by
  reducing the number of numeric operations to be performed.  This
  especially helps on i386 platforms.

orcus 0.14.0

* spreadsheet interface

  * import_data_table::set_range() now receives a parameter of type
    range_t.

  * import_sheet::set_array_formula() interface methods have been
    removed and replaced with import_sheet::get_array_formula() that
    returns an interface of type import_array_formula.

  * import_formula interface class has been added to replace the
    formula related methods of import_sheet.  As a result,
    set_formula(), set_shared_formula(), and set_formula_result()
    methods have been removed from the import_sheet interface class.

  * import_auto_filter::set_range() now receives a parameter of type
    range_t, rather than a string value representing a range.

  * import_sheet::set_fill_pattern_type() interface method now takes
    an enum value of type fill_pattern_t, rather than a string value.

* xls-xml

  * pick up the character set from the XML declaration, and pass it
    to the client app via import_global_settings interface.

  * support importing of array formulas.

* xlsx

  * support importing of array formulas.

  * fixed a bug where sheet indices being passed to the append_sheet()
    interface method were incorrect.

* shared formula handling code has been re-worked.

* spreadsheet::sheet class has been de-coupled from the import and
  export interfaces.

* previously known as import_styles class is now split into styles
  class and import_styles factory wrapper class.

* sax_parser now gracefully ignores leading whitespace(s) if any,
  rather than aborting the parsing for it's not a valid XML stream
  to have leading whitespace(s).  In the future we should make this
  behavior configurable.

* python

  * add orcus.xlsx.read() function that takes a file object to load
    an xlsx file as a replacement for orcus.xlsx.read_file().

  * add orcus.ods.read(), orcus.xls_xml.read(), orcus.csv.read(),
    and orcus.gnumeric.read() functions.

  * add orcus.Sheet.write() method which exports sheet content to
    specified format.  For now only the csv format type is
    supported.

* xml_map_tree no longer requires the source stream persisted in
  memory between the read and write.

* the sax parser now stores the offset positions of each element
  rather than their memory positions, in order to make the position
  values usable between duplicated stream instances.

* xml_structure_tree to support selection of an element by element
  path.

* document

  * correctly set the argument separator depending on the formula
    grammar type.  This change fixes loading of ods documents with
    formula cells.

* fixed a build issue with boost 1.67.
2019-02-19 16:06:34 +00:00
ryoon
75c1731771 converters/orcus: import orcus-0.13.2
Standalone file import filter library for spreadsheet documents.

This package contains the 0.13 branch of the library.
2018-02-03 00:04:10 +00:00
wiz
da894d10b9 Remove orcus, unused. 2017-02-06 07:10:00 +00:00
ryoon
befeaf226b Update to 0.11.2
Changelog:
2016-05-11  Kohei Yoshida  <kohei.yoshida@gmail.com>  [d6084fe1771052e516ecfb270cb24dd9917a1895]

        Up the version to 0.11.2.


2016-05-11  Kohei Yoshida  <kohei.yoshida@gmail.com>  [70fd8327c94b27a99e2c7800e91c13e5099cceda]

        Make it buildable with mdds-1.2.
2016-12-06 15:13:20 +00:00
wiz
2f55a1a0cd Update to 0.9.2:
No Changelog found. Major API change -- 0.10 in directory names instead
of 0.8 before.
2015-09-08 19:40:59 +00:00
ryoon
f7e550c4ca Update to 0.7.0
* Change to 0.8.0 branch
* Change license to mpl-2.0 from mit

Changelog:
Add some more formats.
2014-10-13 21:43:20 +00:00
jaapb
b4d250296d Packaged converters/orcus, a library that deals with spreadsheet documents
(libreoffice dependency).
2013-08-24 16:12:10 +00:00