* Fixed bugs in the exiv2, OpenOffice, and OLE2 plug-ins.
* Static relocatable glib no longer required.
* getKeywords2 function is now included in the code.
Bugfixes in the exiv2 extractor, fixing the remaining issues.
Changed plugins to no longer use the filename and to rely
only on mmapped memory. Extended API with a function
that allows running getKeywords on data in memory
(instead of on a filename). Extended API with encode
and decode functions for binary metadata.
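As an illustration of why encode and decode functions are useful for binary
metadata: keyword values are NUL-terminated strings, so raw bytes (which may
contain 0x00) have to be turned into text first. The sketch below uses plain
hex encoding. It is NOT libextractor's actual encoding scheme, and the names
hex_encode and hex_decode are hypothetical helpers made up for this example.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper (not part of libextractor): hex-encode `size` bytes
 * into a newly allocated NUL-terminated string. Returns NULL on failure.
 * The caller frees the result. */
char *hex_encode(const unsigned char *data, size_t size)
{
    static const char digits[] = "0123456789abcdef";
    char *out;
    size_t i;

    if (size > (SIZE_MAX - 1) / 2)
        return NULL;                 /* 2 * size + 1 would overflow */
    out = malloc(2 * size + 1);
    if (out == NULL)
        return NULL;
    for (i = 0; i < size; i++) {
        out[2 * i] = digits[data[i] >> 4];
        out[2 * i + 1] = digits[data[i] & 0x0f];
    }
    out[2 * size] = '\0';
    return out;
}

/* Map one lowercase hex digit to its value, or -1 if it is not one. */
static int hex_value(char c)
{
    if (c >= '0' && c <= '9')
        return c - '0';
    if (c >= 'a' && c <= 'f')
        return c - 'a' + 10;
    return -1;
}

/* Hypothetical helper: decode a string produced by hex_encode.
 * Returns 0 on success and -1 on malformed input. On success the
 * caller frees *out. */
int hex_decode(const char *in, unsigned char **out, size_t *out_size)
{
    size_t len = strlen(in);
    size_t i;
    unsigned char *buf;

    if (len % 2 != 0)
        return -1;
    buf = malloc(len / 2 + 1);       /* +1 so empty input still allocates */
    if (buf == NULL)
        return -1;
    for (i = 0; i < len / 2; i++) {
        int hi = hex_value(in[2 * i]);
        int lo = hex_value(in[2 * i + 1]);
        if (hi < 0 || lo < 0) {
            free(buf);
            return -1;
        }
        buf[i] = (unsigned char)((hi << 4) | lo);
    }
    *out = buf;
    *out_size = len / 2;
    return 0;
}
```

Hex doubles the size of the data; a real implementation would likely pick a
more compact scheme, but the round trip (encode, store as string, decode)
has the same shape.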
Changes 0.5.1:
Preliminary integration of exiv2 support (not enabled
by default due to bugs). Moved Python and Java
bindings into separate packages.
Fixes (second half of) Secunia SA: http://secunia.com/advisories/15651/
(first half was fixed in 0.3.11nb1)
ChangeLog excerpts:
Thu Feb 24 00:32:44 EST 2005
Added extractor that extracts binary (!) thumbnails from
images using ImageMagick. Decoder function for the binary
string is in the thumbnailextractor.c source.
Sun Feb 20 16:36:17 EST 2005
Fixed similar problem in REAL extractor. Added support
for new Helix/Real format to REAL extractor.
Sun Feb 20 12:48:15 EST 2005
Fixed (rare) integer overflow bug in PNG extractor.
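The ChangeLog does not say where the overflow occurred. As a general
illustration only, integer overflows in parsers of chunked formats often come
from adding an untrusted 32-bit length field to an offset before comparing
against the buffer size. A minimal sketch of an overflow-safe bounds check
follows; the function name chunk_in_bounds is hypothetical and this is not
the actual fix applied to the PNG extractor.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not libextractor code): returns 1 if the byte range
 * [offset, offset + length) lies inside a buffer of `size` bytes, else 0.
 * The comparison is written as a subtraction so that no addition on
 * untrusted values can wrap around. */
int chunk_in_bounds(size_t size, size_t offset, uint32_t length)
{
    if (offset > size)
        return 0;                    /* offset itself is out of range */
    return (size - offset) >= length;
}
```

The naive form `offset + length <= size` can wrap around when length is
close to the maximum of its type, which makes an oversized chunk look valid.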
Fri Jan 21 15:23:43 PST 2005
Fixed security problem in PDF extractor.
Fri Dec 24 13:28:59 CET 2004
Added support for Unicode to the pdf extractor.
Thu Dec 23 18:14:10 CET 2004
Avoided exporting symbol OPEN (conflicts on OSX
with same symbol from GNUnet). Added conversion
to utf8 to various plugins (see todo) and
added conversion from utf8 to current locale to
print keywords.
Fri Nov 12 19:20:37 EST 2004
Fixed bug in PDF extractor (extremely rare segfault).
Fixed #787.
Fixed bug in man extractor (an undocumented return value when running
on a 4 GB file was not handled properly).
Sat Oct 30 20:18:21 EST 2004
Fixed various problems on Sparc64 (bus errors).
Added workaround for the glib re-load problem of the OLE2 extractor.
libextractor is a simple library for keyword extraction. libextractor
does not support all formats but supports a simple plugin mechanism
such that you can quickly add extractors for additional formats, even
without recompiling libextractor. libextractor typically ships with a
dozen helper-libraries that can be used to obtain keywords from common
file-types.