The rlimit issue addressed in patches ac, ad, ae was already fixed in
release 1.0.11, so remove them.
Omega 1.0.18 (2010-02-14):
indexers:
* Make the default charset "utf-8" not "UTF-8" as we lower case explicitly
specified character sets to compare to see if we need to reparse. Previously
XML documents which explicitly specified their character set as UTF-8 would
cause a needless restart of the parser.
* omindex:
+ Increase the wdf boost for the document title from 2 to 5, since 2 isn't
really enough.
* scriptindex:
+ Don't abort with "Unknown Exception" if indexing is disallowed or we hit
</body> for a document which had an overridden character set. Fixes
ticket#410.
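The charset comparison behind the "utf-8" default change above can be sketched
roughly as follows. This is an illustration only, not Omega's actual code: the
helper names lowercase_charset() and needs_reparse() are hypothetical, and the
sketch assumes the declared charset is lower-cased before being compared with
the one currently in use.

```cpp
#include <algorithm>
#include <cassert>
#include <cctype>
#include <string>

// Hypothetical helper: return an ASCII lower-cased copy of a charset name.
std::string lowercase_charset(std::string name) {
    std::transform(name.begin(), name.end(), name.begin(),
                   [](unsigned char ch) {
                       return static_cast<char>(std::tolower(ch));
                   });
    return name;
}

// Hypothetical helper: decide whether the document must be reparsed.
// The declared charset is lower-cased, but the charset in use is compared
// as-is, so a mixed-case default such as "UTF-8" never matches.
bool needs_reparse(const std::string& in_use, const std::string& declared) {
    assert(!declared.empty());
    return lowercase_charset(declared) != in_use;
}
```

Under these assumptions, needs_reparse("UTF-8", "UTF-8") reports a reparse even
though nothing changed, while needs_reparse("utf-8", "UTF-8") does not, which is
why a lower-case default avoids the needless restart.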
Omega 1.0.17 (2009-11-18):
indexers:
* omindex:
+ On Linux, change the memory limit on external filters to use _SC_PHYS_PAGES
since _SC_AVPHYS_PAGES excludes pages used by the OS cache and so will
often report a really low value. Fixes Debian bug#548987 and ticket#358.
+ Fix likely crash when reading output from external filter program if read()
is interrupted by a signal.
+ Fix potential crash when indexing PostScript files (fixed by using delete[]
(not delete) for array allocated by new[]).
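For reference, a minimal sketch of the usual way to make a read() loop robust
against signals, as in the external filter fix above. This is not the code used
in omindex; the function name read_all() and the buffer size are assumptions
made for the example.

```cpp
#include <cassert>
#include <cerrno>
#include <string>
#include <unistd.h>

// Hypothetical helper: append everything readable from fd to out.
// Returns true on end of file, false on a real read error.
bool read_all(int fd, std::string& out) {
    assert(fd >= 0);
    char buf[4096];
    for (;;) {
        ssize_t n = read(fd, buf, sizeof(buf));
        if (n > 0) {
            // Got some data; keep it and read again.
            out.append(buf, static_cast<std::string::size_type>(n));
            continue;
        }
        if (n == 0) return true;       // End of file.
        if (errno == EINTR) continue;  // Interrupted by a signal: retry.
        return false;                  // Any other error is fatal.
    }
}
```

The key point is that a return value of -1 with errno set to EINTR is not
treated as an error or as end of input; the call is simply repeated.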
testsuite:
* utf8converttest: Charset "8859_1" isn't understood by Solaris libiconv, and
isn't a standard charset name, so only test it when using our built-in
converter and GNU libc.
portability:
* Fix build failure on Mac OS X 10.6.
* Also check for socketpair() in -lxnet if it isn't found without, which
enables resource limits on external filter programs called by omindex on
Solaris, and possibly some other platforms. Fixes ticket#412.
general:
* omegascript.vim: The list of OmegaScript commands in the vim mode was rather
out of date, and a few commands were misclassified. Fix both problems and
avoid future recurrences by automatically generating those lists from the
command list in query.cc.
documentation:
* omegascript.html: Document that $date uses UTC. (ticket#314)
templates:
* query: Link to "xapian.org" rather than "www.xapian.org".
* inc/toptermsjs: Use double-quotes rather than single quotes for parameter
values on the <script> tag.
portability:
* omindex: Implement correct handling of paths when calling external filter
programs on Microsoft Windows.
indexers:
* omindex: Make sure that output is flushed after every message, not just after
some of them.
portability:
* Avoid infinite loop in omindex and scriptindex when reading files under
Cygwin with automatic end of line translation enabled. This same bug can
also manifest on Unix platforms if the file is truncated by another process
while being read.
* omindex:
+ If the filter program needed for a file format isn't installed, report this
explicitly when skipping subsequent files with the extension instead of
misleadingly reporting "Unknown extension".
+ Make -s actually work as a short-form for --stemmer (as documented by
"omindex --help" and "man omindex").
+ Drop the copyright info from the output of --version as it's perennially
out of date and we don't report it for any other Xapian programs.
* scriptindex:
+ Add new "valuenumeric" action to add a document value using
Xapian::sortable_serialise() to allow numeric sorting (ticket#260).
* $log now retries a partial write, or one interrupted by a signal.
* cgiparams.html: Note the technique of using a stub database file to allow a
default of searching over multiple databases.
* omindex:
+ Add support for indexing Microsoft Office 2007 formats and XPS files
(bug#290).
+ Fix the extraction of metadata from OpenDocument formats.
+ Fix "-l" which would previously always cause a segmentation fault if used
("--depth-limit" wasn't affected).
* Fix to compile when RLIMIT_AS isn't available (as on NetBSD and OpenBSD).
Instead use RLIMIT_VMEM or RLIMIT_DATA if either is available, else don't try
to limit the memory the filter process can use.
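The RLIMIT_AS fallback described above could look something like the sketch
below. It is an assumption-laden illustration rather than the omindex source:
the macro MEM_RESOURCE and the function limit_memory() are names invented for
this example, and only the order of preference (RLIMIT_AS, then RLIMIT_VMEM,
then RLIMIT_DATA, else no limit) is taken from the entry above.

```cpp
#include <cassert>
#include <sys/resource.h>

// Pick the first memory-related resource this platform defines.
// MEM_RESOURCE is a hypothetical name used only in this sketch.
#if defined RLIMIT_AS
# define MEM_RESOURCE RLIMIT_AS
#elif defined RLIMIT_VMEM
# define MEM_RESOURCE RLIMIT_VMEM
#elif defined RLIMIT_DATA
# define MEM_RESOURCE RLIMIT_DATA
#endif

// Hypothetical helper: set the soft memory limit of the calling process.
// Returns false if no suitable resource exists or the limit can't be set.
bool limit_memory(rlim_t bytes) {
    assert(bytes > 0);  // A zero limit would make every allocation fail.
#ifdef MEM_RESOURCE
    struct rlimit lim;
    if (getrlimit(MEM_RESOURCE, &lim) != 0) return false;
    // The soft limit may not exceed the hard limit.
    if (lim.rlim_max != RLIM_INFINITY && bytes > lim.rlim_max)
        bytes = lim.rlim_max;
    lim.rlim_cur = bytes;
    return setrlimit(MEM_RESOURCE, &lim) == 0;
#else
    (void)bytes;
    return false;  // No way to limit memory on this platform.
#endif
}
```

In a filter-running program, a helper like this would be called in the child
process between fork() and exec(), so that only the filter is restricted.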
Omega 1.0.10 (2008-12-23):
build system:
* This release now uses newer versions of the autotools (autoconf 2.62 ->
2.63; automake 1.10.1 -> 1.10.2). The newer autoconf fixes a regression
in autoconf 2.62 (and so Omega 1.0.7) with detecting the endian-ness of some
platforms.
Omega 1.0.9 (2008-10-31):
documentation:
* docs/overview.html: Document HTML parsing a bit, including robots
meta and htdig_noindex.
omega:
* omega: Catch std::exception and report what its what() method returns.
* omega: Remove undocumented and non-functional support for numeric sorting
via CGI parameter SORT=#<slot> (SORT=<slot> works as before).
build system:
* configure: Sync warning flag handling changes from xapian-core to eliminate
many warnings from GCC 4.3.
Omega 1.0.8 (2008-09-04):
documentation:
* Fix a few typos and improve wording in a few places.
indexers:
* omindex:
+ If the character encoding is specified using <meta http-equiv=...> in an
HTML document then reparse the document if it isn't the encoding we're
already using so that any preceding <title> is converted correctly
(bug#292).
+ Convert text from meta tag parameters to UTF-8 (bug#293).
+ Handle <meta charset="..."> (new in HTML 5).
+ Fix bug in HTML tag parameter parsing which was probably just a small
performance penalty in real world cases, but could perhaps result in
parsing bogus extra parameters in carefully contrived situations.
portability:
* Add missing <signal.h>, noted on FreeBSD by Henrik Brix Andersen.
Each database is created and updated separately using either omindex
or scriptindex. You can search these databases (or any other Xapian
database with suitable contents) via a web front-end provided by
omega, a CGI application. A search can also be done over more than
one database at once.