2006-11-16 Brian Burton <brian@burton-computer.com>
* Released as 1.4d
* configure.ac: Added ability to selectively disable image
processing using --without-gif, --without-jpeg, and/or
--without-png.
* src/spamprobe/spamprobe.cc (set_headers): Added ability to
selectively ignore individual headers using -H-headername.
* src/includes/Ptr,Ref,Array.h: Restored missing <cassert> include.
* src/parser/PngParser.cc (tokenizeImage): added basic tokens from
PNG images.
2006-11-16 Brian Burton <brian@localhost.localdomain>
* src/parser/PngParser.cc (PngParser): Stub for PNG parsing using
libpng.
* src/parser/JpegParser.cc (tokenizeMarker): Preliminary
implementation of jpeg parsing using jpeglib.
* configure.ac: Auto detect of either libungig or libgif depending
on which one is available.
2007-01-04 Brian Burton <brian@burton-computer.com>
* Released as 1.4c
* spamprobe.1: Modified man page to remove unnecessary informaton
and make it more conformant with man page conventions.
* src/spamprobe/spamprobe.cc (process_extended_options): added
ignore-body option.
* src/parser/HeaderPrefixList.cc (HeaderPrefixList::addHeaderPrefix):
Forced header prefixes and names to lower case instead of
relying on an assert to enforce the restriction.
* src/database/FrequencyDBImpl_hash.cc (hash::FrequencyDBImpl_hash):
Disabled experimental hash database auto-cleaning.
* src/includes/Ref.h: Removed cassert include.
* src/spamprobe/spamprobe.cc (process_extended_options): Added
whitelist option to allow use of SP as a bayesian white list in
conjunction with other filters.
* New manual page
* spamprobe.cc (process_stream): Added -o tokenized option
to allow people to use an external tokenizer with spamprobe.
* SpamFilter.cc (scoreToken): Reduced sorting overhead by
pre-computing and integer sort value with sorting priorities
reflected in the value. This eliminates several calculations
inside of the sort routine.
* SpamFilter.cc (computeRatio): Capped ratios in calculations to
within MIN_PROB and MAX_PROB. Widened that range. This avoids
problems with div/0 and makes it easier to sort terms.
* spamprobe.cc (dump_words): dump command can now optionally
accept a regular expression as an argument and will only dump
terms matching the regular expression.
(purge_terms): Added purge-terms command to purge from the
database all terms matching a regular expression.
Patch submitted by Ossi Herrala <PGP: 0x78CD0337> in private email.