Commit graph

26 commits

Author SHA1 Message Date
ghen
84cf6490a4 Fix typo, spotted by Chris Ross. 2007-10-25 21:45:42 +00:00
heinz
77aa057098 The package supports installation to DESTDIR. 2007-10-25 17:52:17 +00:00
ghen
3f6e22dc4b Update to 1.4d (previous 1.4.2 was 1.4b). Take MAINTAINERship.
2006-11-16  Brian Burton  <brian@burton-computer.com>

* Released as 1.4d

* configure.ac: Added ability to selectively disable image
  processing using --without-gif, --without-jpeg, and/or
  --without-png.

* src/spamprobe/spamprobe.cc (set_headers): Added ability to
  selectively ignore individual headers using -H-headername.

* src/includes/Ptr,Ref,Array.h: Restored missing <cassert> include.

* src/parser/PngParser.cc (tokenizeImage): added basic tokens from
  PNG images.

2006-11-16  Brian Burton  <brian@localhost.localdomain>

* src/parser/PngParser.cc (PngParser): Stub for PNG parsing using
  libpng.

* src/parser/JpegParser.cc (tokenizeMarker): Preliminary
  implementation of jpeg parsing using jpeglib.

* configure.ac: Auto detect of either libungig or libgif depending
  on which one is available.

2007-01-04  Brian Burton  <brian@burton-computer.com>

* Released as 1.4c

* spamprobe.1: Modified man page to remove unnecessary informaton
  and make it more conformant with man page conventions.

* src/spamprobe/spamprobe.cc (process_extended_options): added
  ignore-body option.

* src/parser/HeaderPrefixList.cc (HeaderPrefixList::addHeaderPrefix):
  Forced header prefixes and names to lower case instead of
  relying on an assert to enforce the restriction.

* src/database/FrequencyDBImpl_hash.cc (hash::FrequencyDBImpl_hash):
  Disabled experimental hash database auto-cleaning.

* src/includes/Ref.h: Removed cassert include.

* src/spamprobe/spamprobe.cc (process_extended_options): Added
  whitelist option to allow use of SP as a bayesian white list in
  conjunction with other filters.
2007-10-25 16:00:05 +00:00
wiz
601583c320 Whitespace cleanup, courtesy of pkglint.
Patch provided by Sergey Svishchev in private mail.
2007-02-22 19:26:05 +00:00
wiz
c9035d5d45 Fix build with gcc4. 2006-08-13 10:13:42 +00:00
ghen
f3a7bbd06c Update spamprobe to 1.4b (PKGNAME changed to 1.4.2).
Changes in 1.4a:
This release fixes a bug in decoding malformed base64 encoded data that
manifested itself in some architectures. Also corrects a typo in the online
help. Some code cleanup and minor performance tweaks were added as well.

Changes in 1.4b:
This release fixes pair of bugs related to messages with no lines in their
bodies. No new features were added.
2006-08-04 20:12:31 +00:00
hubertf
7c494d79ed Update spamprobe to version 1.4.
Submitted by Ossi 'ohub' Herrala <oherrala@ee.oulu.fi> in private mail
as discussed on IRCnet #NetBSD.

Changes to software:
 * Spamprobe has a config file!
 * Added comprehensive command line option help.
 * Added support for maildir directories to all file based commands.
 * Added support for auto-purge (-P command line option).
 * Commands create-db and create-config to greate database and config
   file.
 * Added gif parser support using libungif to analyzes image
   attachments and derive useful information from them.

Changes to package:
 * bdb is not in SUGGESTED_OPTIONS anymore.
 * added depency to graphics/libungif and gif is in SUGGESTED_OPTIONS.
2006-01-07 10:27:51 +00:00
wiz
751d30d637 DEPENDS and PKGREVISION bumps because db4's library name changed. 2006-01-06 15:10:01 +00:00
rillig
579e977969 Ran "pkglint --autofix", which corrected some of the quoting issues in
CONFIGURE_ARGS.
2005-12-05 23:55:01 +00:00
hubertf
bb0410fd2a Update spamprobe to 1.2a.
SpamProbe changes:
 - New email parsing implementation
 - New hash database, in addition to Berkeley DB (bdb/db4)
   (=> Hopefully no more "caught runtime exception: get: DB_PAGE_NOTFOUND:
       Requested page not found" on Linux every other day!)
 - And much more, check the ChangeLog for accurate information

pkgsrc changes:
 - Added db4 database backend to be optional (not requirement anymore)

Patches sent by the maintainer Ossi Herrala <oherrala@ee.oulu.fi>
in private mail.
2005-06-29 00:06:51 +00:00
jlam
3b2602907d Pass the correct path to the installed db4 files. Also, transform
"-ldb" to "-ldb4" so that the configure script doesn't use a system
libdb.so instead of the pkgsrc libdb4.so.  This fixes PR pkg/23615.
2005-06-12 08:59:32 +00:00
tv
f816d81489 Remove USE_BUILDLINK3 and NO_BUILDLINK; these are no longer used. 2005-04-11 21:44:48 +00:00
agc
8758983939 Add RMD160 digests. 2005-02-24 09:59:20 +00:00
hubertf
4c36ecdca9 Update spamprobe to 1.0a, patch sent via IRC by the maintainer.
Changes:
	* MimeLineReader.cc: 1.0 branch - fixed MBX record header regex
	* spamprobe.cc (main): Added exec and exec-shared commands.
	  (import_words): modified import command to allow negative values
	  to be specified in the import file.
	* Applied patches for configure.in and aclocal.m4 contributed by
	  Siggy Brentrup for debian compatibility.
	* FrequencyDBImpl_pbl.cc: Invokes new WordData methods to allow
	  storing data in big endian format.
	* WordData.h: Added optional support for storing counts/flags
	  in big endian order for data portability.
	* MimeLineReader.cc (readMBXFileHeader): UW IMAP MBX file format
	  is now auto detected from the first line of the mailbox file.
	* spamprobe.cc (process_extended_options): Removed -o imap-mbx
	  option.
	* spamprobe.cc (process_extended_options): Added -o imap-mbx
	  option to process files as WU-IMAP MBX files rather than mbox
	  files.
	* MimeLineReader.cc (readLine): Added support for WU-IMAP MBX file
	  format.
	* spamprobe.cc (process_stream): Added -o tokenized option
	  to allow people to use an external tokenizer with spamprobe.
	* SpamFilter.cc (scoreToken): Reduced sorting overhead by
	  pre-computing and integer sort value with sorting priorities
	  reflected in the value.  This eliminates several calculations
	  inside of the sort routine.
	* SpamFilter.cc (computeRatio): Capped ratios in calculations to
	  within MIN_PROB and MAX_PROB.  Widened that range.  This avoids
	  problems with div/0 and makes it easier to sort terms.
	* spamprobe.cc (dump_words): dump command can now optionally
	  accept a regular expression as an argument and will only dump
	  terms matching the regular expression.
	  (purge_terms): Added purge-terms command to purge from the
	  database all terms matching a regular expression.
	* spamprobe.cc (main): Fixed bug in command line processing.
	  Thanks to Jem for bug report.
	* spamprobe.cc (train_on_message): Code simplified.  Eliminated
	  redundant recalculation of scores.
	  (train_on_message): Timestamps are now longer updated by
	  train-spam and train-good commands.  They are still updated by
	  train command.
	  (main): Fixed assertion if -P option is specified in a read only
	  operation.
	* spamprobe.cc (main): Added -C command line option to allow users
	  to specify their own min word count.
	* SpamFilter.cc (SpamFilter): Set default minimum word count back
	  to 5 (was 3).
	* spamprobe.cc (process_extended_options): Removed "alt-score"
	  from -o options list because it distributes scores poorly.  New
	  formula achieves the same end with better accuracy.  Added
	  "orig-score" option to allow people to continue using the old
	  formula.  Added "honor-xstatus-header" option for people whose
	  mail server uses X-Status: rather than Status: for the deleted
	  flag.
	  (main): Added -l command line option to allow people to set
	  their own spam threshold if they don't like the default value.
	* SpamFilter.cc (scoreMessage): Added a new scoring formula based
	  on Paul's but taking the nth root of spam and good probabilities
	  to produce more evenly distributed scores.  Lowered the spam
	  threshold to 0.6 to keep accuracy about the same as the original
	  formula.  Highest score seen for a ham so far in tests is 0.44
	  so 0.6 seems safe.  Made the new formula the default instead of
	  Paul's.
2004-11-18 12:46:53 +00:00
tv
c487cb967a Libtool fix for PR pkg/26633, and other issues. Update libtool to 1.5.10
in the process.  (More information on tech-pkg.)

Bump PKGREVISION and BUILDLINK_DEPENDS of all packages using libtool and
installing .la files.

Bump PKGREVISION (only) of all packages depending directly on the above
via a buildlink3 include.
2004-10-03 00:12:51 +00:00
grant
59fc51431b this needs a c and c++ compiler. 2004-06-17 11:54:40 +00:00
lukem
00aa32b525 add warning about older procmail versions 2004-03-11 14:13:51 +00:00
hubertf
fda3ac2657 Update spamprobe to 0.9h. ChangeLog:
* New manual page
* spamprobe.cc (process_stream): Added -o tokenized option
  to allow people to use an external tokenizer with spamprobe.
* SpamFilter.cc (scoreToken): Reduced sorting overhead by
  pre-computing and integer sort value with sorting priorities
  reflected in the value.  This eliminates several calculations
  inside of the sort routine.
* SpamFilter.cc (computeRatio): Capped ratios in calculations to
  within MIN_PROB and MAX_PROB.  Widened that range.  This avoids
  problems with div/0 and makes it easier to sort terms.
* spamprobe.cc (dump_words): dump command can now optionally
  accept a regular expression as an argument and will only dump
  terms matching the regular expression.
  (purge_terms): Added purge-terms command to purge from the
  database all terms matching a regular expression.

Patch submitted by Ossi Herrala <PGP: 0x78CD0337> in private email.
2004-02-03 20:49:34 +00:00
hubertf
47c3a9d519 Update spamprobe to 0.9g2.
Patch provided by Ossi Herrala <$MAINTAINER> in private mail.


CHANGED:
 * This release moves from Paul's original formula to a slightly modified
   one that yields more evenly distributed scores. To continue using old
   formula, use -o orig-score command line option.

ADDED:
 * -C <number> command line option. This tells SpamProbe to assign a default,
   somewhat neutral, probability to any term that does not have a weighted
   (good count doubled) count of at least specified number in the database.
   This prevents terms which have been seen only a few times from having
   an unreasonable influence on the score of an email containing them.
   Default count have changed. It is now 5. Old was 3.
 * Added -o <option_name> command line option to specify alternate way of
   scoring. Consult README.txt for more info.
 * Added -l <number> command line option. Changes the spam probability
   threshold for emails from the default (0.7) to <number>.
 * Added tokenize command. Prints the tokens found in the file one word
   per line in human readable format with spam probability, good count,
   spam count, message count, and word in columns separated by whitespace.

IMPROVED:
 * -H command line option to add more headers to scan.
 * Improved performance by removing some redundant calculations and
   reducing the amount of I/O in train-* mode.
2004-01-20 23:47:23 +00:00
lukem
5f8dccdcb5 Fix typos in the manual page. 2004-01-20 03:03:46 +00:00
hubertf
6a258147e8 Update to spamprobe-0.9e.
Too many changes to list, the old pkg was from 2002(!)

Update probided by ohub @ #NetBSD/IRCnet,
buildlink3 code OK'd by jlam.
2004-01-05 22:11:15 +00:00
grant
e204efe934 reorder lines to appease pkglint (hi, zuntum!) 2003-12-01 12:16:14 +00:00
grant
0155927c43 s/netbsd.org/NetBSD.org/ 2003-07-17 21:41:05 +00:00
wiz
b8737eca68 Bump PKGREVISION because of db4 shlib major bump. 2003-06-10 09:56:16 +00:00
jschauma
e366d0c694 Use tech-pkg@ in favor of packages@ as MAINTAINER for orphaned packages.
Should anybody feel like they could be the maintainer for any of thewe packages,
please adjust.
2003-06-02 01:15:31 +00:00
zuntum
bc7c2317c5 Initial import of mail/spamprobe (version 0.8b)
Spamprobe is a fast, intelligent, automatic spam detector using Bayesian
analysis of terms contained in emails. Works with procmail, maildrop or a
similar tool to produce a complete server or client side spam filtering
system.

Provided by Daniel Farrugia in PR#20286, buildlinkified by me.
2003-02-17 13:53:08 +00:00