Commit graph

11 commits

Author SHA1 Message Date
grant
59fc51431b this needs a c and c++ compiler. 2004-06-17 11:54:40 +00:00
lukem
00aa32b525 add warning about older procmail versions 2004-03-11 14:13:51 +00:00
hubertf
fda3ac2657 Update spamprobe to 0.9h. ChangeLog:
* New manual page
* spamprobe.cc (process_stream): Added -o tokenized option
  to allow people to use an external tokenizer with spamprobe.
* SpamFilter.cc (scoreToken): Reduced sorting overhead by
  pre-computing and integer sort value with sorting priorities
  reflected in the value.  This eliminates several calculations
  inside of the sort routine.
* SpamFilter.cc (computeRatio): Capped ratios in calculations to
  within MIN_PROB and MAX_PROB.  Widened that range.  This avoids
  problems with div/0 and makes it easier to sort terms.
* spamprobe.cc (dump_words): dump command can now optionally
  accept a regular expression as an argument and will only dump
  terms matching the regular expression.
  (purge_terms): Added purge-terms command to purge from the
  database all terms matching a regular expression.

Patch submitted by Ossi Herrala <PGP: 0x78CD0337> in private email.
2004-02-03 20:49:34 +00:00
hubertf
47c3a9d519 Update spamprobe to 0.9g2.
Patch provided by Ossi Herrala <$MAINTAINER> in private mail.


CHANGED:
 * This release moves from Paul's original formula to a slightly modified
   one that yields more evenly distributed scores. To continue using old
   formula, use -o orig-score command line option.

ADDED:
 * -C <number> command line option. This tells SpamProbe to assign a default,
   somewhat neutral, probability to any term that does not have a weighted
   (good count doubled) count of at least specified number in the database.
   This prevents terms which have been seen only a few times from having
   an unreasonable influence on the score of an email containing them.
   Default count have changed. It is now 5. Old was 3.
 * Added -o <option_name> command line option to specify alternate way of
   scoring. Consult README.txt for more info.
 * Added -l <number> command line option. Changes the spam probability
   threshold for emails from the default (0.7) to <number>.
 * Added tokenize command. Prints the tokens found in the file one word
   per line in human readable format with spam probability, good count,
   spam count, message count, and word in columns separated by whitespace.

IMPROVED:
 * -H command line option to add more headers to scan.
 * Improved performance by removing some redundant calculations and
   reducing the amount of I/O in train-* mode.
2004-01-20 23:47:23 +00:00
lukem
5f8dccdcb5 Fix typos in the manual page. 2004-01-20 03:03:46 +00:00
hubertf
6a258147e8 Update to spamprobe-0.9e.
Too many changes to list, the old pkg was from 2002(!)

Update probided by ohub @ #NetBSD/IRCnet,
buildlink3 code OK'd by jlam.
2004-01-05 22:11:15 +00:00
grant
e204efe934 reorder lines to appease pkglint (hi, zuntum!) 2003-12-01 12:16:14 +00:00
grant
0155927c43 s/netbsd.org/NetBSD.org/ 2003-07-17 21:41:05 +00:00
wiz
b8737eca68 Bump PKGREVISION because of db4 shlib major bump. 2003-06-10 09:56:16 +00:00
jschauma
e366d0c694 Use tech-pkg@ in favor of packages@ as MAINTAINER for orphaned packages.
Should anybody feel like they could be the maintainer for any of thewe packages,
please adjust.
2003-06-02 01:15:31 +00:00
zuntum
bc7c2317c5 Initial import of mail/spamprobe (version 0.8b)
Spamprobe is a fast, intelligent, automatic spam detector using Bayesian
analysis of terms contained in emails. Works with procmail, maildrop or a
similar tool to produce a complete server or client side spam filtering
system.

Provided by Daniel Farrugia in PR#20286, buildlinkified by me.
2003-02-17 13:53:08 +00:00