The following is from the web page:
Release notes for htdig-3.1.6 1 Feb 2002
As with previous releases, this version cleans up some remaining bugs and adds a few
heavily-requested features. As the latest stable release, it is recommended for all
production servers.
* Fixed another nasty security hole in htsearch, which would allow a denial of service
attack or forcing htsearch to read in config files outside of the configuration
directory.
* Fixed some problems with htmerge, including problems with words beginning with special
characters and merging multiple databases.
* Fixed a bug in handling hopcounts.
* Fixed problems in handling non-standard relative HTTP redirects.
* Fixed bugs in external parsers support including being confused by charset information
in the Content-Type header and handling binary output from external converters.
* Fixed bugs in the default English endings database. (Under ispell, it wasn't quite
intended for the accuracy needed for our usage.)
* Fixed additional bugs in the endings fuzzy algorithm.
* Fixed bugs with compiling with gcc-3.0 and later.
* Fixed bugs compiling and running on Mac OS X.
* Fixed problems with servers not returning a Last-Modified date--now assumes indexing
time as modification time.
* Fixed a variety of bugs in the HTML parser to more flexibly handle non-standard HTML.
* Fixed problems in the TCP connection code: htdig now times out more reliably when a
connection hangs, and retries bad connections several times before giving up.
* Added the -m "minimal" flag to htdig for only indexing a set list of URLs and made the
-l (log) flag the default behavior so that htdig will stop and restart automatically.
* Added htdump and htload programs for dumping ASCII representations of the databases
and reloading the same.
* Added support for htnotify to collect multiple URLs and allow easy customization of
notification messages, including the new attributes htnotify_replyto,
htnotify_webmaster, htnotify_prefix_file, and htnotify_suffix_file.
* Added a new "accents" fuzzy algorithm to morph accents, including the new accents_db
attribute.
* Added a 'list all' feature to htsearch with a query of '*' or the current
prefix_match_character.
* Added date restricted searching to htsearch including relative dates.
* Added documentation on running ht://Dig and the rundig script.
* Added METADESCRIPTION and NSTARS variables to the htsearch templates as well as
support for $=(var) template variable references.
* Added new config attributes to htsearch for restrict and exclude which work like the
normal htsearch form variables if the form variables are not set.
* Added many new attributes, including ignore_dead_servers, description_meta_tag_names,
max_keywords, translate_latin1, url_rewrite_rules, search_rewrite_rules,
anchor_target, ignore_alt_text, search_results_contenttype, boolean_keywords,
boolean_syntax_errors, multimatch_method, maximum_page_buttons, max_excerpts,
plural_suffix, any_keywords and use_doc_date.
* Extended the build_select_lists attribute to support select multiple, radio boxes and
checkboxes.
* Revised the documentation to make it clearer in parts, including the url_part_aliases
attribute.
* Updated various contributed utilities including doc2html, xmlsearch, rundig.sh,
htparsedoc, acroconv.pl, multidig, etc.
* A variety of other bug fixes, and many documentation updates. See the ChangeLog for
details.
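The Last-Modified fallback described above can be illustrated with a minimal Python
sketch. This is not htdig's code: the function name effective_mtime and the dict-based
headers argument are assumptions made for illustration only.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def effective_mtime(headers, indexed_at):
    """Return the Last-Modified time, or indexed_at when the header is
    missing or cannot be parsed (hypothetical helper, not htdig's API)."""
    raw = headers.get("Last-Modified")
    if raw:
        try:
            parsed = parsedate_to_datetime(raw)
        except (TypeError, ValueError):
            # Older Pythons raise TypeError, newer ones ValueError.
            parsed = None
        if parsed is not None:
            # Treat a date without zone information as UTC.
            if parsed.tzinfo is None:
                parsed = parsed.replace(tzinfo=timezone.utc)
            return parsed
    # No usable header: assume the indexing time is the modification time.
    return indexed_at
```

A caller would pass the time at which the document was fetched as indexed_at, so a
server that omits the header still yields a usable date.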
Convert most MESSAGE files to new syntax (${VARIABLE} gets replaced,
not @VARIABLE@, nor @@VARIABLE@@).
By default, substitutions are done for LOCALBASE, PKGNAME, PREFIX,
X11BASE, X11PREFIX; additional patterns can be added via MESSAGE_SUBST.
Clean up some packages while I'm there; add RCS tags to most MESSAGEs.
Remove some uninteresting MESSAGEs.
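As a rough illustration of the new MESSAGE syntax, the Python sketch below replaces
${VARIABLE} for the default names listed above and leaves the old @VARIABLE@ forms
untouched. It is not the packaging system's implementation; substitute_message, its
extra_vars parameter (standing in for MESSAGE_SUBST), and the sample values are
assumptions.

```python
import re

# Names substituted by default, as listed in the commit message.
DEFAULT_VARS = ("LOCALBASE", "PKGNAME", "PREFIX", "X11BASE", "X11PREFIX")


def substitute_message(text, values, extra_vars=()):
    """Replace ${NAME} for allowed names that have a value (hypothetical helper).

    extra_vars plays the role of additional patterns added via MESSAGE_SUBST.
    """
    allowed = set(DEFAULT_VARS) | set(extra_vars)

    def repl(match):
        name = match.group(1)
        if name in allowed and name in values:
            return values[name]
        # Unknown or unset names are left exactly as written.
        return match.group(0)

    return re.sub(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}", repl, text)


# Sample values are made up for the example.
print(substitute_message(
    "Installed ${PKGNAME} under ${PREFIX}; @PREFIX@ is not replaced.",
    {"PKGNAME": "htdig-3.1.6", "PREFIX": "/usr/pkg"},
))
```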
Changes:
* Fixed a nasty security hole in htsearch, which would allow users to
view any file on your site that had read permission.
* Fixed a bug that could cause problems with 8-bit characters on some
systems.
* Made some attempts to get htsearch's output to be more HTML 4.0
compliant. It quotes all HTML tag parameters, and uses ";" instead of
"&" as parameter separator in URLs for next pages. Reserved characters
in parameters are now encoded.
* Fixed handling of SGML entities: htdig will still decode them to store
as single characters in the database, but htsearch now encodes some of
them back for compliant results.
* Added two new formats for variables in htsearch templates, $%(var),
which escapes the variable for a URL, and $&(var), which HTML-escapes
the variable as necessary.
* Fixed htdig's handling of robots.txt, such that only the first
applicable User-agent field bearing its name will be used, rather than
only the last.
* Fixed htdig's handling of servers that return 2-digit years.
* Fixed handling of embedded quotes in quoted string lists.
* Fixed handling of relative URLs with trailing ".." or leading "//".
* Fixed handling of the valid_extensions attribute, which sometimes
failed in the previous version.
* Enhanced the handling of local filesystem indexing with the local_urls,
local_user_urls or local_default_doc attributes, which now allow
multiple directory or file names to be tried.
* Added the build_select_lists attribute to allow the config file to
specify <select> form elements in htsearch output as a template
variable, much like $(SORT) and $(METHOD).
* Added support for two additional configuration attributes:
max_keywords, and nph.
* A variety of other bug fixes, and many documentation updates. See the
ChangeLog for details, or the release notes at
<http://www.htdig.org/RELEASE.html>.
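The difference between the plain $(var) form and the two new template formats,
$%(var) and $&(var), can be sketched in Python as follows. This only mimics the idea;
the exact set of characters htsearch escapes may differ, and the render function, the
WORDS variable, and the empty-string default for unknown variables are assumptions.

```python
import html
import re
from urllib.parse import quote


def render(template, variables):
    """Expand $(var), $%(var) and $&(var) references (illustrative only)."""

    def repl(match):
        flag, name = match.group(1), match.group(2)
        value = variables.get(name, "")  # assumption: unknown names expand to ""
        if flag == "%":
            # URL-escape: safe to embed in a query string.
            return quote(value, safe="")
        if flag == "&":
            # HTML-escape: safe to embed in page text or attribute values.
            return html.escape(value, quote=True)
        return value

    return re.sub(r"\$([%&]?)\((\w+)\)", repl, template)


variables = {"WORDS": 'a&b "c"'}
print(render("$(WORDS)", variables))
print(render("$%(WORDS)", variables))
print(render("$&(WORDS)", variables))
```

The URL-escaped form suits links to further result pages, while the HTML-escaped form
suits text echoed back into the page.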
To the package
- Remove NOT_FOR_PLATFORM, as I failed to duplicate the reported problem
(using a mac68k-1.3H system).
- Move htsearch to libexec/cgi-bin, where apache might look for it.
- Don't extract included db-2.6.4 directory.
- Add comment to patch for configure.in, explaining how to reconstruct
patch to configure. Committing them both eases maintenance and permits
building without extracting htdig-*/db, and without autoconf.
suggestions by Kimmo Suominen.
- Disable build on NetBSD-1.3[A-J] (needs libstdc++).
- Use databases/db (currently db-2.7.3), instead of
included db-2.6.4.