freebsd-ports/textproc/p5-Search-VectorSpace/pkg-descr
Erwin Lansing 16c825a5ea Add p5-Search-VectorSpace 0.02, a very basic vector-space search engine
perl module.

PR:		ports/73150
Submitted by:	clsung
2005-07-18 11:37:53 +00:00

12 lines
563 B
Text

This module takes a list of documents (in English) and
builds a simple in-memory search engine using a vector
space model. Documents are stored as PDL objects, and
after the initial indexing phase, the search should be
very fast. This implementation applies a rudimentary
stop list to filter out very common words, and uses a
cosine measure to calculate document similarity.
All documents above a user-configurable similarity
threshold are returned.
Author: Maciej Ceglowski <maciej AT ceglowski.com>
WWW: http://search.cpan.org/dist/Search-VectorSpace/