freebsd-ports/textproc/p5-Search-VectorSpace/pkg-descr
Sunpoet Po-Chuan Hsieh ad4f267b7a Update WWW
search.cpan.org is shutting down.
It will redirect to metacpan.org after June 25, 2018.

With hat:	perl
2018-05-27 20:15:16 +00:00

11 lines
504 B
Text

This module takes a list of documents (in English) and
builds a simple in-memory search engine using a vector
space model. Documents are stored as PDL objects, and
after the initial indexing phase, the search should be
very fast. This implementation applies a rudimentary
stop list to filter out very common words, and uses a
cosine measure to calculate document similarity.
All documents above a user-configurable similarity
threshold are returned.
WWW: https://metacpan.org/release/Search-VectorSpace