freebsd-ports/textproc/pystemmer/distinfo
Martin Wilke abb5037267 PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed; hopefully representing a common
linguistic base form. This is most useful in building search engines
and information retrieval software; for example, a search with stemming
enabled should be able to find a document containing "cycling" given the
query "cycles".

PyStemmer provides algorithms for several (mainly european) languages,
by wrapping the libstemmer library from the Snowball project in a Python
module. It also provides access to the classic Porter stemming algorithm
for english: although this has been superceded by an improved algorithm,
the original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.

WWW:	http://pypi.python.org/pypi/PyStemmer/

PR:		ports/132695
Submitted by:	Wen Heping <wenheping at gmail.com>
2009-03-16 21:47:30 +00:00

3 lines
202 B
Text

MD5 (PyStemmer-1.0.1.tar.gz) = 5fb09517161de1792b2fc1fd8844038b
SHA256 (PyStemmer-1.0.1.tar.gz) = 08f0d64e7c49b77cb0f44937c25a04ed17ea1b9db3d830d47a02d2717ae89ebe
SIZE (PyStemmer-1.0.1.tar.gz) = 331212