f98ad549cc
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN, it determines how many of the known stopwords the document contains for each language supported by "Lingua::StopWords". Each word in the document recognized as stopword of a particular language scores one point for this language. The "language_guess()" function takes a document as a parameter and returns the abbreviation of the language that it is most likely written in. Author: Mike Schilli <cpan@perlmeister.com> WWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/ PR: ports/103571 Submitted by: Masahiro Teramoto <markun@onohara.to>
3 lines
227 B
Text
3 lines
227 B
Text
MD5 (Text-Language-Guess-0.02.tar.gz) = 66fbb68b17c3e62febbba633111f852e
|
|
SHA256 (Text-Language-Guess-0.02.tar.gz) = 12ef612c1de0451367d403db73723446b836e2e10adeec5e9386b7baa8ede12f
|
|
SIZE (Text-Language-Guess-0.02.tar.gz) = 5377
|