Commit graph

2 commits

Author SHA1 Message Date
Thierry Thomas
3a00ffa77e Import patches (imported from OpenOffice.org 2.3 by Fedora, at least in
part). These patches, released under a BSD license, seem to improve the
accuracy of language detection, especially those that don't have a
Latin script.
2007-08-23 22:13:35 +00:00
Thierry Thomas
630a0b255a Libtextcat is a library with functions that implement the classification
technique described in Cavnar & Trenkle, "N-Gram-Based Text Categorization".
It was primarily developed for language guessing, a task on which it is known to
perform with near-perfect accuracy.

WWW: http://software.wise-guys.nl/libtextcat/
2006-12-04 21:45:23 +00:00