70a7b10d36
rmmseg-cpp is a high performance Chinese word segmentation utility for Ruby. WWW: http://rmmseg-cpp.rubyforge.org
16 lines
734 B
Text
16 lines
734 B
Text
rmmseg-cpp is a high performance Chinese word segmentation utility for
|
|
Ruby. It features full "Ferret":http://ferret.davebalmain.com/ integration
|
|
as well as support for normal Ruby program usage.
|
|
|
|
rmmseg-cpp is a re-written of the original
|
|
RMMSeg(http://rmmseg.rubyforge.org/) gem in C++. RMMSeg is written
|
|
in pure Ruby. Though I tried hard to tweak RMMSeg, it just consumes
|
|
lots of memory and the segmenting process is rather slow.
|
|
|
|
The interface is almost identical to RMMSeg but the performance is
|
|
much better. This gem is always preferable in production
|
|
use. However, if you want to understand how the MMSEG segmenting
|
|
algorithm works, the source code of RMMSeg is a better choice than
|
|
this.
|
|
|
|
WWW: http://rmmseg-cpp.rubyforge.org
|