freebsd-ports/chinese/p5-Lingua-ZH-WordSegmenter/pkg-descr
Stefan Eßer fb16dfecae Remove WWW entries moved into port Makefiles
Commit b7f05445c0 has added WWW entries to port Makefiles based on
WWW: lines in pkg-descr files.

This commit removes the WWW: lines of moved-over URLs from these
pkg-descr files.

Approved by:		portmgr (tcberner)
2022-09-07 23:58:51 +02:00

This is a Perl implementation of simplified Chinese word segmentation.
At each point the segmenter searches for the longest word from both the left
and the right direction, and chooses the candidate with the higher frequency
product.
The original program is from the CPAN module Lingua::ZH::WordSegment
(https://metacpan.org/author/CHENYR). I made the following changes: 1) made the
interface object oriented; 2) converted the internal strings to UTF-8; 3) used
Sogou's dictionary (http://www.sogou.com/labs/dl/w.html) as the default
dictionary.
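
The matching strategy described above can be illustrated with a minimal
sketch in Python. It is not the module's code: the function names, the tiny
demo dictionary, the relative-frequency values, and the fallback frequency
for unknown single characters are all assumptions made for illustration, and
the sketch compares the two complete segmentations rather than deciding at
each point.

```python
from math import prod


def forward_match(text, freq, max_len):
    """Greedy longest match scanning left to right."""
    words, i = [], 0
    while i < len(text):
        for n in range(min(max_len, len(text) - i), 0, -1):
            # Fall back to a single character when no dictionary word fits.
            if n == 1 or text[i:i + n] in freq:
                words.append(text[i:i + n])
                i += n
                break
    return words


def backward_match(text, freq, max_len):
    """Greedy longest match scanning right to left."""
    words, j = [], len(text)
    while j > 0:
        for n in range(min(max_len, j), 0, -1):
            if n == 1 or text[j - n:j] in freq:
                words.append(text[j - n:j])
                j -= n
                break
    words.reverse()
    return words


def segment(text, freq, unknown=1e-8):
    """Return whichever segmentation has the higher frequency product.

    `unknown` is an assumed frequency for words missing from `freq`.
    """
    max_len = max(map(len, freq), default=1)
    candidates = [
        forward_match(text, freq, max_len),
        backward_match(text, freq, max_len),
    ]
    return max(candidates, key=lambda ws: prod(freq.get(w, unknown) for w in ws))


# Hypothetical relative frequencies, for demonstration only.
demo_freq = {"研究": 0.002, "研究生": 0.001, "生命": 0.002, "起源": 0.001}
result = segment("研究生命起源", demo_freq)
```

With this demo dictionary the left-to-right pass yields 研究生 / 命 / 起源 and
the right-to-left pass yields 研究 / 生命 / 起源; the second has the higher
frequency product, so it is the one returned.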