SLMSEG(1) | User Contributed Perl Documentation | SLMSEG(1) |
slmseg - maximum matching segment Chinese text.
slmseg -d dict_file [option]... [corpus_file]...
slmseg is a tool for segmenting Chinese text into words using maximum matching algorithm. slmseg segments corpus_file, or standard input if no filename is specified, and write the segmented result to standard output.
Under binary mode, consecutive id of 0 are merged into one 0. Under text mode, no space are inserted between unknown-words.
Originally written by Phill.Zhang <phill.zhang@sun.com>. Currently maintained by Kov.Chai <tchaikov@gmail.com>.
mmseg(1), ids2ngram (1).
2020-11-19 | perl v5.32.0 |