DOKK / manpages / debian 10 / libplucene-perl / Plucene::Analysis::LetterTokenizer.3pm.en
Plucene::Analysis::LetterTokenizer(3pm) User Contributed Perl Documentation Plucene::Analysis::LetterTokenizer(3pm)

Plucene::Analysis::LetterTokenizer - Letter tokenizer

        # isa Plucene::Analysis::CharTokenizer

This is the letter tokenizer class, which divides text at non-letters.

Note: this does a decent job for most European languages, but does a terrible job for some Asian languages, where words are not separated by spaces

2018-04-02 perl v5.26.1