DOKK / manpages / debian 12 / opennlp / opennlp.1.en
OPENNLP(1) User Commands OPENNLP(1)

opennlp - CLI interface to Apache OpenNLP

OpenNLP 2.1.0. Usage: opennlp TOOL where TOOL is one of:

learned document categorizer
trainer for the learnable document categorizer
Measures the performance of the Doccat model with the reference data
K-fold cross validator for the learnable Document Categorizer
converts 20newsgroup data format to native OpenNLP format
learned language detector
trainer for the learnable language detector
converts leipzig data format to native OpenNLP format
K-fold cross validator for the learnable Language Detector
Measures the performance of the Language Detector model with the reference data
builds a new dictionary
character class tokenizer
learnable tokenizer
trainer for the learnable tokenizer
evaluator for the learnable tokenizer
K-fold cross validator for the learnable tokenizer
converts foreign data formats (irishsentencebank,ad,pos,masc,conllx,namefinder,parse,conllu) to native OpenNLP format
rule based detokenizer
learnable sentence detector
trainer for the learnable sentence detector
evaluator for the learnable sentence detector
K-fold cross validator for the learnable sentence detector
converts foreign data formats (nkjp,irishsentencebank,ad,pos,masc,conllx,namefinder,parse,moses,conllu,letsmt) to native OpenNLP format
learnable name finder
trainer for the learnable name finder
Measures the performance of the NameFinder model with the reference data
K-fold cross validator for the learnable Name Finder
converts foreign data formats (evalita,ad,conll03,bionlp2004,conll02,masc,muc6,ontonotes,brat) to native OpenNLP format
Converts 1990 US Census names into a dictionary
learnable part of speech tagger
trains a model for the part-of-speech tagger
Measures the performance of the POS tagger model with the reference data
K-fold cross validator for the learnable POS tagger
converts foreign data formats (ad,masc,conllx,parse,ontonotes,conllu) to native OpenNLP format
learnable lemmatizer
trainer for the learnable lemmatizer
Measures the performance of the Lemmatizer model with the reference data
learnable chunker
trainer for the learnable chunker
Measures the performance of the Chunker model with the reference data
K-fold cross validator for the chunker
converts ad data format to native OpenNLP format
performs full syntactic parsing
trains the learnable parser
Measures the performance of the Parser model with the reference data
converts foreign data formats (ontonotes,frenchtreebank) to native OpenNLP format
trains and updates the build model in a parser model
trains and updates the check model in a parser model
replaces the tagger model in a parser model
links an entity to an external data set
gives the probability and most probable next token(s) of a sequence of tokens in a language model

All tools print help when invoked with help parameter Example: opennlp SimpleTokenizer help

November 2022 opennlp 2.1.0