opennlp - CLI interface to Apache OpenNLP
OpenNLP 2.1.0. Usage: opennlp TOOL where TOOL is one of:
- Doccat
- learned document categorizer
- DoccatTrainer
- trainer for the learnable document categorizer
- DoccatEvaluator
- Measures the performance of the Doccat model with the reference data
- DoccatCrossValidator
- K-fold cross validator for the learnable Document Categorizer
- DoccatConverter
- converts 20newsgroup data format to native OpenNLP format
- LanguageDetector
- learned language detector
- LanguageDetectorTrainer
- trainer for the learnable language detector
- LanguageDetectorConverter
- converts leipzig data format to native OpenNLP format
- LanguageDetectorCrossValidator
- K-fold cross validator for the learnable Language Detector
- LanguageDetectorEvaluator
- Measures the performance of the Language Detector model with the reference
data
- DictionaryBuilder
- builds a new dictionary
- SimpleTokenizer
- character class tokenizer
- TokenizerME
- learnable tokenizer
- TokenizerTrainer
- trainer for the learnable tokenizer
- TokenizerMEEvaluator
- evaluator for the learnable tokenizer
- TokenizerCrossValidator
- K-fold cross validator for the learnable tokenizer
- TokenizerConverter
- converts foreign data formats
(irishsentencebank,ad,pos,masc,conllx,namefinder,parse,conllu) to native
OpenNLP format
- DictionaryDetokenizer
- rule based detokenizer
- SentenceDetector
- learnable sentence detector
- SentenceDetectorTrainer
- trainer for the learnable sentence detector
- SentenceDetectorEvaluator
- evaluator for the learnable sentence detector
- SentenceDetectorCrossValidator
- K-fold cross validator for the learnable sentence detector
- SentenceDetectorConverter
- converts foreign data formats
(nkjp,irishsentencebank,ad,pos,masc,conllx,namefinder,parse,moses,conllu,letsmt)
to native OpenNLP format
- TokenNameFinder
- learnable name finder
- TokenNameFinderTrainer
- trainer for the learnable name finder
- TokenNameFinderEvaluator
- Measures the performance of the NameFinder model with the reference
data
- TokenNameFinderCrossValidator
- K-fold cross validator for the learnable Name Finder
- TokenNameFinderConverter
- converts foreign data formats
(evalita,ad,conll03,bionlp2004,conll02,masc,muc6,ontonotes,brat) to native
OpenNLP format
- CensusDictionaryCreator
- Converts 1990 US Census names into a dictionary
- POSTagger
- learnable part of speech tagger
- POSTaggerTrainer
- trains a model for the part-of-speech tagger
- POSTaggerEvaluator
- Measures the performance of the POS tagger model with the reference
data
- POSTaggerCrossValidator
- K-fold cross validator for the learnable POS tagger
- POSTaggerConverter
- converts foreign data formats (ad,masc,conllx,parse,ontonotes,conllu) to
native OpenNLP format
- LemmatizerME
- learnable lemmatizer
- LemmatizerTrainerME
- trainer for the learnable lemmatizer
- LemmatizerEvaluator
- Measures the performance of the Lemmatizer model with the reference
data
- ChunkerME
- learnable chunker
- ChunkerTrainerME
- trainer for the learnable chunker
- ChunkerEvaluator
- Measures the performance of the Chunker model with the reference data
- ChunkerCrossValidator
- K-fold cross validator for the chunker
- ChunkerConverter
- converts ad data format to native OpenNLP format
- Parser
- performs full syntactic parsing
- ParserTrainer
- trains the learnable parser
- ParserEvaluator
- Measures the performance of the Parser model with the reference data
- ParserConverter
- converts foreign data formats (ontonotes,frenchtreebank) to native OpenNLP
format
- BuildModelUpdater
- trains and updates the build model in a parser model
- CheckModelUpdater
- trains and updates the check model in a parser model
- TaggerModelReplacer
- replaces the tagger model in a parser model
- EntityLinker
- links an entity to an external data set
- NGramLanguageModel
- gives the probability and most probable next token(s) of a sequence of
tokens in a language model
All tools print help when invoked with help parameter Example:
opennlp SimpleTokenizer help