DOKK / manpages / debian 12 / python3-pynlpl / pynlpl-sampler.1.en
PYNLPL-SAMPLER(1) User Commands PYNLPL-SAMPLER(1)

sampler - manual page for pynlpl-sampler 0.7.7

usage: pynlpl-sampler [-h] [-t TESTSETSIZE] [-d DEVSETSITE] [-T TRAINSETSITE]

[-S SEED]
files [files ...]

Extracts random samples from datasets, supports multiple parallel datasets (such as parallel corpora), provided that corresponding data is on the same line.

The data sets to sample from, must be of equal size (i.e., same number of lines)

show this help message and exit
Test set size (lines) (default: 0)
Development set size (lines) (default: 0)
Training set size (lines), leave unassigned (0) to automatically use all of the remaining data (default: 0)
Seed for random number generator (default: 0)
February 2016 pynlpl-sampler 0.7.7