mlpack_preprocess_describe(1) | User Commands | mlpack_preprocess_describe(1) |
mlpack_preprocess_describe - descriptive statistics
mlpack_preprocess_describe -i string [-d int] [-P bool] [-p int] [-r bool] [-V bool] [-w int] [-h -v]
This utility takes a dataset and prints descriptive statistics for it. Descriptive statistics is the discipline of quantitatively describing the main features of a collection of information, or the quantitative description itself. The program does not modify the original file; it prints the statistics to the console as a table.
Optionally, the width and precision of the output can be adjusted using the '--width (-w)' and '--precision (-p)' parameters. If there are too many dimensions, a specific dimension can be selected for analysis with the '--dimension (-d)' parameter. The '--population (-P)' parameter can be specified when the dataset should be treated as a population; otherwise, the dataset is treated as a sample.
For example, to print descriptive statistics for the dataset 'X.csv' using the default settings, we could run
$ mlpack_preprocess_describe --input_file X.csv --verbose
To set the width to 10 and the precision to 5, and to treat the dataset as a population, we could run
$ mlpack_preprocess_describe --input_file X.csv --width 10 --precision 5 --population --verbose
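To illustrate the difference between treating a dataset as a population and as a sample, the following is a minimal sketch using awk. It assumes the usual convention that a population variance divides by n and a sample variance divides by n - 1. The helper name 'variances' and the input values are hypothetical, and the sketch is not the mlpack implementation.

```shell
# Hypothetical helper: reads one number per line from standard input and
# prints the variance under both conventions.
variances() {
  LC_ALL=C awk '
    { sum += $1; sumsq += $1 * $1; n++ }
    END {
      mean = sum / n
      ss = sumsq - n * mean * mean   # sum of squared deviations from the mean
      # Population variance divides by n; sample variance divides by n - 1.
      printf "population_var=%.4f sample_var=%.4f\n", ss / n, ss / (n - 1)
    }'
}

# Made-up values for a single dimension.
printf '%s\n' 2 4 4 4 5 5 7 9 | variances
```

For these eight values the sum of squared deviations is 32, so the population variance is 32 / 8 and the sample variance is 32 / 7. The gap between the two narrows as the number of points grows.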
For further information, including relevant papers, citations, and theory, consult the documentation found at http://www.mlpack.org or included with your distribution of mlpack.
18 November 2018 | mlpack-3.0.4 |