mlpack_lsh(1) | User Commands | mlpack_lsh(1) |
mlpack_lsh - k-approximate-nearest-neighbor search with lsh
mlpack_lsh [-B int] [-H double] [-m unknown] [-k int] [-T int] [-K int] [-q string] [-r string] [-S int] [-s int] [-L int] [-t string] [-V bool] [-d string] [-n string] [-M unknown] [-h -v]
This program will calculate the k approximate-nearest-neighbors of a set of points using locality-sensitive hashing. You may specify a separate set of reference points and query points, or just a reference set which will be used as both the reference and query set.
For example, the following will return 5 neighbors from the data for each point in 'input.csv' and store the distances in 'distances.csv' and the neighbors in 'neighbors.csv':
$ lsh --k 5 --reference_file input.csv --distances_file distances.csv --neighbors_file neighbors.csv
The output is organized such that row i and column j in the neighbors output corresponds to the index of the point in the reference set which is the j'th nearest neighbor from the point in the query set with index i. Row j and column i in the distances output file corresponds to the distance between those two points.
Because this is approximate-nearest-neighbors search, results may be different from run to run. Thus, the '--seed (-s)' parameter can be specified to set the random seed.
This program also has many other parameters to control its functionality; see the parameter-specific documentation for more information.
For further information, including relevant papers, citations, and theory, consult the documentation found at http://www.mlpack.org or included with your distribution of mlpack.
18 November 2018 | mlpack-3.0.4 |