DOKK / manpages / debian 12 / seer / seer.1.en
SEER(1) User Commands SEER(1)

seer - sequence element enrichment analysis

Sequence Element Enrichment Analysis

The .pheno file format is tab separated, two columns with sample name, one with phenotype. Phenotypes of only 0 or 1 will be treated as binary, any other value and the phenotype will be treated as quantitative. Therefore for missing phenotype values the sample should simply be excluded from this file.

dsm kmer output file
.pheno metadata

mds values from kmds
file containing covariates
list of columns covariates to use. Format is 1,2q,3 (use q for quantitative)

number of threads. Suggested: 4

turn off all filtering and peform tests on all kmers input
maximum kmer length
minimum kmer frequency
minimum kmer occurrences. Overrides --maf
p-value threshold for initial chi squared test. Set to 1 to show all
p-value threshold for final logistic test. Set to 1 to show all

print lists of samples significant kmers were found in
prints version and exits
full help message

Basic usage:

seer -k dsm_input.txt.gz --pheno metadata.pheno > significant_kmers.txt

To use the kmds output, increase execution speed and give the most complete output

seer -k filtered.gz --pheno metadata.pheno --struct filtered.dsm --threads 4 --print_samples

This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.

March 2017 seer 1.1.2