DOKK / manpages / debian 11 / mapsembler2 / mapsembler2_kissreads.1.en
MAPSEMBLER2_KISSREADS(1) User Commands MAPSEMBLER2_KISSREADS(1)

mapsembler2_kissreads - Checks for each sequence contained into the toCheck.fasta if it is read coherent with reads from readsA.fasta or readsB.fasta

mapsembler2_kissreads <toCheck.fasta> <readsC1.fasta> [<readsC2.fasta> [<readsC3.fasta] ...] [-k value] [-c value] [-d value] [-O value] [-o name] [-u name] [-n] [-I] [-i index_stride] [-m align_file] [-s] [-f] [-h]

Checks for each sequence contained into the toCheck.fasta if it is read coherent (each position is covered by at least "min_coverage" read(s)) with reads from readsA.fasta or readsB.fasta A sequence s from toCheck is treated as follow:

if (s coherent with at least one read set): output the sequence as follows
>original fasta comment|C1:min<avg-corr_avg<max|C2:min<avg-cor_avg<max|C3...: >s
With A:min<avg-cor_avg<max standing for : value of the position having minimal coverage in readsA.fasta < average coverage in readsA.fasta - R-squarred corrected average in readsA.fa < value of the position having maximal coverage in readsA.fasta The coverage is the number of reads that perfectly mapped a position Any other situation (s not coherent with any): couple non read coherent, not outputed

-k size_seed: will use seeds of length size_seed. Default: 25.

-O minimal_read_overlap: a read can be mapped if its overlap is a least "minimal_read_overlap". Default: k

-c min_coverage: a sequence is covered by at least min_coverage coherent reads. Default: 2

-d max_substitutions: Maximal number of substitutions authorized between a read and a fragment. Note that no substitution is allowed on the central position while anaylizing the kissnp output. Default: 1.

-o file_name: write read-coherent outputs. Default: standard output

-u file_name: write unread-coherent outputs. Default: standard output

-n the input file (toCheck.fasta) is a kissnp output (incompatible with -I option)

in this case: 1/ only the upper characters are considered (no mapping done on the extensions) and 2/ the central position (where the SNP occurs) is strictly mapped, no subsitution is authorized on this position.

-I the input file (toCheck.fasta) is an Intl output (incompatible with -n option)

-i index_stride (int value). This is a heuristic for limiting the memory footprint. Instead of indexing each kmer of the sequences contained into the toCheck.fasta, kissreads indexes kmers occurring each "index_stride" position. Default = 1 (no heuristic)

-t max number of threads (also limited by number of input files)

-m align_file, write a file of reads mapped to sequences in file align_file

-s silent mode

-f outputs coherent events in a standard fasta file format

-h prints this message and exit

This manpage was written by Andreas Tille for the Debian distribution and can be used for any other usage of the program.

October 2018 mapsembler2_kissreads 2.2.4