BAMINTERVALCOMMENT(1) | General Commands Manual | BAMINTERVALCOMMENT(1) |
bamintervalcomment - sort BAM files by coordinate or query name
bamintervalcomment [options]
bamintervalcomment reads a BAM, SAM or CRAM file and a file containing a list of named intervals, marks each line in the input with the list of all matching intervals and stores the resulting file in BAM, SAM or CRAM format. The intervals file needs to be given using the intervals key. The file can be either plain or compressed using gzip. The intervals file is expected to contain one interval per line. Each line is assumed to contain a tab separated list of values, where the following columns are used by the program:
For each alignment the matching interval designators are stored in the CO (comment) auxiliary field in the form of a semicolon separated list, where each list element is a pair (A,B) given the two id columns of the respective interval. An example of an interval file can be found at http://hgdownload.cse.ucsc.edu/goldenPath/hg19/database/refFlat.txt.gz .
The following key=value pairs can be given when running the program:
level=<-1|0|1|9|11>: set compression level of the output BAM file. Valid values are
If libmaus has been compiled with support for igzip (see https://software.intel.com/en-us/articles/igzip-a-high-performance-deflate-compressor-with-optimizations-for-genomic-data) then an additional valid value is
verbose=<1>: Valid values are
tmpfile=<filename>: set the prefix for temporary file names
disablevalidation=<0|1>: sets whether input validation is performed. Valid values are
md5=<0|1>: md5 checksum creation for output file. This option can only be given if outputformat=bam. Then valid values are
md5filename file name for md5 checksum if md5=1.
index=<0|1>: compute BAM index for output file. This option can only be given if outputformat=bam. Then valid values are
indexfilename file name for output BAM index if index=1.
inputformat=<bam>: input file format. All versions of bamintervalcomment come with support for the BAM input format. If the program in addition is linked to the io_lib package, then the following options are valid:
outputformat=<bam>: output file format. All versions of bamintervalcomment come with support for the BAM output format. If the program in addition is linked to the io_lib package, then the following options are valid:
I=<[stdin]>: input filename, standard input if unset.
O=<[stdout]>: output filename, standard output if unset.
inputthreads=<[1]>: input helper threads, only valid for inputformat=bam.
outputthreads=<[1]>: output helper threads, only valid for outputformat=bam.
reference=<[]>: reference FastA file for inputformat=cram and outputformat=cram. An index file (.fai) is required.
range=<>: input range to be processed. This option is only valid if the input is a coordinate sorted and indexed BAM file
intervals=<>: file name of intervals file
Written by German Tischler.
Report bugs to <germant@miltenyibiotec.de>
Copyright © 2009-2014 German Tischler, © 2011-2014
Genome Research Limited. License GPLv3+: GNU GPL version 3
<http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it. There is NO
WARRANTY, to the extent permitted by law.
June 2014 | BIOBAMBAM |