SAMBAMBA-MARKDUP(1)
sambamba-markdup - finding duplicate reads in a BAM file
sambamba markdup OPTIONS <input.bam> <output.bam>
Marks (by default) or removes duplicate reads. To decide whether a read is a duplicate, the tool uses the same "sum of base qualities" method as Picard. See the Picard metric definitions: https://broadinstitute.github.io/picard/picard-metric-definitions.html
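The "sum of base qualities" idea can be illustrated with a minimal sketch. This is not sambamba's or Picard's actual code. It assumes that the reads in `group` have already been found to share the same alignment position and strand, and that only base qualities of at least 15 count toward the score (the threshold is an assumption here, exposed as `MIN_QUAL`). The function names `quality_score` and `mark_duplicates` are hypothetical.

```python
from typing import Dict, List

# Assumption: only base qualities at or above this threshold are summed.
MIN_QUAL = 15


def quality_score(quals: List[int], min_qual: int = MIN_QUAL) -> int:
    """Sum the base qualities of one read, ignoring values below min_qual."""
    return sum(q for q in quals if q >= min_qual)


def mark_duplicates(group: Dict[str, List[int]]) -> Dict[str, bool]:
    """Map each read name to True if it is treated as a duplicate.

    `group` holds reads that were already placed at the same position
    and strand. The read with the highest score is kept as the
    representative, and every other read is flagged as a duplicate.
    On a tie, the read that appears first in `group` is kept.
    """
    best = max(group, key=lambda name: quality_score(group[name]))
    return {name: name != best for name in group}


if __name__ == "__main__":
    reads = {
        "read_a": [30, 30, 10],
        "read_b": [40, 40, 2],
        "read_c": [14, 14, 14],
    }
    for name, is_dup in mark_duplicates(reads).items():
        print(name, "duplicate" if is_dup else "kept")
```

In this toy group, `read_b` has the highest score and is kept, while the other two reads are flagged.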
External sort is not implemented, so all the data needed for duplicate detection is held in memory. Memory consumption therefore grows by about 2 GB for every 100 million reads. Check that you have enough RAM before running the tool.
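The rule of thumb above translates into a quick back-of-the-envelope check. The sketch below simply scales the stated figure (2 GB per 100 million reads) linearly. The function name `estimated_memory_gb` is hypothetical, the read count has to be obtained separately, and actual usage may differ from this estimate.

```python
# Figure taken from the text above: about 2 GB per 100 million reads.
GB_PER_100M_READS = 2.0
READS_PER_UNIT = 100_000_000


def estimated_memory_gb(n_reads: int) -> float:
    """Linearly scale the documented rule of thumb to n_reads."""
    return n_reads / READS_PER_UNIT * GB_PER_100M_READS


if __name__ == "__main__":
    for n in (100_000_000, 250_000_000, 1_000_000_000):
        print(f"{n:>13,d} reads: about {estimated_memory_gb(n):.1f} GB")
```

For example, a BAM file with 250 million reads would need roughly 5 GB under this estimate.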
February 2015