SAMBLASTER duplicate criteria: Within a group of read to be marked duplicate, how does samblaster pick the one to keep? Does it consider base qualities, read length, num mismatches to reference, etc.?
Alternatives:
MarkDuplicates (not Cigar one) -- does sorting on readname fix past memory usage issues? Does it still mark supplementals in that case?
https://broadinstitute.github.io/picard/command-line-overview.html#MarkDuplicatesWithMateCigar
bamUtil
sambamba -- FrickTobias/BLR#113
SAMBLASTER duplicate criteria: Within a group of read to be marked duplicate, how does samblaster pick the one to keep? Does it consider base qualities, read length, num mismatches to reference, etc.?
Alternatives:
MarkDuplicates (not Cigar one) -- does sorting on readname fix past memory usage issues? Does it still mark supplementals in that case?
https://broadinstitute.github.io/picard/command-line-overview.html#MarkDuplicatesWithMateCigar
bamUtil
sambamba -- FrickTobias/BLR#113